Review on Multi Pitch Detection in Speech
Rupinder Kaur, Navdeep Kumar
Section:Review Paper, Product Type: Isroset-Journal
Vol.3, Issue.3, pp.6-10, May-2015
Online published on Jun 30, 2015
Copyright © Rupinder Kaur, Navdeep Kumar. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
How to Cite this Paper
IEEE Style Citation: Rupinder Kaur, Navdeep Kumar, “Review on Multi Pitch Detection in Speech,” International Journal of Scientific Research in Computer Science and Engineering, Vol.3, Issue.3, pp.6-10, 2015.
MLA Style Citation: Rupinder Kaur, Navdeep Kumar. "Review on Multi Pitch Detection in Speech." International Journal of Scientific Research in Computer Science and Engineering 3.3 (2015): 6-10.
APA Style Citation: Rupinder Kaur, Navdeep Kumar (2015). Review on Multi Pitch Detection in Speech. International Journal of Scientific Research in Computer Science and Engineering, 3(3), 6-10.
BibTeX Style Citation:
@article{Kaur_2015,
author = {Rupinder Kaur and Navdeep Kumar},
title = {Review on Multi Pitch Detection in Speech},
journal = {International Journal of Scientific Research in Computer Science and Engineering},
issue_date = {5 2015},
volume = {3},
Issue = {3},
month = {5},
year = {2015},
issn = {2347-2693},
pages = {6-10},
url = {https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=194},
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
UR - https://www.isroset.org/journal/IJSRCSE/full_paper_view.php?paper_id=194
TI - Review on Multi Pitch Detection in Speech
T2 - International Journal of Scientific Research in Computer Science and Engineering
AU - Rupinder Kaur
AU - Navdeep Kumar
PY - 2015
DA - 2015/06/30
PB - IJCSE, Indore, INDIA
SP - 6
EP - 10
IS - 3
VL - 3
SN - 2347-2693
ER -
Abstract:
This paper reviews research on multiple-pitch detection in audio signals, that is, the estimation of several fundamental frequencies (differently pitched sounds) present at the same time. It discusses the algorithms used to identify different speech signals spoken simultaneously. Although the technique serves various speech recognition applications, it is most widely used in music transcription.
Keywords / Index Terms:
Speech Recognition, MFCC, LPC, Pitch Tracking, Multiple Pitch Estimation