10.1515/aoa-2016-0056
Wavelet Packet Transform based Speech Enhancement via Two-Dimensional SPP Estimator with Generalized Gamma Priors
References
AudioMiCro, Free Industrial and Machinery Sound Effects, Retrived November 29th, 2015, from http://www.audiomicro.com/free-sound-effects/free-industrial-and-machinery/.
Bahoura M., Rouat J. (2006), Wavelet speech enhancement based on time-scale adaptation, Speech Communication, 48, 12, 1620–1637.
Bahoura M., Rouat J. (2001), Wavelet speech enhancement based on the teager energy operator, Signal Processing Letters, IEEE, 8, 1, 10–12.
Boll S.F. (1979), Suppression of acoustic noise in speech using spectral subtraction, Acoustics, Speech and Signal Processing, IEEE Transactions on, 27, 2,113–120.
Bovik A. Maragos C.P., Quatieri T.F. (1993), Am-fm energy detection and separation in noise using multiband energy operators, Signal Processing, IEEE Transactions on, 41, 12, 3245–3265.
Chang S.G., Yu B., Vetterli M. (2000), Adaptive wavelet thresholding for image denoising and compression, Image Processing, IEEE Transactions on, 9, 9, 1532–1546.
Cohen I., Berdugo B. (2001), Speech enhancement for non-stationary noise environments, Signal processing, 81, 11, 2403–2418.
Cohen I. (2003), Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging, Speech and Audio Processing, IEEE Transactions on, 11, 5, 466–475.
Cohen I. (2004), Speech enhancement using a noncausal a priori snr estimator, Signal Processing Letters, IEEE, 11, 9, 725–728.
Dunn R.B., Quatieri T.F., Kaiser J.F. (1993), Detection of transient signals using the energy operator, Acoustics, Speech, and Signal Processing, ICASSP., 1993 IEEE International Conference on, pp. 145–148.
Ephraim Y., Malah D. (1984), Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, Acoustics, Speech and Signal Processing, IEEE Transactions on, 32, 6, 1109–1121.
Ephraim Y., Van Trees H.L. (1995), A signal subspace approach for speech enhancement, Acoustics, Speech and Signal Processing, IEEE Transactions on, 3, 4, 251–266.
Erkelens J.S., Hendriks R.C., Heusdens R., Jensen J. (2007), Minimum mean-square error estimation of discrete fourier coe_cients with generalized gamma priors, Audio, Speech, and Language Processing, IEEE Transactions on, 15, 6, 1741–1752.
Fisher E., Tabrikian J., Dubnov S. (2006), Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model, Audio, Speech, and Language Processing, IEEE Transactions on, 14, 2, 502–510.
Gerkmann T., Breithaupt C., Martin R. (2008), Improved a posteriori speech presence probability estimation based on a likelihood ratio with fixed priors, Audio, Speech, and Language Processing, IEEE Transactions on, 16, 5, 910–919.
Ghanbari Y., Karami-Mollaei M. R. (2006), A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets, Speech communication, 48, 8, 927–940.
Hendriks R.C., Gerkmann T., Jensen J. (2013), Dft-domain based single-microphone noise reduction for speech enhancement: a survey of the state of the art, Synthesis Lectures on Speech and Audio Processing, 9, 1, 80–84.
Hu Y., Loizou P.C. (2004), Speech enhancement based on wavelet thresholding the multitaper spectrum, Speech and Audio Processing, IEEE Transactions on, 12 , 1, 59–67.
Hu Y., Loizou P.C. (2007), Subjective comparison and evaluation of speech enhancement algorithms, Speech communication, 49, 7, 588–601.
Johnson M.T., Yuan X., Ren Y. (2007), Speech signal enhancement through adaptive wavelet thresholding, Speech Communication, 49, 2, 123–133.
Kaiser J.F. (1993), Some useful properties of teager's energy operators, Acoustics, Speech, and Signal Processing, ICASSP-93, IEEE International Conference on, pp. 149–152.
Kandia V., Stylianou Y. (2006), Detection of sperm whale clicks based on the teager-kaiser energy operator, Applied Acoustics, 67, 11, 1144–1163.
Langner B., Black A.W. (2004), Creating a database of speech in noise for unit selection synthesis, Fifth ISCA Workshop on Speech Synthesis, 229–230.
Loizou P.C., Speech enhancement: theory and practice, CRC press, 2013.
Martin R. (2002), Speech enhancement using mmse short time spectral estimation with gamma distributed speech priors, Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference, pp. 253–256.
Martin R. (2005), Speech enhancement based on minimum mean-square error estimation and supergaussian priors, Speech and Audio Processing, IEEE Transactions on, 13, 5, 845–856.
Mohammadiha N., Martin R., Leijon A. (2013), Spectral domain speech enhancement using hmm state-dependent super-gaussian priors, Signal Processing Letters, IEEE, 20, 3, 253–256.
Park J., Kim J.-W., Chang J.-H., Jin Y.G., Kim N.S. (2015), Estimation of speech absence uncertainty based on multiple linear regression analysis for speech enhancement, Applied Acoustics, 87, 2015, 205–211.
Sanam T. F., Shahnaz C. (2013), Noisy speech enhancement based on an adaptive threshold and a modified hard thresholding function in wavelet packet domain, Digital Signal Processing, 23, 3, 941–951.
Scalart P. (1996), Speech enhancement based on a priori signal to noise estimation, Acoustics, Speech, and Signal Processing, ICASSP Conference Proceedings, IEEE International Conference on, pp. 629–632.
Simoncelli E. P., and Adelson E. H. (1996), Noise removal via bayesian wavelet coring, Image Processing Proceedings., International Conference on, pp.379-382.
Tasmaz H., Ercelebi E. (2008), Speech enhancement based on undecimated wavelet packet-perceptual flterbanks and mmse-stsa estimation in various noise environments, Digital Signal Processing, 18, 5, 797–812.
Weickert T., Benjaminsen C., Kiencke U. (2008), Analytic complex wavelet packets for speech enhancement, Acoustics, Speech and Signal Processing, ICASSP 2008. IEEE International Conference, pp. 3269–3272.
Ying G., Mitchell C., Jamieson L. (1993), Endpoint detection of isolated utterances based on a modified teager energy measurement, Acoustics, Speech, and Signal Processing, ICASSP-93, IEEE International Conference on, pp. 732–735.
DOI: 10.1515/aoa-2016-0056