Prediction of Psychoacoustic Metrics Using Combination of Wavelet Packet Transform and an Optimized Artificial Neural Network

Mehdi POURSEIEDREZAEI; Ali LOGHMANI; Mehdi KESHMIRI

doi:10.24425/aoa.2019.129271

Authors

Mehdi POURSEIEDREZAEI, Isfahan University of Technology, Iran
Ali LOGHMANI, Isfahan University of Technology, Iran
Mehdi KESHMIRI, Isfahan University of Technology, Iran

Abstract

In this paper, a modified sound quality evaluation (SQE) model is developed based on a combination of an optimized artificial neural network (ANN) and the wavelet packet transform (WPT). The presented SQE model is a signal processing technique that can be implemented in current microphones to predict sound quality. The proposed method extracts objective psychoacoustic metrics, including loudness, sharpness, roughness, and tonality, from sound samples by using a special selection of multi-level WPT nodes combined with a trained ANN. The model is optimized using the particle swarm optimization (PSO) and back propagation (BP) algorithms. The results show that, among the models and software compared, the proposed model has the lowest mean square error, the highest correlation with human perception, and the lowest computational cost.

Keywords:

sound quality measurement, psychoacoustic metrics, wavelet packet transform, optimized artificial neural network
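
The feature-extraction idea in the abstract (decomposing a sound sample with the WPT and feeding node-level quantities to an ANN) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' method: the Haar wavelet, the use of node energies as features, the full single-level set of leaf nodes (the paper uses a special multi-level selection that the abstract does not specify), and the hypothetical function names `haar_step`, `wavelet_packet_nodes`, and `node_energies`.

```python
import math


def haar_step(x):
    """One orthonormal Haar analysis step: returns (approximation, detail)."""
    s = 1.0 / math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return approx, detail


def wavelet_packet_nodes(signal, level):
    """Full wavelet packet tree down to `level`.

    Unlike the plain wavelet transform, both the approximation and the
    detail branch are split again at every level. Returns the 2**level
    leaf nodes in filter-bank (natural) order.
    """
    if level < 0 or len(signal) % (2 ** level) != 0:
        raise ValueError("signal length must be a multiple of 2**level")
    nodes = [list(signal)]
    for _ in range(level):
        next_nodes = []
        for node in nodes:
            approx, detail = haar_step(node)
            next_nodes.extend([approx, detail])
        nodes = next_nodes
    return nodes


def node_energies(signal, level):
    """Energy (sum of squared coefficients) of each leaf node.

    Assumed feature choice: one energy value per node, which could serve
    as the input vector of a regression network.
    """
    return [sum(c * c for c in node)
            for node in wavelet_packet_nodes(signal, level)]


# Usage: an illustrative 64-sample tone decomposed to level 3 (8 nodes).
samples = [math.sin(2.0 * math.pi * 5.0 * n / 64.0) for n in range(64)]
features = node_energies(samples, 3)
print(len(features), "features; total energy =", sum(features))
```

Because the Haar step is orthonormal, the node energies sum to the energy of the input signal, which gives a simple sanity check on the decomposition. Note that the leaves come out in filter-bank order, which is not strictly increasing frequency order; a model that relies on band positions would need to account for that.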
