Quality Evaluation of Speech Transmission via Two-way BPL-PLC Voice Communication System in an Underground Mine

Downloads

Authors

  • Przemysław FALKOWSKI-GILSKI Gdańsk University of Technology, Poland
  • Grzegorz DEBITA General Tadeusz Kosciuszko Military University of Land Forces, Poland

Abstract

In order to design a stable and reliable voice communication system, it is essential to know how many resources are necessary for conveying quality content. These parameters may include objective quality of service (QoS) metrics, such as: available bandwidth, bit error rate (BER), delay, latency as well as subjective quality of experience (QoE) related to user expectations. QoE is expressed as clarity of speech and the ability to interpret voice commands with adequate mean opinion score (MOS) grades. This paper describes a quality evaluation study of a two-way speech transmission system via bandwidth over power line – power line communication (BPL-PLC) technology in an operating underground mine. We investigate how different features of the available wired medium can affect end-user quality. The results of the described study include: two types of coupling (capacitive and inductive), two transmission modes (mode 1 and 11), and four language sets of speech samples (American English, British English, German, and Polish) encoded at three different bit rates (8, 16, and 24 kbps). Our findings can aid both researchers working on low-bit rate coding and compression, signal processing and speech perception, as well as professionals active in the mining and oil industry.

Keywords:

coding, communication applications, compression, signal processing, speech processing, quality of service

References

1. 3rd Generation Partnership Project [3GPP] (2011), Policy and charging control architecture, Technical specification group services and system aspects, 3GPP Technical Specification 23.203, https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=810., access: 21.06.2023.

2. Bernacki K., Wybrańczyk D., Zygmanowski M., Latko A., Michalak J., Rymarski Z. (2019), Disturbance and signal filter for power line communication, Electronics, 8(4): 378, https://doi.org/10.3390/electronics8040378.
3. Boz E., Finley B., Oulasvirta A., Kilkki K., Manner J. (2019), Mobile QoE prediction in the field, Pervasive and Mobile Computing, 59: 101039, https://doi.org/10.1016/j.pmcj.2019.101039.
4. Debita G. et al. (2020), Subjective and objective quality evaluation study of BPL-PLC wired medium, Elektronika ir Elektrotechnika, 26(3): 13–19, https://doi.org/10.5755/j01.eie.26.3.25794.

5. Debita G., Habrych M., Tomczyk A., Miedziński B., Wandzio J. (2019), Implementing BPL transmission in MV cable network effectively, Elektronika ir Elektrotechnika, 25(1): 59–65, https://doi.org/10.5755/j01.eie.25.1.22737.

6. Delcroix M. et al. (2019), End-to-end SpeakerBeam for single channel target speech recognition, [in:] INTERSPEECH 2019 – 21th Annual Conference of the International Speech Communication Association, pp. 451–455, https://doi.org/10.21437/Interspeech.2019-1856.

7. Ding S.Y., Liu J.L., Yue M.H. (2021), The use of ZigBee wireless communication technology in industrial automation control, Wireless Communications and Mobile Computing, 2021: 8317862, https://doi.org/10.1155/2021/8317862.

8. Dubey H., Sangwan A., Hansen J.H.L. (2019), Toeplitz inverse covariance based robust speaker clustering for naturalistic audio streams, [in:] INTERSPEECH 2019 – 21th Annual Conference of the International Speech Communication Association, pp. 416–420, https://doi.org/10.21437/Interspeech.2019-1102.

9. Falkowski-Gilski P. et al. (2020), Subjective quality evaluation of speech signals transmitted via BPLPLC wired system, [in:] INTERSPEECH 2020 – 22th Annual Conference of the International Speech Communication Association, pp. 4601–4605, https://doi.org/10.21437/Interspeech.2020-1077.
10. Falkowski-Gilski P., Uhl T. (2020), Current trends in consumption of multimedia content using online streaming platforms: A user-centric survey, Computer Science Review, 37(4): 100268, https://doi.org/10.1016/j.cosrev.2020.100268.

11. Falkowski-Gilski P. (2020), On the consumption ofmultimedia content using mobile devices: a year to year user case study, Archives of Acoustics, 45(2): 321–328, https://doi.org/10.24425/aoa.2020.133152.

12. Fallgren P., Malisz Z., Edlund J. (2019), How to annotate 100 hours in 45 minutes, [in:] INTERSPEECH 2019 – 21th Annual Conference of the International Speech Communication Association, pp. 341–345, https://doi.org/10.21437/Interspeech.2019-1648.

13. Fuchs G., Ashour C., Bäckström T. (2019), Superwideband spectral envelope modeling for speech coding, [in:] INTERSPEECH 2019 – 21th Annual Conference of the International Speech Communication Association, pp. 416–420, https://doi.org/10.21437/Interspeech.2019-1620.

14. Gibson J.D., Berger T., Lookabaugh T., Lindbergh D., Baker R.L. (1998), Digital Compression for Multimedia: Principles and Standards, Morgan Kaufmann, San Francisco.

15. Hao S., Zhang H.Y. (2021), A cross-layered theoretical model of IEEE 1901 power-line communication networks considering retransmission protocols, IEEE Access, 9: 28805–28821, https://doi.org/10.1109/ACCESS.2021.3059246.

16. Held G. (2016), Understanding Broadband Over Power Line, Auerbach Publications.

17. Helmrich C.R., Markovic G., Edler B. (2014), Improved low-delay MDCT-based coding of both stationary and transient audio signals, [in:] ICASSP 2014 – IEEE International Conference on Acoustic, Speech and Signal Processing, pp. 6954–6958, https://doi.org/10.1109/ICASSP.2014.6854948.

18. Hoßfeld T. et al. (2014), Best practices for QoE crowdtesting: QoE assessment with crowdsourcing, IEEE Transactions on Multimedia, 16(2): 541–558, https://doi.org/10.1109/TMM.2013.2291663.

19. International Telecommunication Union [ITU] (2003), General methods for the subjective assessment of sound quality, ITU Recommendation BS.1284, https://www.itu.int/rec/R-REC-BS.1284/en., access: 21.06.2023.

20. International Telecommunication Union [ITU] (2017), Test signals for telecommunication systems, ITU Recommendation P.501, https://www.itu.int/ITU-T/recommendations/rec.aspx?id=14271., access: 21.06.2023.

21. King M., Nirav D., Arvind A. (2012), Automatic generation of hardware/software interfaces, [in:] Association for Computing Machinery, 47(4): 325–336, https://doi.org/10.1145/2248487.2151011.
22. Korycki R. (2012), Detection of tampering in lossy compressed digital audio recordings, [in:] NTAV/SPA 2012 – New Trends in Audio and Video/Signal Processing: Algorithms, Architectures, Arrangements and Applications, pp. 97–101.

23. Kostek B. (2019), Music information retrieval – The impact of technology, crowdsourcing, big data, and the cloud in art, Journal of the Acoustical Society of America, 146(4): 2946, https://doi.org/10.1121/1.5137234.

24. Kostek B., Odya P., Suchomski P. (2016), Loudness scaling test based on categorical perception, Archives of Acoustics, 41(4): 637–648, https://doi.org/10.1515/aoa-2016-0061.

25. Kotus J., Szczodrak M., Czyewski A., Kostek B. (2012), Distributed system for noise threat evaluation based on psychoacoustic measurements, Metrology and Measurement Systems, 19(2): 219–230, https://doi.org/10.2478/v10178-012-0019-6.

26. Maijala P., Shuyang Z., Heittola T., Virtanen T. (2018), Environmental noise monitoring using source classification in sensors, Applied Acoustics, 129: 258–267, https://doi.org/10.1016/j.apacoust.2017.08.006.

27. Marciniuk K., Kostek B. (2015), Creating a numerical model of noise conditions based on the analysis of traffic volume changes in cities with low and medium structure, [in:] Postepy Akustyki – Progress of Acoustics, Opielinski K.J. [Ed.], pp. 347–358, Polskie Towarzystwo Akustyczne, Wrocław.

28. Meng Z., Gaur Y., Li J., Gong Y. (2019), Speaker adaptation for attention-based end-to-end speech recognition, [in:] INTERSPEECH 2019 – 21th Annual Conference of the International Speech Communication Association, pp. 241–245, https://doi.org/10.21437/Interspeech.2019-3135.

29. Miskiewicz K.,Wojaczek A. (2010), Radio Communication System Using Leaky Feeder in Mines Undergrounds [in Polish: Systemy radiokomunikacji z kablem promieniującym w kopalniach podziemnych] Silesian University of Technology Publishing House, Gliwice.

30. Miskiewicz K., Wojaczek A. (2016), How to assess and improve the quality of voice services in telephone communication and alarm systems in mines, Mining – Informatics, Automation and Electrical Engineering, 2(526): 40–47.

31. Morello R., Mukhopadhyay S.C., Liu Z., Slomovitz D., Samantaray S.R. (2017), Advances on sensing technologies for smart cities and power grids: A review, IEEE Sensors Journal, 17(23): 7596–7610, https://doi.org/10.1109/JSEN.2017.2735539.

32. Möller S., Raake A. (2014), Quality of Experience. Advanced Concepts, Applications and Methods, Springer Cham.

33. Pocta P., Beerends J.G. (2015), Subjective and objective assessment of perceived audio quality of current digital audio broadcasting systems and web-casting applications, IEEE Transactions on Broadcasting, 61(3): 407–415, https://doi.org/10.1109/TBC.2015.2424373.

34. Szczodrak M., Czyzewski A., Kotus J., Kostek B. (2014), Frequently updated noise threat maps created with use of supercomputing grid, Noise Mapping, 1(1): 32–39, https://doi.org/10.2478/noise-2014-0004.

35. Une M., Miyazaki R. (2020), Musical-noise-free noise reduction by using biased harmonic regeneration and considering relationship between a priori SNR and sound quality, Applied Acoustics, 168: 107410, https://doi.org/10.1016/j.apacoust.2020.107410.

36. Zamlynska M., Debita G., Falkowski-Gilski P. (2022), Quality analysis of audio-video transmission in an OFDM-based communication system, [in:] Mobile and Ubiquitous Systems: Computing, Networking and Services. MobiQuitous 2021, Hara T., Yamaguchi H. [Eds.], pp. 724–736, Springer Cham, https://doi.org/10.1007/978-3-030-94822-1_47.