Is a Multi-Slider Interface Layout Responsible for a Stimulus Spacing Bias in the MUSHRA Test?

Sławomir ZIELIŃSKI

doi:10.1515/aoa-2015-0058

Authors

Sławomir ZIELIŃSKI Białystok University of Technology, Poland

Abstract

The multi-stimulus test with hidden reference and anchors (MUSHRA) is commonly used for subjective quality assessment of audio systems. Despite its wide acceptance in scientific and industrial sectors, the method is not free from bias. One possible source of bias in the MUSHRA method may be attributed to a graphical design of its user interface. This paper examines the hypothesis that replacement of the standard multi-slider layout with a single-slider version could reduce a stimulus spacing bias observed in the MUSHRA test. Contrary to the expectation, the aforementioned modification did not reduce the bias. This outcome formally supports the validity of using multiple sliders in the MUSHRA graphical interface.

Keywords:

audio quality assessment, subjective quality evaluation, listening tests, psychoacoustics, multi stimulus test with hidden reference and anchors, MUSHRA.

References

1. Bech S. (1992), Selection and training of subjects for listening tests on sound reproducing equipment, J. Audio Eng. Soc., 40, 590–610.
2. Beresford K., Ford N., Rumsey F., Zieliński S. (2006), Contextual Effects on Sound Quality Judgements: Part II – Multi-Stimulus vs. Single Stimulus Method, Presented at the 121st Convention of the Audio Engineering Society, Paper 6913.
3. Berg J., Bustad Ch., Jonsson L., Mossberg L., Nyberg D. (2013), Perceived Audio Quality of Realistic FM and DAB+ Radio Broadcasting Systems, J. Audio Eng. Soc., 61, 755–777.
4. Blauert J., Jekosch U. (2012), A Layer Model of Sound Quality, J. Audio Eng. Soc., 60, 4–12.
5. Christie D. (2008), On the Effect of Slider Presentation within the MUSHRA Test, Final Year Tonmeister Technical Project, Institute of Sound Recording, University of Surrey.
6. EBU Tech 3296 Technical Document (2003), EBU subjective listening tests on low-bitrate audio codecs, European Broadcasting Union, Geneva, Switzerland.
7. EBU Tech 3324 Technical Document (2007), EBU evaluations of multichannel audio codecs, European Broadcasting Union, Geneva, Switzerland.
8. ITU-R Rec. BS.1534-2 (2001–2014), Method for the Subjective Assessment of Intermediate Quality Level of Coding Systems, International Telecommunications Union, Geneva, Switzerland.
9. ITU-T Rec. P.800 (1996), Methods for objective and subjective assessment of quality, International Telecommunications Union, Geneva, Switzerland.
10. Howell D.C. (1997), Statistical Methods for Psychology, Duxbury, New York.
11. Lawless H.T., Heymann H. (1998), Sensory Evaluation of Food, Kluwer-Plenum, London.
12. Lee S., Lee Y-T., Seo J., Baek M-S., Lim Ch-H., Park H. (2011), An Audio Quality Evaluation of Commercial Digital Radio Systems, IEEE Transactions on Broadcasting, 57, 629–636.
13. Levine T.R., Hullett C.R. (2002), Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research, Human Communication Research, 28, 612–625.
14. Liebetrau J. et al. (2014), Revision of Rec. ITU-R BS.1534, Presented at the 137th Convention of the Audio Engineering Society, Paper 9172, Los Angeles.
15. Mellers B.A., Birnbaum M.H. (1982), Loci of Contextual Effects in Judgment, Journal of Experimental Psychology: Human Perception and Performance, 8, 582–601.
16. Möller S. (2000), Assessment and Prediction of Speech Quality in Telecommunications, Kluwer Academic Publishers, London.
17. Neuendorf M. et al. (2013), The ISO/MPEG Unified Speech and Audio Coding Standard – Consistent High Quality for All Content Types and at All Bit Rates, J. Audio Eng. Soc., 61, 956–977.
18. Olejnik S., Algina J. (2000), Measures of Effect Size for Comparative Studies: Applications, Interpretations, and Limitations, Contemporary Educational Psychology, 25, 241–286.
19. Olive S.E. (2003), Differences in Performance and Preference of Trained versus Untrained Listeners in Loudspeaker Tests: A Case Study, J. Audio Eng. Soc., 51, 806–825.
20. Poulton E.C. (1989), Bias in Quantifying Judgments, Lawrence Erlbaum, London.
21. Rumsey F., Zieliński S., Kassier R., Bech S. (2005), Relationships between experienced listener ratings of multichannel audio quality and naïve listener preferences, J. Acoust. Soc. Am., 117, 3832–3840.
22. Schinkel-Bielefeld N., Lotze N., Nagel F. (2013), Audio quality evaluation by experienced and inexperienced listeners, Proceeding of Meeting on Acoustics, 19, ICA, Montreal, Canada.
23. Schmider E., Ziegler M., Danay E., Beyer L., Bühner M. (2010), Is It Really Robust? Reinvestigating the Robustness of ANOVA Against Violations of the Normal Distribution Assumption, Methodology European Journal of Research Methods for the Behavioral and Social Sciences, 6, 4, 147–151.
24. Soulodre G.A., Lavoie M.C. (1999), Subjective Evaluation of Large and Small Impairments in Audio Codecs, Presented at the 17th Audio Engineering
Society International Conference: High-Quality Audio Coding, Florence.
25. Wickelmaier F., Umbach N., Sergin K., Choisel S. (2012), Scaling sound quality using models for paired-comparison and ranking data, Presented at
DAGA 2012 Congress, Germany.
26. Zieliński S., Hardisty P., Hummersone C., Rumsey F. (2007), Potential Biases in MUSHRA Listening Tests, Presented at the 123rd Convention of the Audio Engineering Society, Paper 7179, New York.
27. Zieliński S., Rumsey F., Bech S. (2003), Effects of Down-Mix Algorithms on Quality of Surround Sound, J. Audio Eng. Soc., 51, 780–798.
28. Zieliński S., Rumsey F., Bech S. (2008), On Some Biases Encountered in Modern Audio Quality Listening Tests – A Review, J. Audio Eng. Soc., 56, 427–451.

Online first
2025, Vol 50
	No 1	No 2
2024, Vol 49
	No 1	No 2	No 3	No 4
2023, Vol 48
	No 1	No 2	No 3	No 4
2022, Vol 47
	No 1	No 2	No 3	No 4
2021, Vol 46
	No 1	No 2	No 3	No 4
2020, Vol 45
	No 1	No 2	No 3	No 4
2019, Vol 44
	No 1	No 2	No 3	No 4
2018, Vol 43
	No 1	No 2	No 3	No 4
2017, Vol 42
	No 1	No 2	No 3	No 4
2016, Vol 41
	No 1	No 2	No 3	No 4
2015, Vol 40
	No 1	No 2	No 3	No 4
2014, Vol 39
	No 1	No 2	No 3	No 4
2013, Vol 38
	No 1	No 2	No 3	No 4
2012, Vol 37
	No 1	No 2	No 3	No 4
2011, Vol 36
	No 1	No 2	No 3	No 4
2010, Vol 35
	No 1	No 2	No 3	No 4
2009, Vol 34
	No 1	No 2	No 3	No 4
2008, Vol 33
	No 1	No 2	No 3	No 4	No 4(S)
2007, Vol 32
	No 1	No 2	No 3	No 4	No 4(S)
2006, Vol 31
	No 1	No 2	No 3	No 4	No 4(S)
2005, Vol 30
	No 1	No 2	No 3	No 4
2004, Vol 29
	No 1	No 2	No 3	No 4
2003, Vol 28
	No 1	No 2	No 3	No 4
2002, Vol 27
	No 1	No 2	No 3	No 4
2001, Vol 26
	No 1	No 2	No 3	No 4
2000, Vol 25
	No 1	No 2	No 3	No 4
1999, Vol 24
	No 1	No 2	No 3	No 4
1998, Vol 23
	No 1	No 2	No 3	No 4
1997, Vol 22
	No 1	No 2	No 3	No 4
1996, Vol 21
	No 1	No 2	No 3	No 4
1995, Vol 20
	No 1	No 2	No 3	No 4
1994, Vol 19
	No 1	No 2	No 3	No 4
1993, Vol 18
	No 1	No 2	No 3	No 4
1992, Vol 17
	No 1	No 2	No 3	No 4
1991, Vol 16
	No 1	No 2	No 3-4
1990, Vol 15
	No 1-2		No 3-4
1989, Vol 14
	No 1-2		No 3-4
1988, Vol 13
	No 1-2		No 3-4
1987, Vol 12
	No 1	No 2	No 3-4
1986, Vol 11
	No 1	No 2	No 3	No 4
1985, Vol 10
	No 1	No 2	No 3	No 4
1984, Vol 9
	No 1-2		No 3	No 4
1983, Vol 8
	No 1	No 2	No 3	No 4
1982, Vol 7
	No 1	No 2	No 3-4
1981, Vol 6
	No 1	No 2	No 3	No 4
1980, Vol 5
	No 1	No 2	No 3	No 4
1979, Vol 4
	No 1	No 2	No 3	No 4
1978, Vol 3
	No 1	No 2	No 3	No 4
1977, Vol 2
	No 1	No 2	No 3	No 4
1976, Vol 1
	No 1	No 2	No 3	No 4

Is a Multi-Slider Interface Layout Responsible for a Stimulus Spacing Bias in the MUSHRA Test?

Downloads

Authors

Abstract

Keywords:

References

cover

ippt-pan

Issue

Pages

Section

DOI

License

How to Cite

Principal Contact

Address

Support Contact