Archives of Acoustics, 42, 2, pp. 235–247, 2017

An Automatic Synthesis of Musical Phrases from Multi-Pitch Samples

Marek Janusz PLUTA
Academy of Music in Krakow

Leszek Jędrzej SPAŁEK
Cracow Institute of Technology

Rafał Jacek DELEKTA
Academy of Music in Krakow

Sound synthesizers are a natural element of a musician's toolset. In music arrangement, samplers can often produce satisfactory results, but it requires a combination of manual and automatic methods that may be arduous at times. Concatenative Sound Synthesis reproduces many of natural performance- and expression-related nuances- but at a cost of high demand for processing power. Here, another method of musical phrase synthesis aimed at music arrangement is presented that addresses these and other related issues. Contrary to common practice, we propose to record and utilize sound samples containing not one, but short sequences of pitches. In effect, natural pitch transitions are preserved and phrases appear to be much smoother, despite using very limited set of performance rules. The proof-of-concept implementation of the proposed method is discussed in detail along with attempts at optimizing the final sound effects, based on auditory tests. The limitations and future applications of the synthesizer are also discussed.
Keywords: sound synthesis; synthesis methods; sampling; Concatenative Sound Synthesis
Full Text: PDF
Copyright © Polish Academy of Sciences & Institute of Fundamental Technological Research (IPPT PAN).


Bresin R. (1998), Artificial neural networks based models for automatic performance of musical scores, Journal of New Music Research, 27, 239–270.

Bresin R., Friberg A., Sundberg J. (2002), Director Musices: The KTH Performance Rules System, Proceedings of SIGMUS-46, pp. 43–48, Kyoto.

Czerwiński A., Grzybek D., Krauze P., Łuczko J., Michalczyk K., Orkisz P., Pluta M., Radziszewski L., Saga M., Snamina J. (2015), GPU-based sound synthesis controlled with MIDI protocol, [in Polish: Synteza dźwięku z wykorzystaniem procesora graficznego, sterowana przy użyciu protokołu MIDI], [in:] Selected issues of noise and vibration reduction systems, [in Polish:] Wybrane zagadnienia układów redukcji drgań i hałasu, Orkisz, P. [Ed.], pp. 40–52, Akademia Górniczo-Hutnicza, Kraków.

Delekta R.J., Pluta M. (2015), An implementation of the performance rules in the modified sampling synthesis of wind instruments [in Polish:] Implementacja reguł wykonawczych w syntezie dźwięku instrumentów dętych zmodyfikowaną metodą samplingową, Proceedings of XVI International Symposium on Sound Engineering and Tonmeistering ISSET'2015, pp. 119–125, Warszawa.

Delekta R.J., Spalek L.J., Pluta M. (2016), The impact of selected parameters of a modified sampling synthesis on the result of its auditory assessment, Journal of Applied Mathematics and Physics, 4, 2, 221–226.

Friberg A., Bresin R., Sundberg J. (2006), Overview of the KTH rule system for music performance, Advances in Cognitive Psychology, 2, 2–3, 145–161.

Gabrielsson A. (1985), Interplay between analysis and synthesis in studies of music performance and music experience, Psychology of Music, 3, 59–86.

Klatt D.H. (1983), Review of Text-to-Speech Conversion for English. Journal of the Acoustics Society of America, 82, 3, 737–793.

Lindemann E. (2007), Music Synthesis with Reconstructive Phrase Modeling. IEEE Signal Processing Magazine, 24, 2, 80–91.

London J. (2004), Hearing in Time: Psychological Aspects of Musical Meter, Oxford University Press, Oxford.

Maestre E., Ramı́rez R., Kersten S., Serra X. (2009), Expressive concatenative synthesis by reusing samples from real performance recordings, Computer Music Journal, 33, 4, 23–42.

Pluta M., Delekta, R.J. (2015), Technique to Seamlessly Connect Sound Samples in Sampling Synthesis, [in:] Progress of Acoustics 2015, Opieliński K.J. [Ed.], pp. 271–282, Polskie Towarzystwo Akustyczne, Wrocław.

Pluta M., Spalek L.J., Delekta R.J. (2016), A modified sampling synthesis for a realistic simulation of wind instruments – the design and implementation, Journal of Applied Mathematics and Physics, 4, 2, 215–220.

Prudon R. (2003), A Selection/Concatenation TTS Synthesis System. PhD Thesis, LIMSI, University of Paris XI.

Schwarz D., Beller G., Verbrugghe B., Britton S. (2006), Real-Time Corpus-Based Concatenative Synthesis with CataRT, Proceedings of the COST-G6 Conference on Digital Audio Effects (DAFx), pp. 279–282, Montreal.

Schwarz D., Cahen R., Britton S. (2008), Principles and applications of interactive corpus-based concatenative synthesis, Proceedings of the Journées d'Informatique Musicale (JIM).

Simon I., Basu S., Salesin D., Agrawala M. (2005), Audio Analogies: Creating New Music from an Existing Performance by Concatenative Synthesis, Proceedings of the 2005 International Computer Music Conference, pp. 65–72, San Francisco.

Smith III J.O. (1991), Viewpoints on the History of Digital Synthesis, Proceedings of the International Computer Music Conference ICMC-91, pp. 1–10, Montreal.

Widmer G. (1995), Modeling rational basis for musical expression, Computer Music Journal, 19, 76–96.

Widmer G. (2002), Machine discoveries: A few simple, robust local expression principles, Journal of New Music Research, 31, 37–50.

Widmer G., Tobudic A. (2003), Playing Mozart by analogy: Learning multi-level timing and dynamics strategies, Journal of New Music Research, 32, 259–268.

DOI: 10.1515/aoa-2017-0026