Bridging Music Information Retrieval and Folk Song Research - …pub.dega-akustik.de/NAG_DAGA_2009/data/articles/000392.pdf · 2009-05-14 · searchers. Both are based on the encoded

Bridging Music Information Retrieval and Folk Song Research

The computational setup of the WITCHCRAFT project

J. GarbersUtrecht University, The Netherlands, Email: [email protected]

AbstractThe WITCHCRAFT project sets as its objective todevelop a fully functional content-based retrieval systemfor folk song melodies. Besides making the Netherlandsfolk song collection of the Meertens Institute available forthe general public, we aim to support their research inoral variation. Folk song researchers traditionally knowmany songs by heart and intuitively find correlationsbetween them. For large or international collections,however, researchers cannot know all the song instances,so computer-aided support is indispensable to revealrelated items. In principle this means to take existingmodels about oral variation in designing and implement-ing similarity measures that can be used in a retrievalsystem. This, however, proves to be difficult in practice.

In this paper we present our information retrieval setupconsisting of performed folk song recordings, their sym-bolic encoding, an expert classification of variant tunes(ground truth), annotation data about the particularexpert classification reasons and our approaches to simi-larity. As an example we present how we translated theexpert concept of ”pitch stability” into an alignment-based similarity measure and how we evaluated ourmethod and the original ”pitch stability” hypothesis withrespect to the given annotations.

IntroductionIn general an information retrieval system consists ofa database, a query interface, one or more matchingalgorithms to compare items from the database withthe query and a result presentation component. Thedatabase is obviously a limiting factor, because queryingfor features which are not encoded in the database ismeaningless. But the query and response interfaces arealso important, because they allow the user to expressand satisfy his or her information need. Once thedatabase and the user interface are fixed, the results ofdifferent matching methods can be compared with eachother.

In the following sections we consider two retrieval systemsetups, one for the general public and one for folk song re-searchers. Both are based on the encoded transcriptionsof folk songs. We introduce the folk song collection whichwe make accessible by content-based search. We explainthe information need of folk song researchers and sketchoptions for realizing the folk song query and matchingcomponents.

The collectionOnder de groene linde (Under the green lime tree) wasthe name of a Netherlands radio broadcast show of folksong field recordings from 1957-1994. [1] Old people wereasked to sing tunes that they learned in their childhood(mostly by listening). About 5000 of these recordingswere manually transcribed by experts before the startof our project (see figure 1). Today all recordings andthe scanned transcriptions are part of the Liederenbank1

presentation.

Figure 1: A manual transcription of several strophes of afolk song recording

However, the musical content of the data (both therecordings and the handwritten scores) is difficult toaccess in a content-based retrieval system. Therefore wedeveloped the necessary editing tools (see figure 2) andguided the processes at the Meertens Institute to encodethe transcriptions.

Figure 2: The WitchCraftEditor1http://www.liederenbank.nl

NAG/DAGA 2009 - Rotterdam

336

We decided to stay close to the handwritten transcrip-tions. From these we keep the lyrics-oriented, line-based notation. This results into musically reasonablesegmentations, which is handy if users are not interestedin matching melody segments that cross line boundaries,or if they want to look only for variants of a particularline of a song. From this encoding we generate repre-sentations of the strophes and phrases in MIDI formatand in the musicology-friendly Humdrum format.2 Sheetmusic images are generated for verification and for theweb presentation.

The Liederenbank now presents the first phrase of asong along with metadata and provides a very convenientbrowse interface: A button allows to search for melodiessimilar to the current one (see figure 3). A rankedresult list is produced using a particular computationalmodel of music similarity. In the future users can choosefrom several pre-computed similarity relations. However,querying with one example song at a time is not the onlyoption for experts, who should be able to incorporatemore analytic and corpus knowledge.

Figure 3: A query-by-example link (zoek vergelijkbaremelodieen), embedded in the Liederenbank interface

Folk song research modelsIn [2] we adopted a generic information retrieval systemto suit the musical domain. The system allows toinstantiate and combine different similarity measures (seefigure 4). But to further adopt existing measures for folksong research we must understand the information needof folk song researchers.

Folk song researchers aim to reconstruct the process oforal transmission, where songs are passed from gener-ation to generation by singing, listening and creativelychanging them. Songs that are no more part of ourculture and that are not preserved by recording orwriting them down are lost. Mostly based on pro-found knowledge and professional intuition folk songresearchers relate melodies, that have some identifying

2http://dactyl.som.ohio-state.edu/Humdrum/

Figure 4: A retrieval system with different similaritymeasures to choose from

features in common. In the Liederenbank experts assignto melodies such a melody norm label. This assignmentis not always unproblematic because sometimes songs arecomposed by putting parts of two different parent songstogether. Nevertheless, we can use the melody norm asa ground truth for measuring the retrieval performanceof similarity measures that we design to find folk songvariants.

Melody norm annotationsWe asked our folk song researchers to annotate theinter-relations of the melody norm members further forsome tune families. After some discussions about whichfeatures are relevant and how to encode relations bothquickly and with reliable agreement across experts, weimplemented an annotation tool (see figure 5). Theannotators choose a reference melody for each melodynorm and compare it globally and phrase-wise with othermembers. They rate the song pair’s contour similarityand rhythmical similarity and identify common motifs.

Figure 5: A melody norm annotation tool for global andphrase-wise song comparison

We can use this data in three ways. First, we can


337

infer which features are important for a specific melodynorm and match only those features. Second, we canbuild evaluation sets that are specific to test contourmeasures or rhythmical measures. Third, we can alignthe corresponding phrases (see figure 6) and find whatremains stable in a particular group.

Figure 6: Similarity values between reference melody 70096and other melody norm members and the correspondence ofthe melodic phrases with those of the reference melody

Pitch stabilityWe have conducted a pitch stability study in [4]. Thebasic assumption behind the common pitch stabilityhypothesis is that some pitches or pitch progressions aresomewhat characteristic for a melody norm. Figure 7shows a manual alignment of six folk song variants fromthe same melody norm. Pitches on corresponding onsetpositions are projected on the staffs below. One can seeimmediately that there is generally very little variance(usually only two or three pitch classes) and that strongbeat positions are more stable that week one’s.

Figure 7: Studying pitch stability using alignments andprojections

In [3] we used different superpositions of the alignedvariants at different metrical abstractions and found thata matching method based on that superposition performsbetter than using single melody matching. This shouldbe theoretically the case, when a third variant can beexplained by cross-overs of given variants or by a commonharmonic progression. Another finding was that pitchesare more stable in the beginning of phrases and in thefirst and last phrase.

Alignment problemsIn [3] we describe manual alignments, which can bemade using the WitchCraftEditor. Figure 8 shows amanual alignment, which displays the following three editoperations. On long notes or upbeats an extra slurrednote can be added. Extra notes can be introduced to

accommodate extra syllables. The introduction of aninvisible rest in the last bar is necessary to align themelodies to a common meter signature. (In the originaltranscription of the second melody there is a 9/8 measuresignature change just for the last bar.)

Figure 8: Alignment of 2 variants showing different editoperations

Making manual alignments is quite time consuming. Tomake querying more practical, we therefore investigateautomatic alignment of melody norm melodies. Themain problem is to model the type of changes that canoccur. [6] In [5] we propose a hierarchically constrainedalignment approach which uses segments given by thephrase structure and by the metrical hierarchy. Inour current research we study how to best incorporatedifferent musically reasonable edit operations into eitherscoring functions or complexer alignment algorithm se-tups.

ConclusionDesigning and realizing a music information retrievalsystem for folk song researchers is quite demanding.We designed processes and tools for data entry and toget information from the researchers. We believe thatresearchers should be able to pose explicit queries andhandle adequately complex user interfaces that enablethem to bring in their analytic and intuitive capabilities.Therefore it will probably be more important to supportthe researchers in formulating good queries than to aimfor the ultimate similarity measure.

References[1] Onder de groene linde. Verhalende liederen uit

de mondelinge overlevering. Uitgeverij Uniepers,Amsterdam, 1987-1991.

[2] J . Garbers. An integrated MIR programming andtesting environment. In Proceedings of the seventhInternational Conference on Music InformationRetrieval. University of Victoria, 2006.

[3] J. Garbers, P. van Kranenburg, A. Volk, F. Wiering,L. Grijp, and R. C. Veltkamp. Using pitchstability among a group of aligned query melodiesto retrieve unidentified variant melodies. In SimonDixon, David Bainbridge, and Rainer Typke, editors,Proceedings of the eighth International Conferenceon Music Information Retrieval, pages 451–456.Austrian Computer Society, 2007.

[4] J. Garbers, A. Volk, P. van Kranenburg, F. Wiering,L. Grijp, and R. C. Veltkamp. On pitch andchord stability in folk song variation retrieval. InProceedings of the First International Conference


338

of the Society for Mathematics and Computationin Music (to be published), 2007. (http://www.mcm2007.info/pdf/fri3a-garbers.pdf).

[5] J. Garbers and F. Wiering. Towards structuralalignment of folk songs. In J. P. Bello and E. Chew,editors, Proceedings of the nineth InternationalConference on Music Information Retrieval, 2008.

[6] M. Mongeau and D. Sankoff. Comparison of musicalsequences. In W. Hewlett and E. Selfridge-Field,editors, Melodic Similarity Concepts, Procedures, andApplications. MIT Press, Cambridge, 1990. (http://www.mip.ups-tlse.fr/~mongeau/music.pdf).


339

Bridging Music Information Retrieval and Folk Song Research - …pub.dega-akustik.de/NAG_DAGA_2009/data/articles/000392.pdf · 2009-05-14 · searchers. Both are based on the encoded

Documents