Comparing Sequences and Trees From Computational Biology to Music Analysis Julien Allali, Pascal Ferraro, Pierre Hanna and Matthias Robine PIMS - CNRS, University of Bordeaux 1, LaBRI, SIMBALS From Computational Biology to Music Pascal Ferraro 1 / 34
47
Embed
Comparing Sequences and Trees From Computational Biology ... · Comparing Sequences and Trees From Computational Biology to Music Analysis Julien Allali, Pascal Ferraro, Pierre Hanna
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Comparing Sequences and TreesFrom Computational Biology to Music Analysis
Julien Allali, Pascal Ferraro, Pierre Hanna and Matthias Robine
PIMS - CNRS, University of Bordeaux 1, LaBRI, SIMBALS
From Computational Biology to Music Pascal Ferraro 1 / 34
Comparison of Biological Structures
What do we compare ?DNA (coding and non coding regions)RNAproteinsPlant Architecture
DataSequences or trees of nucleotidesSequences of amino-acidsSequences or trees of elementary entities
=⇒ Comparison of strings (or trees) of characters
Why do we compare ?Search for similar biological functions,Identification of comparable structures,Construction of phylogenetic trees,Identification of gene mutation,Detection of gene transfer.
From Computational Biology to Music Pascal Ferraro 2 / 34
Comparison of Biological Structures
What do we compare ?DNA (coding and non coding regions)RNAproteinsPlant Architecture
DataSequences or trees of nucleotidesSequences of amino-acidsSequences or trees of elementary entities
=⇒ Comparison of strings (or trees) of characters
Why do we compare ?Search for similar biological functions,Identification of comparable structures,Construction of phylogenetic trees,Identification of gene mutation,Detection of gene transfer.
From Computational Biology to Music Pascal Ferraro 2 / 34
Comparison of Biological Structures
What do we compare ?DNA (coding and non coding regions)RNAproteinsPlant Architecture
DataSequences or trees of nucleotidesSequences of amino-acidsSequences or trees of elementary entities
=⇒ Comparison of strings (or trees) of characters
Why do we compare ?Search for similar biological functions,Identification of comparable structures,Construction of phylogenetic trees,Identification of gene mutation,Detection of gene transfer.
From Computational Biology to Music Pascal Ferraro 2 / 34
Measure of Musical Similarity
What do we compare ?TimbreRhythmMelodies
DatabaseAudio (wav, mp3, . . . )Symbolic (MIDI)
=⇒ Symbolic melodic similarity = Comparison of sequences (ortrees) of notes
Why do we compare?Music Information Retrieval,Search for similarities in musical database,Automatic detection of plagiarism,Musical analysis by self-similarity.
From Computational Biology to Music Pascal Ferraro 3 / 34
Measure of Musical Similarity
What do we compare ?TimbreRhythmMelodies
DatabaseAudio (wav, mp3, . . . )Symbolic (MIDI)
=⇒ Symbolic melodic similarity = Comparison of sequences (ortrees) of notes
Why do we compare?Music Information Retrieval,Search for similarities in musical database,Automatic detection of plagiarism,Musical analysis by self-similarity.
From Computational Biology to Music Pascal Ferraro 3 / 34
Measure of Musical Similarity
What do we compare ?TimbreRhythmMelodies
DatabaseAudio (wav, mp3, . . . )Symbolic (MIDI)
=⇒ Symbolic melodic similarity = Comparison of sequences (ortrees) of notes
Why do we compare?Music Information Retrieval,Search for similarities in musical database,Automatic detection of plagiarism,Musical analysis by self-similarity.
From Computational Biology to Music Pascal Ferraro 3 / 34
Molecule of DNA or RNA : linear suite of nucleotides = primarystructure
DNA : a molecule is always made of a sugar, a phosphate group andone of the four nucleic acids: Adenine, Cytosine, Guanine andThymine. There are represented by an alphabet made of their initials :{A, C, G, T}
RNA : Thymine T is replaced by Uracil U.
Sometimes, some positions in the sequence are unknown⇒ anextended alphabet is used.
Proteins : sequences of amino-acids (20 characters in the alphabet).
From Computational Biology to Music Pascal Ferraro 6 / 34
From Computational Biology to Music Pascal Ferraro 19 / 34
Trees Modeling Tree Comparison First Applications
Representation of RNA Secondary Structures
Sequences = Primary Structure
Basis A, C, G, U can make pairings (hydrogen links), 4 levels ofpairings :
Watson-Crick pairs : A—U and G — CWobble pairs (lower energy level) : G—Upairs with very low level of energy : G—A or C—Aother pairs (rare) : actually any pair can occur. (Leontis N., Westhof E.2001)
⇒ folding of the sequence in a secondary structure
From Computational Biology to Music Pascal Ferraro 20 / 34
Trees Modeling Tree Comparison First Applications
Representation of Secondary Structures of RNA
From Computational Biology to Music Pascal Ferraro 21 / 34
Trees Modeling Tree Comparison First Applications
Toward a Multi-scale representation
Ouangraoua et al. 2007
A Multiple Graph Layers Model (Allali and Sagot , 2006)
From Computational Biology to Music Pascal Ferraro 22 / 34
Trees Modeling Tree Comparison First Applications
Plant Architecture Modeling
v
r
From Computational Biology to Music Pascal Ferraro 23 / 34
Trees Modeling Tree Comparison First Applications
Plant Architecture Modeling
v
r
<
+
<
<
<+
<+
<
<
<
<
<
<
<
<
<
<
<
<
++
++
+
+
r
v
From Computational Biology to Music Pascal Ferraro 23 / 34
Trees Modeling Tree Comparison First Applications
Plant Architecture Modeling
(Godin and Caraglio, 1998)From Computational Biology to Music Pascal Ferraro 24 / 34
Trees Modeling Tree Comparison First Applications
Tree Graph Representation of Monophonic Melody
Hierarchy of note duration (Rizo et al., 2003)
From Computational Biology to Music Pascal Ferraro 25 / 34
Trees Modeling Tree Comparison First Applications
Tree Graph Representation of Monophonic Melody
Hierarchy of note duration (Rizo et al., 2003)
From Computational Biology to Music Pascal Ferraro 25 / 34
Trees Modeling Tree Comparison First Applications
Polyphony : Sequences of sequences
(Hanna and Ferraro, 2007)
Notes starting at the same time are grouped,Notes in a same chord are not ordered
Problem with time overlapping : representation in linked notes
(a)
(b)
(c)
From Computational Biology to Music Pascal Ferraro 26 / 34
Trees Modeling Tree Comparison First Applications
Polyphony : Sequences of sequences
(Hanna and Ferraro, 2007)
Notes starting at the same time are grouped,Notes in a same chord are not ordered
Problem with time overlapping : representation in linked notes
(a)
(b)
(c)
From Computational Biology to Music Pascal Ferraro 26 / 34
Trees Modeling Tree Comparison First Applications
Polyphony : Sequences of sequences
(Hanna and Ferraro, 2007)
B4E4 G4 B4 D4 r4 C4 E4 G4 D2 A2 G8 B8 D8
Notes starting at the same time are grouped,Notes in a same chord are not ordered
Problem with time overlapping : representation in linked notes
(a)
(b)
(c)
From Computational Biology to Music Pascal Ferraro 26 / 34
Trees Modeling Tree Comparison First Applications
Polyphony : Sequences of sequences
(Hanna and Ferraro, 2007)
B4E4 G4 B4 D4 r4 C4 E4 G4 D2 A2 G8 B8 D8
Notes starting at the same time are grouped,Notes in a same chord are not ordered
Problem with time overlapping : representation in linked notes
(a)
(b)
(c)
From Computational Biology to Music Pascal Ferraro 26 / 34
Trees Modeling Tree Comparison First Applications
Western Music
Main propertiesRhythmtonal Information
Different levels to be structured
Tree Graph representation using5 layers (Rocher, 2008)
Global tonalityLocal tonality (modulations)Chords (progression)Groups of notes(homorhythmic)Notes
Main key
Local key Local key
ChordChord
NoteChord
Note
Chord
Note Note
NoteChord accord
Note Note
NoteChordNoteChord
Note Note Note
time
From Computational Biology to Music Pascal Ferraro 27 / 34
Trees Modeling Tree Comparison First Applications
Edit Distance between trees
From Computational Biology to Music Pascal Ferraro 28 / 34
Trees Modeling Tree Comparison First Applications
Several Variations
There is several methods based on tree edition principle:Constraints on tree height (Selkow, 1976)
Ordered or Unordered trees (Zhang and Shasha, 1990, Zhang, 1996)
Local edition (Ouangraoua et al., 2006)
Alignment (Jiang et al., 1995, 2002)
From Computational Biology to Music Pascal Ferraro 29 / 34
Trees Modeling Tree Comparison First Applications
Application to Plagiarism Detection
Melodic similarity,Harmonic similarity,Combination of two.
Main key
Local key Local key
ChordChord
NoteChord
Note
Chord
Note Note
NoteChord accord
Note Note
NoteChordNoteChord
Note Note Note
time
From Computational Biology to Music Pascal Ferraro 30 / 34
Trees Modeling Tree Comparison First Applications
First Experiments
Representation Musical pieceSimilarity score
R. Mack vs G. Harrison (1976)Query Sweet Lord
Sweet Lord So Fine Essen Rank 1Note 143.8 14.0 20.0
From Computational Biology to Music Pascal Ferraro 32 / 34
Conclusion and Future Works
Musical and Algorithmic Perspectives
Automatic detection of repetitionsInference of musical structures (Allali et al., 2009)⇒ Verse - Chorus -Verse - ChorusLongest repeated part (overlapping or not overlapping)
no inference of musical structure. . . but comparison based on structuralproperties
Comparison of self-similarity matricesAlgorithmic problem: local alignment of 2D matricesMusical applications: searching for music with a structural query
Examples:Happiness is a Warm Gun, BeatlesParanoid Android, RadioheadWithout you I’m Nothing, Placebo
From Computational Biology to Music Pascal Ferraro 33 / 34
Conclusion and Future Works
Long Term Perspectives
Music recommendation systems
Browsing music
Pedagogy, musical games, evaluation of music performance