Revisiting the quantitative phylogeny of the Uralic languages Kaj Syrjänen , Terhi Honkola, Jadranka Rota, Unni-Päivä Leino, Outi Vesakoski Contextualizing historical lexicology, 15.5.2017, University of Helsinki Map: Geographical Database of the Uralic languages by BEDLAN & J. Ylikoski
38
Embed
Revisiting the quantitative phylogeny of the Uralic languages · Revisiting the quantitative phylogeny of the Uralic languages Kaj Syrjänen, Terhi Honkola, Jadranka Rota, Unni-Päivä
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Revisiting the quantitative phylogeny of the Uralic languages
Kaj Syrjänen,Terhi Honkola, Jadranka Rota,
Unni-Päivä Leino, Outi Vesakoski
Contextualizing historical lexicology,15.5.2017, University of Helsinki
Map: Geographical Database of the Uralic
languages by BEDLAN & J. Ylikoski
Chang et al. 2015
Grollemund et al. 2015
Background
Syrjänen et al. 2013
Bouckaert et al. 2012
Mathematical model
A
B
C
Background PhylogenyPARAMETERS
Mathematical model
Topology
Branch lenght
Background
Other parameters
Syrjänen et al. 2013
Basic rules of sequence evolution
PARAMETERS PhylogenyLanguage phylogeny
• Genetic data is used in biology to make phylogenies
• Not all genetic material is similar
- Heterogenous rate of change
-e.g. coding regions (genes) vs. non-coding regions
→ Should not be analysed together
Introduction
CC Attribution 4.0 License, http://cnx.org/contents/[email protected]:xiQtvh_M@3/Structure-and-Function-of Cell#OSC_Microbio_10_04_noncodDNA
Parameter set 1 Parameter set 2
• Phylogenetic partitioning is the solution
- Takes into account heterogenous patterns of evolution
- Different parameters for different parts of the genome
- Common in biological phylogenetic analyses!
Introduction
CC Attribution 4.0 License, http://cnx.org/contents/[email protected]:xiQtvh_M@3/Structure-and-Function-of Cell#OSC_Microbio_10_04_noncodDNA
Mathematical model
Topology
Branch lenght
Background
Basic rules of sequence evolution
PARAMETERS(Language) phylogeny
Rate heterogeneity
(set 1: coding)
Rate heterogeneity (set 2: non-coding)
• Partitioning vs. no partitioning
• May produce trees which differ in (Kainer & Lanfear
2015)
- Branch support
- Topology
- Branch length
→ In other words, almost anything
Introduction
• Language data is used to make quantitative phylogenies
• Bayesian model-based methods
• Parsimony methods
• Distance based methods
- Glottochronology
• Not all linguistic material is similar
- Heterogenous rate of change
Introduction
Introduction
2) from one meaning to next
= e.g. parts of speech change at varying
rates
1) from one language to next
= Languages change at varying rates
Pagel et al. 2007
Solved in Bayesian analyses by using
evolutionary models
• Two notable points of linguistic variation:
- Rate of lexical replacement varies
Introduction
1) from one language to next
= Languages change at varying rates
• Two notable points of linguistic variation:
- Rate of lexical replacement varies
2) from one meaning to next
= e.g. parts of speech change at varying
rates
a) Solved by allowing rate variation
along gamma distribution
b) Another option could be data
partitioning
Introduction2) from one meaning to next
a) Rate variation along gamma distribution
Used in language phylogenies by e.g.
- Grollemund et al. 2015
- Chang et al. 2015
- Bouckaert et al. 2012
b) Data partitioning
- manual vs. algorithmic
-e.g. with TIGER algorithm
Not used earlier to make language
phylogenies
By Gamma_distribution_pdf.png: MarkSweep and Cburnettderivative work: Autopilot (talk) -Gamma_distribution_pdf.png, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=10734916