World Atlas of Language Structures • Standard typological reference • Includes geographical maps • Ripe for quantitative exploration Data without Variables • Inputs matrices of relative similarity between entities • Outputs n-dimensional network of optimized coordinates • Iterative algorithm that minimizes overall error based on similarity scores Language-Feature Networks • Optimized network of languages based on linguistic features • Short distance: high degree of linguistic similarity; Long distance: little linguistic similarity • LFNs correspond to feature categories defined by WALS • Color-coded according to geographic macro-area defined by WALS Phonology Language-Feature Network • Eurasian and African languages are each tightly clustered • Absence of languages in the center: high degree of dependency among features Conclusions • Linguistic data should be in a machine readable format • Quantitative analyses of typological data are straight-forward and insightful with structured data like that in WALS This work was originally completed as part of the authorʼs undergraduate honors thesis at Dartmouth College. It was approved by the Math & Social Science Department in May of 2006. Visualizing Data in the World Atlas of Language Structures: the Language-Feature Network “Whatʼs Where Why?” Konyagi Mbum Tlapanec S ebei Nasioi Nimboran Nambakaengö Iwam C huave Manchu K urukh K'ekchí Totonac (Papantla) Adzera G oajiro Ignaciano Cayapa C ofá n G uahibo S iona Dahalo Margi Aizi B a riba K ohumono Teke (S outhern) Zande K oyraboro S enni Waray (in Australia) Western Desert (Ooldea) Waris Dumo K a s hmiri Konkani S inha la Ojibwa (E astern) Achumawi P omo (S outheas tern) Yucatec Alabama C hatino (S ierra Occidental) Mazahua Mixtec (Molinos) Brao Atayal Maranao S a'ban Mien Wu (Changzhou) Nung (in Vietnam) G uambiano B ribri J ivaro Apinayé C hulupí Amahuaca Quechua (Cochabamba) T icuna Aché Angas Dangalé at (Western) Dizi Hamer Hebrew (Modern Ashenazic) Kanakuru Kefa K ullo Lamé Neo-Aramaic (Persian Azerbaijan) Tera T igré Tuareg (Ahaggar) Hadza K a dugli //Ani Deti S andawe Aghem Akan Alladian Amo Beembe B irom B is a B obo F ing Bé té C iL uba Dan Dogon (T oro S o) E fik E jagham E wondo Fe'fe' Fyem Gbeya Bossangoa Gwari Gã Ijo (Kolokuma) Isoko J omang K is i (S outhern) Klao Kpan K pelle Lelemi Lua Mambila Mba Moro Mumuye Ndut Noni O gbia T ampulma Tarok UMbundu Yeyi B erta Daju (Dar Fur) Dinka Ik Ingessana K omo Lugbara Luo Maasai Nandi Nara (in E thiopia) Nobiin Nyangi Nyimang Tama Temein Y ulu Arrernte (Mparntwe) B andjalang (Y ugumbir) Bardi Djapu Kala Lagaw Ya Murrinh-P atha Wik Munkan B a ining T oaripi K iwa i Dera K woma Yessan-Mayo Savosavo Dadibi Angaatiha Fasu Gadsup K oiari Wantoat Yagaria R otokas West Makian Yareba Yawa E ven K haria S ora Itelmen K ota T elugu T ulu B ulgarian Darai Irish (Donegal) Ormuri R omansch (S charans) Komi-Zyrian S aami (C entral-S outh) Yukaghir (Tundra) C a ddo Quileute Aleut (E astern) Y upik (S iberian) S ha s ta Huave (S an Mateo del Mar) C herokee Tiwa (Northern) Tzeltal (Aguacatenango) Mixe (Totontepec) Chickasaw Ahtna C hipewyan Eyak Hupa Mazatec (C hiquihuitlán) B ella C oola Lushootseed S hus wa p Tonkawa O'odham Nuuchahnulth Wappo Great Andamanese Bru (Western) Jeh Nancowry Nyah Kur (Tha Pong) Pacoh Parauk Bajau Cham (Western) Iban Ivatan (S outhern) Kaliai-Kove Kedang Ma'ya Mor Po-Ai P ohnpeian R ukai Tausug T etun Tsou Bai B odo C hin (T iddim) Fuzhou Hakka J ingpho Karen (Sgaw) Lepcha Naxi Newari (Kathmandu) Nishi P hlong Tamang Xiamen Gelao Kam (Zhanglu) Lakkia Lü S ha n S ui Yay Campa (Axininca) Iranxe Resí garo Wapishana Y ucuna Cacua Camsá Akawaio Bakairí J apreria Panare K una Pech Huitoto (Murui) Itonama F ulniô Páez Barasano (Northern) S irionó Kabardian Amharic Awngi K otoko Ngizim S omali S oqotri !Xó õ J u|'hoan Bambara Dagbani Doyayo E we S enadi Temne W olof K oyra C hiini Maba Daga Pawaian Hamtai K unimaipa Una Azerbaijani Dagur K irghiz Mangghuer Moghol Nanai Tuvan Yakut Koryak Albanian K urdis h (C entral) R utul Tsova-Tush K ha nty Nganasan K iowa Amuzgo C hinantec (Quiotepec) W intu Chehalis (Upper) S re Hmong Njua Ao Cantonese Lahu Qawasqar Andoke Aikaná Amuesha Jebero Ika S elknam Abipó n Muinane Ocaina B ororo Kaingang Maxakalí Movima S áliba (in C olombia) Arabela Hausa Oromo (Harar) K hoekhoe Igbo Nkore-Kiga S a ngo Y oruba F ur Kanuri Ngiti Imonda S elepet S uena Ainu Basque Brahui Kannada Nepali R omanian S indhi Japanese Korean Lak Mari (Meadow) Ket Y urok Haida Acoma Navajo T lingit Mixtec (C halcatongo) Otomí(Mezquital) Lakhota Nahuatl (North P uebla) Vietnamese K iriba ti Burmese Garo Kayah Li (Eastern) Ladakhi Meithei Thai Awa Pit Yagua Alawa Mbabaram S entani B reton Lithuanian Wiyot Diegueñ o (Mesa Grande) Yana P uré pecha Tol Kawaiisu Luiseñ o Nahuatl (Tetelcingo) Makah Batak (Toba) Javanese K wa io R oro T ibetan (S tandard S poken) Tehuelche Nambikuára Waorani Beja Kera Tashlhiyt Diola-F ogny Murle B urarra Diyari Dyirbal G arrwa K alkatungu Kuku-Yalanji Malakmalak Yanyuwa Marind Wahgi Woisika YelîDnye Uzbek (Northern) Koya Bengali C a ta la n Kalami Norwegian P olis h Avar S elkup S eneca Karok Huastec Chinantec (Lealao) K la ma th Maidu (Northeast) T unica Hopi Kwakw'ala Zuni Hawaiian Iaai Irarutu Lenakel Tigak T iruray Yapese Bawm Aymara Canela-Krahô S hipibo-K onibo C ubeo S hiriana Berber (Middle Atlas) Luvale S upyire S wa hili Zulu B a girmi Lango Maranungku Ngiyambaa P itja ntja tja ra Alamblak Arapesh Dani (Lower Grand Valley) E kari Kewa Usan Bashkir Chuvash Khalkha Mundari C hukchi Armenian (E astern) Hindi P a s hto Persian Archi Hunzib Ingush Lezgian Nivkh Abkhaz Yukaghir (Kolyma) W ichita Oneida Jakaltek Zoque (Copainalá) Koasati Nez Perce Tsimshian (Coast) Yuchi Ndyuka Khasi K hmu' S edang Batak (Karo) K ilivila Maori Paamese Mandarin Paumarí Jaqaru C a rib Epena Pedee P irahã Araona Tacana Trumai Guaraní Arabic (E gyptian) Iraqw K rongo G rebo K oromfe Kunama G ooniyandi Kayardild Mangarrayi Martuthunira Maung Nunggubuyu T iwi Ungarinjin Wambaya Wardaman Y idiny Y ima s Lavukaleve Amele Asmat K obon Maybrat E venki Turkish Burushaski E nglis h French GermanGreek (Modern) Latvian Russian S pa nis h G eorgian F innis h Hungarian Cree (Plains) Passamaquoddy-Maliseet Greenlandic (West) Maricopa Kutenai S lave Coos (Hanis) Miwok (S outhern S ierra) S qua mis h C ahuilla C omanche Yaqui Khmer Semelai C hamorro Drehu F ijia n Indonesian Malagasy Paiwan R apanui Taba Tagalog Tukang Besi Mapudungun Apurinã Hixkaryana Cayuvava Wari' R a ma Wichí Urubú -Kaapor Warao S anuma Nenets Nyimang S a ngu Gbaya Kara Mwera R onga Masakin K ongo Day Nara (in E thiopia) S us u Mangbetu Dongo Langi Mursi Avokaya T irma ga Ndebele (in S outh Africa) Ila K a mba Barambu Mba Dagaare Lingala Wolaytta B ilin /X a m Mbere B erta S ha tt Kete B a bole B a fia E dolo Madimadi P itta P itta E mmi T a bla Iau S ia ne S a lt-Y ui G ureng G ureng Malakmalak F oe T airora Wik Munkan YelîDnye T obelo B inandere G ida ba l K unimaip a Adynyamathanha Nagatman Djingili Arapesh (Abu) Nend Madngele Dera Womo Alawa Fasu P anyjima Muruwari Baruya Gahuku B unuba F ore Anem Kalam Ayiwo Koryak S a nta Yakut S lovene Maithili K oluri S elkup Manchu Kabatei Komi-Zyrian Malayalam Udi R utul Talysh (S outhern) Azerbaijani Nenets Vafsi Tuvan K iliwa Cakchiquel K yuquot Huave (S an Mateo del Mar) Pame Tanacross Pima Bajo Hualapai Halkomelem Passamaquoddy-Maliseet Mazatec (Huautla) Nahuatl (North P uebla) G itks a n Omaha T utelo Tzotzil Mohawk Y up'ik (C entral) Massachusett Naga (Zeme) Motu Nalik Mussau Kachari Manam Idu Nguna Pangasinan Ala'ala B ikol C hepang Yay Thakali Yi Ambae (Lolovoli Northeast) Hani Amara Wedau Mwotlap Dumi Magar T ugun G allong Musom Ilocano Minaveha S imeulue Tatana' T ha ngmi K wa io Tinani Campa (Axininca) Wayamp i Chácobo Muisca T iriyo Hupda P iro Matis Kaingang Jebero Moseté n Achagua Zarma Kenga S ebei G umuz Bakueri Aari K inyarwanda Gamo Dime !Xó õ B abungo Nubian (Kunuz) Masa Ingessana Kxoe E ga P odoko Zande Lelemi Fula (Nigerian) Mbodomo S eme Basaá Igede Mbili Toussian Mongo Kemant Londo Baule Ndumu Qafar Kabyle R on Luganda Paakantyi Arrernte (Mparntwe) Yareba Amanab Kuman P itja ntja tja ra Ngaanyatjarra G ooniyandi Kaki Ae Ö mie G uugu Y imidhirr K orowai K iwa i Karkar-Yuri Warrwa Alyawarra Pawaian Awa G umbaynggir Djap u Kyaka Ngandi Hanga Hundi Meryam Mir Usarufa Kuuku Ya'u Telefol P ungupungu Dani (Lower Grand Valley) Belorussian Ossetic Prasuni B uriat S ardinian R omansch (S ursilvan) T ulu Savi C a ta la n Turkmen S inha la Kabardian Kalmyk Assamese Udmurt T orwali Mutsun Washo H ida ts a Mono (in United S tates) Tepehua Wikchamni Miwok (S outhern S ierra) Zuni S eminole B ella C oola C ora Karok C ahuilla Heiltsuk C hipewyan C row Takelma Mixe (Coatlán) Maidu (Northeast) Tarao Nyamkad Manobo (Western Bukidnon) C hantyal C a mling Alune T hulung Naga (Tangkhul) Dimasa Ma'anyan Mor B a li-V itu Nakanai Banoni P unu J iarong Lou Mae Mekeo Tungak E rromangan C hin (T iddim) Dong S eediq Hiligaynon Kapampangan Maisin Newari (Dolakha) Hmar Bajau Dulong S ia r Taiof Saliba (in Papua New Guinea) Nancowry Maipure C a s hibo Cayap a Iquito T oba Gavião Waorani Amahuaca Dâw F ur Doyayo Haya Baka (in S udan) Nzakara Baka (in Cameroon) Acholi B idiya B a giro B ongo G ula (in C entral African R epublic) Dagbani Zayse Pare B ini Jarawa (in Nigeria) K onni R unga Busa Xhosa T a bwa S hilluk Mano S wa ti Laal S es otho Goemai Mofu-G udur Dhaasanac Ogbronuagum G imira Nabak Ndjé bbana Waskia Ngalakan Dumo Yuwaalaraay I'saka Hamtai Imonda K oita Barai Badimaya S entani Y ukulta G olin Iwaidja S ougb Wardaman Nankina B iri Ngiyambaa Mpur G eorgian Nivkh K umauni Icelandic Bashkir P ortugues e Ket Domari Nepali Ukrainian Khalaj Yukaghir (Tundra) Koya Wiyot Tonkawa Yokuts (Yaudanchi) Quileute P ipil C hinook (Lower) Haida Apache (Western) Yuchi Zoque (Copainalá) E udeve Tarahumara (Central) Acoma P uré pecha Zapotec (Yatzachi) W ichita Lacandó n Koasati Hakka Cham (Western) S is iqa Mangap-Mbula R ukai Nishi Gela Tagalog B odo T s ha ngla Longgu Lotha S obei Brao Lisu Nar-Phu W olio Leti Kadazan K hmu' Atayal S udest Lamaholot Iduna B uru P urki K okborok Irarutu Ladakhi P umi Achang L imbu Mono-Alu G a pa pa iwa Digaro Lewo S herp a B ribri Baure K oreguaje Tucano E s e E jja Guaraní Apinayé E mbera Cayuvava C ubeo P ila gá Tsafiki Mekens Trumai Araona Warekena Baré Huitoto (Murui) K ariri Mocoví Javaé Garí funa S hona Koegu Kresh Teso Didinga Arabic (Moroccan) Ik So K ara (in C entral African R epublic) C hichewa Mbara Ijo (Kolokuma) Anywa Beja Kikuyu Hunde //Ani Bari Kunama Kwangali B us hoong Mondunga Samba Leko Mandinka (Gambian) Mumuye C ha ha Bambara K irma Oromo (Waata) Nup e Pero Arabic (Iraqi) Lagwan Dan Y ulu T igré Masalit Mehri Zulu Dogon Amharic Diyari Usan Ambulas Anggor S uena Mara Ngankikurungkurr Kugu Nganhcara Yale (Kosarek) G arrwa Arapesh Thaayorre Wembawemba G unbalang Daga Y idiny Hua Orokaiva Djambarrpuyngu Archi G uja ra ti Chechen Frisian K a s hmiri R emo T a jik K orku S erbian-C roatian Karachay-Balkar T a ta r Hunzib Avar G ondi Ubykh Diegueñ o (Mesa Grande) P aiute (Northern) Wapp o Osage K iowa U te Mandan S qua mis h T unica Hop i P omo (S outheas tern) B iloxi Oneida Maricop a Hup a T lingit S ius la w Ojibwa (E astern) Blackfoot S alinan C hatino (Y aitepec) Nevome C hitimacha B ontok Kaliai-Kove Nehan Ifugao (Batad) P aulohi Hayu Garo Darmiya Naga (Mao) B a lti K innauri Meithei Lepcha Patep T imugon Halia R oviana Arosi Paamese P a tta ni C ha ng Ngad'a Samoan Tawala Mien R apanui T oba tiT a kia Agta (C entral) Nocte K okota Urak Lawoi' S inaugoro Jabê m Athpare Teop Sakao Manadonese P uluwat R etuarã C a rib S hiriana Warao Canela-Krahô S hipibo-K onibo Arawak Mapudungun Karó (Arára) Tacana P alikur P irahã Huitoto (Minica) Ika Aymara Carib (De'kwana) T eribe Ngä bere Páez E we Mbay Ndut P okot Me'en Kana Uldeme E wondo Fyem Luvale Mooré T ubu Murle Mbum L inda Ngoni Kera Lango S wa hili Pa'a Korana G rebo Musgu Akan Vai Gbeya Bossangoa J ukun Mende Arabic (Modern S tandard) R unyankore Noon Ndonga Lamang Ngiti Malgwa Diola-F ogny Nuer Temne Iraqw E ngenni Chai J u|'hoan Lele Luo Angas Y oruba H a ta m Una R umu Dyirbal B ilua Murrinh-P atha P oko-R awo Walman Wambon Nunggubuyu Lavukaleve Nasioi Martuthunira K oiali (Mountain) Y indjibarndi K obon Tauya K ombai Kewa Kayardild Wahgi Au Mangarrayi Yagaria Maranungku B hojpuri K olami P a s hto Yukaghir (Kolyma) Ingush Itelmen C ornis h Albanian Dutch Karakalpak K uvi R omani (Welsh) Italian K urdis h (C entral) Czech Wakhi Dagur Uzbek B reton C hukchi Ainu Mangghuer S hina Zoque (Chimalapa) Lakhota C hemehuevi Nahuatl (Michoacán) Nahuatl (Huasteca) C omanche Mixtec (Yosondú a) Zapotec (Mitla) Sarcee G uarijí o Ocuilteco Totonac (Xicotepec de Juárez) Mixtec (Jicaltepec) Mis kito Luiseñ o Hawaiian Cè muhî Akha Angami Mikir S ema Bai Maleu K ilivila S ons orol-T obi T ondano Palaung S tieng Nicobarese (Car) Byansi Semelai S ikkimese Chin (Mara) K ha ling Tongan Tukang Besi Mis ing Tamang Tibetan (Modern Literary) Ao Kele Karen (Pwo) Mon Arop-Lokep C hin (S iyin) B uma S re Minangkabau E ngga no Drehu Quechua (Huallaga) Apalaí Guajajara G oajiro S irionó Kamaiurá Abipó n R a ma B erber (F iguig) Pä ri B a girmi Nandi Arabic (S yrian) Majang J ur Mö dö B imoba B urunge K oromfe Mauka Gwari S upyire Maba Neo-Aramaic (Arbel Jewish) Maninka (Western) Maasai Margi Nkonya K rongo Noni Miya Nubian (Dongolese) Lunda Izi Tunen Kanakuru Fula (Cameroonian) Moru Lendu Duka Moro B irom Gokana S omali Hebrew (Modern) G ude Berber (Middle Atlas) Turkana Tera C optic K oyraboro S enni B erber (R if) Igbo Tennet T iwi K uot B ininj G un-W ok West Makian Alamblak T idore Sare S iroi Ungarinjin S a hu Namia Asmat O lo Khalkha Mansi R omanian Kannada Norwegian Hungarian E venki T elugu Korean P a nja bi Latvian Urdu P olis h Russian Hindi Mundari Lithuanian Tsova-Tush Burushaski Lezgian Irish Greek (Modern) Persian Marathi Jakaltek Yaqui Nahuatl (Tetelcingo) Nisgha C hinantec (Quiotepec) Greenlandic (West) Zapotec (Isthmus) S eri Tiipay (Jamul) S lave Nez Perce Trique (Copala) Menomini O'odham C herokee C hontal Maya Tarahumara (Western) Kutenai Tlapanec Tepehuan (Northern) Makah Mam Khasi T boli G urung Kayah Li (Eastern) Khmer K iriba ti Nias Mandarin Ambai K airiru K aulong Batak (Toba) Lao Yapese Nuaulu T inrin Newari (Kathmandu) Niuean Hmong Njua Mamanwa Vietnamese Lahu Lai S edang Mokilese Mizo Muna C hamorro Apatani Maru Lalo Karen (Bwe) R awang T a hitia n Malagasy S undanese Wichí Resí garo Cocama Apurinã Hixkaryana Wari' Urubú -Kaapor Barasano S anuma Yagua Paumarí Tariana Macushi Epena Pedee Mupun Lugbara Nkore-Kiga Ngizim Dinka K is i K hoekhoe K oyra C hiini Arabic (E gyptian) Kanuri S a ngo Hausa W olof K arimojong Ma'di O bolo B uduma Oromo (Harar) Adioukrou Arabic (Gulf) Ngambay Maung Awtuw Abun Maybrat S ulka Amele K woma German French S pa nis h E nglis h Welsh Lamani Armenian (Western) Turkish B ulgarian S wedis h Japanese Basque Danish Tamil F innis h Gaelic (Scots) Abkhaz Chuvash E s tonia n Mixtec (Peñ oles) Otomí(Mezquital) C hatino (S ierra Occidental) Mixtec (C halcatongo) Chinantec (Palantla) Mixtec (Ocotepec) C hoctaw Tü mpis a S hos hone Tsimshian (Coast) T z utujil Coos (Hanis) Navajo Huastec K la ma th Chinantec (Lealao) C hinantec (C omaltepec) Y urok Ndyuka Karen (Sgaw) Kosraean Kham Nung (in Vietnam) Indonesian R otuman Iaai Manggarai J ingpho P ohnpeian Anejom Burmese Thai Tigak Iban Acehnese Woleaian Lampung F ijia n Bawm Tolai Taba T etun Palauan Cantonese Lenakel Batak (Karo) Loniu S io Gumawana F utuna-Aniwa Chrau Maori Quechua (Imbabura) Awa Pit Word Order Language-Feature Network P urple: Africa Green: Australia-New Guinea Blue: Eurasia R ed: North America Yellow: S outh-E ast Asia and Oceania Orange: S outh America • Australia-New Guinea and SE Asia and Oceania are the most internally similar • SE Asia and Oceania languages are typical with respect to languages of the rest of the world • South American languages are not internally linguistically similar, but their distribution on the network is not uniform P urple: Africa Green: Australia-New Guinea Blue: Eurasia R ed: North America Yellow: S outh-E ast Asia and Oceania Orange: S outh America