Data Citation: The State of the Art in Linguistics Lauren Gawne(1), Andrea Berez-Kroeker(2), Barbara F. Kelly(3) & Tyler Heston(2) 1st Workshop on Data Citation and Attribution in Linguistics Boulder CO September 18 2015 (1) SOAS, University of London, (2) The University of Hawaii, (3) The University of Melbourne View these slides! bit. ly/DataCitationSOTA
53
Embed
Data Citation: The State of the Art in Linguistics · Language 33 LANG Linguistics of the Tibeto-Burman Area 18 LTBA Natural Language and Linguistic Theory 32 NLLT Oceanic Linguistics
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Data Citation: The State of the Art in Linguistics
Lauren Gawne(1), Andrea Berez-Kroeker(2), Barbara F. Kelly(3) & Tyler Heston(2)
1st Workshop on Data Citation and Attribution in LinguisticsBoulder CO September 18 2015
(1) SOAS, University of London, (2) The University of Hawaii, (3) The University of Melbourne
Developing Standards for Data Citation and AttributionWorkshop aim: to develop and promote standards for data citation and attribution for linguistic data sets.
A data-driven linguistic science has the potential to provide substantiation of scientific claims by promoting attention to the care and structuring of language data.
Our studyWe examined:● 100 books (50 published grammars, 50 dissertations)
○ Variety of publishers, languages, institutions, supervisors
● 271 journal articles from 9 journals○ Range of areal foci, linguistic subfields, theoretical persuasions
● Published 2003-2012 ○ 5 years after Himmelmann 1998:164
“[Language] documentation […] will ensure that the collection and presentation of primary data receive the theoretical and practical attention they deserve.”
● Methodological variables:○ participants, data collection equipment, data analysis
tools/software, time spent collecting data (see bit.ly/GoodMethods)
● Data-related variables:○ 1. Source of data○ 2. Data genre analyzed (linguistic genre)○ 3. Where data is now○ 4. Citation conventions used to reference data, if any
● Methodological variables:○ participants, data collection equipment, data analysis
tools/software, time spent collecting data (see bit.ly/GoodMethods)
● Data-related variables:○ 1. Source of data○ 2. Data genre analyzed (linguistic genre)○ 3. Where data is now○ 4. Citation conventions used to reference data, if any
3. Where the data is now● ARCH: archived in institutional repository● PUBD: published● HERE: article contains the primary data● HERESUMMARY: data summarized in the article (stats,
graphs, tables)● ONL: online (website or other non-archive)● UNST: location of data not stated
Eight Deadly Sins of Web 2.0 Start-Ups […] Happinessless: Your start up has no future if you are not happy.(http://www.slideshare.net/imootee/eight-deadly-sins-of-
The reference number (here ref 07032.031) indicates the name of the file the sentence is extracted from (07032) and the line inside the text where the sentence appears (031).
“...one participant wondered why she felt the need to point out the importance of citing the source of each example when presenting it in an academic publication. This seemed obvious to those present…”
Ruth Singer, Melbourne LIP “Grammar writing: where are we now?” http://www.paradisec.org.au/blog/2015/02/grammar-writing-where-are-we-now/
ReferencesBerez, Andrea. 2014. “Reproducible research in descriptive linguistics: Integrating archiving and citation into the postgraduate curriculum at the University of Hawai‘i at Manoa.” In Amanda Harris, Nick Thieberger & Linda Barwick (eds.), Research, records, and responsibility: Ten years of the Pacific and Regional Archive for Digital Sources in Endangered Cultures. Sydney: University of Sydney Press.
Berez, Andrea, Lauren Gawne, Barbara Kelly and Tyler Heston. In prep. Citation and transparency in descriptive linguistics.
Bird, Steven & Gary Simons. 2003. Seven dimensions of portability for language documentation and description. Language 79(3): 557-582.
Bowern, Claire. 2008. Linguistic fieldwork: a practical guide. Basingstoke [England] ; New York: Palgrave Macmillan.
Chelliah, Shobhana L., and Willem J. De Reuse. 2011. Handbook of descriptive linguistic fieldwork. London: Springer.
Crowley, Terry. 2007. Field linguistics: a beginner's guide. Edited by Nicholas Thieberger, Oxford linguistics. Oxford ; New York: Oxford University Press.
Gippert, Jost, Nikolaus P. Himmelmann, & Ulrike Mosel. 2006. Essentials of language documentation. Berlin: Mouton de Gruyter.
Himmelmann, Nikolaus P. 1998. "Documentary and descriptive linguistics." Linguistics no. 36:161–195.
Himmelmann, Nikolaus P. 2006. “Language documentation: What is it good for?” In Jost Gippert, Nikolaus P. Himmelmann, & Ulrike Mosel (eds.). Essentials of language documentation, 1-30. Berlin: Mouton de Gruyter.
Nakayama, Toshihide, and Keren Rice (eds). 2014. The Art and Practice of Grammar Writing. Vol. 8, Language Documentation & Conservation Special Publication. Honolulu: University of Hawaii Press.
Pawley, Andrew. 2014. "Grammar writing from a dissertation advisor’s perspective." In The Art and Practice of Grammar Writing, edited by Toshihide Nakayama and Keren Rice, 7-23. Honolulu: University of Hawaii Press.
Ring, Hiram. 2015. A Grammar of Pnar. PhD Thesis, Nanyang Technological University.
Thieberger, Nicholas. 2009. “Steps toward a grammar embedded in data” In Patricia Epps and Alexandre Arkhipov (eds.) New Challenges in Typology: Transcending the Borders and Refining the Distinctions, 389-408. Berlin; New York, NY : Mouton de Gruyter.
Thieberger, Nicholas. 2012. The Oxford handbook of linguistic fieldwork. Oxford: Oxford University Press.
Woodbury, Anthony C. 2011. Language documentation. In Austin & Sallabank 2011, 159-186.
Works from study included hereBaker, Mark C., Roberto Aranovich & Lucia A. Golluscio. 2005. Two types of syntactic noun incorporation: Noun incorporation in Mapudungun and its typological implications. Language 81(1): 138-176.Blevins, Juliette. 2005. Origins of northern Costanoan ʃak:en 'six': A reconsideration of senary counting in Utian. IJAL 71(1): 87-101.Brown, Becky. 2003. Code convergent borrowing in Louisiana French. J Socio 7(1): 3-23.Chand, Vineeta. 2011. Elite positionings toward Hindi: Language policies, political stances and language competence in India. J Socio 15(1): 6-35.Davis, Henry. 2005. On the syntax and semantics of negation in Salish. IJAL 71(1): 1-55.Ewing, Michael C. 2005. Hierarchical constituency in conversational language: The case of Cirebon Javanese. SiL 29(1): 89-112.Guérin, V. M. (2008). Discovering Mavea: Grammar, texts, and lexicon. PhD dissertation. Honolulu: University of Hawaii. Huang, S., & Tanangkingsing, M. 2011. A Discourse Explanation of the Transitivity Phenomena in Kavalan, Squliq, and Tsou. Oceanic Linguistics, 50(1), 93–119.Mattina, Nancy. 2006. Determiner phrases in Moses-Columbia Salish. IJAL 72(1): 97-134.Mauri, Caterina. 2008. The irreality of alternatives: Toward a typology of disjunction. SiL 32(1): 22-55.Maddieson, Ian, Heriberto Avelino, & Loretta O'Connor. 2009. The phonetic structures of Oaxaca Chontal. IJAL 75(1): 69-101.Plag, Ingo & Harald Baayen. 2009. Suffix ordering and morphological processing. Language 85(1): 109-152.Post, Mark W. 2007. Grammaticalization and compounding in Thai and Chinese: A text frequency approach. SiL 31(1): 117-175.Schadeberg, T. C., & Kossmann, M. 2010. Participant reference in the Ebang verbal complex (Heiban, Kordofanian). Journal of African Languages and Linguistics, 31(1). Takara, Nobutaka. 2012. The weight of head nouns in noun-modifying constructions in conversational Japanese. SiL 36(1): 33-72.Yasugi, Yoshiho. 2005. Fronting of nondirect arguments and adverbial focus marking on the verb in Classical Yucatec. IJAL 71(1): 56-86.Zanuttini, Raffaella. 2008. Encoding the addressee in the syntax: Evindence from English imperative subjects. NLLT 26(1): 185-218.
Special thanks to the National Science Foundation, The University of Melbourne library staff, The University of Melbourne and NTU, Singapore where Lauren worked on earlier stages of this project