Top Banner
Sharing linguistic multi-media re Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands
39

Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Dec 27, 2015

Download

Documents

Dwight Bell
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing linguistic multi-media resources

Jacquelijn Ringersma

Paul Trilsbeek

Max Planck Institute for Psycholinguistics

Nijmegen, The Netherlands

Page 2: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Max Planck Institute for psycholinguistics

Max Planck Gesellschaft

78 research institutes (Germany)

3 outside Germany:

2 Italy (art)

1 The Netherlands (psycholinguistics)

The study of mental processes involved in language production, language comprehension and language acquisition, as well as the relation between language, thought, and culture

Page 3: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Documenting (endangered) languages

Creation of a representative and long lasting, multipurpose record of natural languages

It contributes to maintain, consolidate or revitalize endangered languages and thus safeguards the full range of their uses …

and it also

contributes to the description of cultural elements of a language community

Page 4: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Documenting (endangered) languages

Audio resources: represent spoken language

Video resources: information on the socio-linguistic environment

Enrichments: Annotations, transcriptions, translations, lexica

Page 5: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

Where is the data stored?

Digital (online) archives:

DoBeS (MPI-archive), AILLA (Austin), Paradise© (Sydney)

Page 6: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Archive for linguistic resources (MPI)

Archive for linguistic resources

Different types of linguistic material: Endangered languages archive (DoBeS) MPI language documentation corporaExternal corpora (Carib, Narrangansett, Slavonic etc.)

Total amount of data in the archiveMore than 230.000 objects, 25 Tb datadigitized audio and videoimagesannotations

Organization:Metadata descriptions, data base

Page 7: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Archive for linguistic resources (DoBeS)

Page 8: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Multimedia Lexicon

Typed Relations within the Lexicon

Annotated Media

Described Corpus

Archive for linguistic resources (MPI)

Photos

Page 9: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

Issues in the access debate

(Culturally) sensitive data

Ownership

Research purposes

National and institutional regulations

Code of conducts

Specific groups or individual users have specific access rules to resources

Who is the data for?

Collector (team) - researcher

Colleague researchers

General public – education, information

Speech communities – knowledge sharing, education, revitalization etc.

Page 10: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 11: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 12: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 13: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 14: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 15: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

Page 16: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

Page 17: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

Page 18: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

Page 19: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

Page 20: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

Page 21: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

Collector (team) - researcherColleague researchersTrained general public – education, information

Page 22: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Page 23: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Page 24: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Page 25: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Page 26: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Page 27: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Colleague researchersGeneral public – education, information

Page 28: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

5. Lexicon and conceptual spaces

Page 29: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

LEXUS - Lexicon tool

LEXUS

Web based lexicon tool

Word lists and detailed views of information in the lexical entries

Linking of multi-media fragments (images, video, sound files)

Linking of multi-media fragments stored in digital archives

Toolbox/XML compatibility (import and export)

Page 30: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

ViCoS

Page 31: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

LEXUS - Lexicon tool

LEXUS

Web based lexicon tool

Word lists and detailed views of information in the lexical entries

Linking of multi-media fragments (images, video, sound files)

Linking of multi-media fragments stored in digital archives

Toolbox/XML compatibility (import and export)

Page 32: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

LEXUS - Lexicon tool

Page 33: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

ViCoS – Visualizing conceptual spaces

Conceptual spaces in multi media encyclopedia

Conventional paper dictionaries: network of meanings less visible

Paper dictionaries limited usefulness in language maintenance and language revival (Manning et al., 2000) Members of speech community prefer following semantic links of different semantic types (synonyms, antonyms, lexical, taxonomies)

Page 34: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Complement lexical spaces with ontological spaces

Allow users to construct a space of culturally relevant concepts

Concepts as centres for all sorts of information

relations to other concepts

anchored in the language to express them

linked to multimedia archive to describe them

Vizualizing Conceptual Spaces

ViCoS – Visualizing conceptual spaces

Page 35: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

Collector (team) researcherSpeech community

Page 36: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

Collector (team) researcherSpeech community

Page 37: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

Collector (team) researcherSpeech community

Page 38: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

1. Browsing

When represented in an organized manner (e.g. tree or by category)

2. Metadata search

3. Content search (only when users have access rights)

4. Geographic browsing

5. Lexicon and conceptual spaces

Collector (team) - researchersSpeech community members

Page 39: Sharing linguistic multi-media resources Jacquelijn Ringersma Paul Trilsbeek Max Planck Institute for Psycholinguistics Nijmegen, The Netherlands.

Sharing resources

How can the data be accessed?

Direct access to archive through: browse, metadata search

Access through content search

Collector (team) – researcherColleague researchersTrained general public

Geographic browsingColleague researchersGeneral public

Lexicon and conceptual spacesCollector (team) – researcherMembers of the speech community