Prof. Dr. Klaus-Dirk Schmitz Cologne University of Applied Sciences Data Modelling: Part 2 TSS 2007 Cologne 1 Data Modelling Part 2: Modelling Principles Terminology Summer School - Cologne 16 - 20 July 2007 Klaus-Dirk Schmitz Institute for Information Management Faculty 03 University of Applied Sciences Cologne [email protected]K.-D. Schmitz, IIM, FH Köln Overview A little bit of theory again Data modelling (Data categories) Dependencies Modelling variances Concept orientation Term autonomy Data modelling in general: meta model Support by (ISO) standards
23
Embed
Data Modelling Part 2: Modelling Principles - TermNet · Data Modelling Part 2: Modelling Principles ... First comprehensive analysis of terminological ... Machine-readable terminology
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
1
Data Modelling Part 2:Modelling Principles
Terminology Summer School - Cologne16 - 20 July 2007
Klaus-Dirk SchmitzInstitute for Information ManagementFaculty 03University of Applied Sciences [email protected]
Data modelling in general: meta modelSupport by (ISO) standards
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
2
K.-D. Schmitz, IIM, FH Köln
Communication
“mouse”“mouse”
K.-D. Schmitz, IIM, FH Köln
Terminological triangle
“mouse”“mouse”
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
3
K.-D. Schmitz, IIM, FH Köln
Terminological triangle
objectterm
concept
designation
K.-D. Schmitz, IIM, FH Köln
Object
Any part of the perceivable or conceivable world
Objects may be material (e.g. mouse) or immaterial (e.g. magnetism)
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
4
K.-D. Schmitz, IIM, FH Köln
Concept
Unit of thinking made up of characteristicsthat are derived by categorizing objects having a number of identical properties (DIN)
Unit of knowledge created by a unique combination of characteristics (ISO)
Concepts are not bound to particular languages. They are, however, influenced by social or cultural background
K.-D. Schmitz, IIM, FH Köln
Term
Designation of a defined conceptin a special languageby a linguistic expression
Designation: Any representation of a concept
A term may consist of one or more words
“mouse”“mouse”
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
5
K.-D. Schmitz, IIM, FH Köln
Communication can be disturbed
“return key?”“return key?”
Synonymy
“enter key”“enter key”
K.-D. Schmitz, IIM, FH Köln
Communication can fail
“mouse”“mouse”
Homonymy / Polysemy
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
6
K.-D. Schmitz, IIM, FH Köln
Data modelling for terminology DBs
Important aspects:(selection of adequate data categories)modelling dependencymodelling varianceconcept orientationterm autonomy
Terminology science and terminology standards provides the adequate theory, principles and methods for data modelling
K.-D. Schmitz, IIM, FH Köln
Data categories
Data categories have not been discussed in detail in terminologytheory in the past
First approaches of describing“fields” of forms for recordingterminological data offline
Improvement for the descriptionof term bank structures
But no real definition of underlying data categories
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
7
K.-D. Schmitz, IIM, FH Köln
First comprehensive analysis of terminological data categories used in TMS for the preparation of ISO 12620
First standard for data categories: ISO 12620:1999
Data model for terminological data collections developed for terminology interchange (MARTIF): ISO 12200: 1999
Improved for the Terminology Markup Framework(TMF) in ISO 16642: 2003
Data categories
K.-D. Schmitz, IIM, FH Köln
Dependencies between data categories
ISO 12620:1999 provides a “simple hierarchy”of data categories• grammar = term-related:
grammar is dependent from term
In addition to this, much more dependencies exist and have to be taken into account
• source is dependent from definition• for additional definitions, additional sources are needed• the source of the definition has to be differentiated
from the source of the term or the context example
Modelling dependencies
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
8
K.-D. Schmitz, IIM, FH Köln
Although following ISO 12620, there are sometime more than one modellingsolution to implement the data category
Simple example:
a) gender: m. / f. / n.
b) masculine: yes / nofeminine: yes / noneuter: yes / no
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
11
K.-D. Schmitz, IIM, FH Köln
All terminological information belonging to one concept including all terms in all languages and all term-related and administrative data must be store in one terminological entry
concept = terminological entry
Concept orientation
K.-D. Schmitz, IIM, FH Köln
Many of the older term banks and TMS are more designed for term orientation
Modern TMS not only follow the concept approach but also support features for consistent concept entries (preventing “double entries”)
Concept orientation
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
12
K.-D. Schmitz, IIM, FH Köln
K.-D. Schmitz, IIM, FH Köln
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences
Data Modelling: Part 2TSS 2007 Cologne
13
K.-D. Schmitz, IIM, FH Köln
All terms belonging to one concept should be managed (in one terminological entry) as autonomous (repeatable) blocks of data categories without any preference for a specific term
Therefore all terms can be documented with the relevant term-related data categoriesTerm autonomy is necessary for the main term, all synonyms, all variants, and all short formsTerm autonomy is not explicitly discussed in theoretical literature
Term autonomy
LREC 2002, May 2002 K.-D. Schmitz, IIM, FH Köln
Conceptrepresented by ID-No. and/or classification / notation
Language 1 Language 2 Language 3 ...
Term 1+ AuxInfo
Term 2+ AuxInfo
Term 1+ AuxInfo
Term 2+ AuxInfo
Term 1+ AuxInfo
Term 3+ AuxInfo
Term autonomy
Prof. Dr. Klaus-Dirk SchmitzCologne University of Applied Sciences