An Analysis of a Collection of Fairy Tales by the Brothers Grimm Matteo Starri MADH – 2007-2008
Dec 24, 2014
2
Topics to be Covered
• Introduction to the text itself
• Tools used for the research
• Possible patterns detectable with these
• Possible interaction of the patterns
3
The Tales Collected by the Grimms
• Kinder- und Hausmärchen
• Literally: Children's and Household Tales, commonly known as Grimms’ Fairy Tales
4
The Tales Collected by the Grimms
• 1st Edition: 1812-1814, 2 vols., 156 tales
• 2nd Edition: 1819-1822, 3 vols., 170 tales
• …
• 7th Edition: 1857, 211 tales
5
The Tales Collected by the Grimms
• Continuous additions and subtractions throughout the editions
• “Small Edition” in 1825, 50 tales, for child readers
6
Size of the corpus
• 64 tales (the file does not say why this number, or why these particular tales)
• 8,097 paragraphs
• 100,761 words counted by Word (102,962 by MonoConc Pro 2.2)
(the introductory part of the original file and the titles list have been excluded)
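The source does not explain why Word and MonoConc Pro report different totals. One plausible reason, offered here only as an assumption, is that the two programs split text into words differently. The sketch below uses two hypothetical counting rules (neither is claimed to be what either program actually does) to show how the same sentence can yield different word counts.

```python
import re

def count_whitespace(text):
    # Rule A (assumed): a word is any run of characters between spaces.
    return len(text.split())

def count_alphabetic(text):
    # Rule B (assumed): a word is any run of letters, so hyphens and
    # apostrophes split a form into several words.
    return len(re.findall(r"[A-Za-z]+", text))

# Invented sample sentence, not taken from the corpus.
sample = "The step-mother said: 'Well-a-day!'"
print(count_whitespace(sample), count_alphabetic(sample))
```

On a corpus of about 100,000 words, even a small share of hyphenated or apostrophized forms could account for a gap of a few thousand words.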
7
Complete list of the tales included (file screenshots)
8
The tool used
• MonoConc Pro 2.2, KWIC-based program by Michael Barlow
• Corpus loaded as .txt file, modified from the original downloaded from the Project Gutenberg website (to cut heading and to avoid repetitions in titles) [x] [screenshot]
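For readers without access to MonoConc Pro, the idea of a KWIC (keyword in context) display can be sketched in a few lines of Python. This is a minimal illustration, not a reimplementation of the program: the function name `kwic`, the `width` parameter, and the sample sentence are all assumptions made for the example.

```python
import re

def kwic(text, keyword, width=30):
    """Return one line per occurrence of keyword, with context on each side."""
    # Collapse line breaks and repeated spaces so context windows stay readable.
    text = " ".join(text.split())
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    lines = []
    for m in pattern.finditer(text):
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context so the keyword forms a column.
        lines.append(f"{left:>{width}} [{m.group(0)}] {right}")
    return lines

# Invented sample text, not taken from the corpus.
sample = "The king had two sons. The two brothers went out, and the king waited."
for line in kwic(sample, "two", width=20):
    print(line)
```

To run it on the real corpus, the prepared .txt file would be read into a string and passed in place of `sample`.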
9
Preliminary research
• Frequency list, sorted in frequency order
• Q: What does it say at first glance?
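A frequency list of this kind can be approximated with the Python standard library. The sketch below is only an illustration under stated assumptions: the tokenization rule (lowercased runs of letters, with an optional apostrophe part) and the name `frequency_list` are choices made here, not MonoConc Pro's behaviour.

```python
import re
from collections import Counter

def frequency_list(text):
    """Return (word, count) pairs sorted from most to least frequent."""
    # Assumed tokenization: lowercase letters, optionally with one apostrophe part.
    tokens = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    return Counter(tokens).most_common()

# Invented sample sentence, not taken from the corpus.
sample = "The king said to the two sons: the two of you shall go."
for word, count in frequency_list(sample)[:3]:
    print(word, count)
```

As in the screenshots, function words such as "the" rise to the top, so content words like king or two only stand out once the list is read further down.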
10
Frequency list (first four screenshots, 120 most frequent words)
11
Preliminary research
• Past tenses
• Speech verbs
• Recurring types of characters
All these features are typical of the genre
12
Preliminary research and delineation of possible patterns
• That two
• Two is not the plural of a/an/one
• So why is two used so often?
13
Preliminary research and delineation of possible patterns
• King is the most frequent noun, man is the second, and father is the third
• Does this tell us anything?
• Are these words related?
14
First pattern: numbers
• Two occurs 149 times
• What about other cardinals?
15
First pattern: numbers
• Occurrences generally decrease as numbers get higher
• Clear exceptions: seven and twelve
• But these are concentrated in very few tales
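The observation that seven and twelve are frequent overall but concentrated in a few tales amounts to comparing a total count with a count of tales in which the word appears. A minimal sketch of that check follows; it assumes the corpus has already been split into one text per tale, and the `tales` dictionary below is invented sample data, not the real corpus.

```python
import re

CARDINALS = ["two", "three", "four", "five", "six", "seven", "twelve"]

def cardinal_dispersion(tales):
    """Map each cardinal to (total occurrences, number of tales containing it)."""
    result = {}
    for number in CARDINALS:
        pattern = re.compile(r"\b" + number + r"\b", re.IGNORECASE)
        per_tale = [len(pattern.findall(text)) for text in tales.values()]
        total = sum(per_tale)
        spread = sum(1 for n in per_tale if n > 0)
        result[number] = (total, spread)
    return result

# Invented sample data: title -> text.
tales = {
    "Tale A": "Seven ravens flew off. The seven brothers were gone for seven years.",
    "Tale B": "Two sons and two daughters lived there.",
    "Tale C": "The two brothers met three giants.",
}
for number, (total, spread) in cardinal_dispersion(tales).items():
    print(number, total, spread)
```

A high total paired with a low tale count is the signature of a word that is frequent only because a few tales repeat it.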
16
First pattern: numbers
• The Wolf and the Seven Little Kids, The Seven Ravens, Snowdrop, The Valiant Little Tailor
• The Twelve Dancing Princesses, The Twelve Huntsmen
• In these tales the number is often used as part of the name of the characters, as in the titles
17
First pattern: numbers
• Two and three are the ones occurring most often
• Relatively often associated with characters
• In these cases, though, they sometimes delineate a single “actor”.
18
Related patterns: numbers and family relations
• Out of 143 occurrences, in 25 cases two is associated with family relations
• Children (10), brothers (7), daughters (4), sisters (2), [King’s] sons (1), sons (1)
• Children are given positive or neutral attributes, while brothers are portrayed negatively (6 times out of 7)
19
Related patterns: numbers and family relations
• Two daughters (4) are either both bad (2), or one good and one bad (2)
• When they differ, they are stepsisters: one is ugly but loved because she is the parents' own child, while the beautiful one is hated (by evil parents) because she is a stepdaughter
• When both are bad, they are the daughters of a new wife, set against a third daughter who is the father's own (Cinderella)
20
Related patterns: numbers and family relations
Frequency collocation for two (words occurring once or twice are not displayed)
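A collocation table like the one in the screenshot can be approximated by counting the word that immediately follows the node word and hiding rare items. The sketch below is an assumption-laden illustration: it looks only one position to the right, uses a simple letters-only tokenization, and applies a `min_count` of 3 to mirror the rule that words occurring once or twice are not displayed. The sample text is invented.

```python
import re
from collections import Counter

def right_collocates(text, node, min_count=3):
    """Count words directly following node; keep those seen at least min_count times."""
    tokens = re.findall(r"[a-z]+", text.lower())
    following = Counter(
        tokens[i + 1] for i in range(len(tokens) - 1) if tokens[i] == node
    )
    return [(w, c) for w, c in following.most_common() if c >= min_count]

# Invented sample text, built so that some collocates fall below the threshold.
sample = ("two brothers " * 3) + ("two children " * 4) + "two ravens two sisters"
print(right_collocates(sample, "two"))
```

A one-word window misses cases such as "two little children", so a wider span would be needed to recover every family-relation collocate.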
21
Related patterns: numbers and family relations
• Out of 96 occurrences, three is associated with family relations in only 6 cases
• Daughters (3), sons (2), brothers (1)
• The daughters always act as a single actor, while among sons/brothers the third one (the youngest) is represented as good, in contrast to the other two
22
Related patterns: numbers and family relations
Frequency collocation for three (words occurring once or twice are not displayed)
23
First pattern: numbers
• Four (32) refers to brothers only 4 times
• Seven (52) is not used with family relations, but often marks time: years (5), long years (3)
24
First pattern: numbers
• Is there any similarity in the use of cardinals and ordinals?
25
First pattern: numbers
• Ordinals are mainly used in connection with people and time-related expressions
• Second (45): son (6), time (4)
• Third (50): day, time, night (8), brother (2), son (1)
26
First pattern: numbers
• Ordinals can be grouped in terms of preservation or evolution in the narration
• Example: when second is used in a group of more than two, it usually marks a repetition of previous events, or slight changes, while it is often the third element (a son, a brother, or a day/night passed) that develops the story
27
First pattern: numbers
• Fifth (9) is equally used to preserve and evolve
• Sixth (3) is always used between a fifth element and a seventh one
• Seventh (6) usually brings innovation
• Ninth (1), used with day, evolves the story after 8 similar days
28
Second pattern: family relations
• Father (153) is the most frequent word denoting a family connection between characters
• Mother follows at 140
• Both are mainly preceded by possessives or positive attributes
29
Second pattern: family relations
30
Second pattern: family relations
• Characters related by family are mainly given positive attributes
• Dear/dearest is used 29 times, at every level
• Relations between siblings are usually given age/order attributes: elder/eldest (6), little (8), second (3), youngest (2)
31
Second pattern: family relations
• Child/son/daughter are mainly given positive attributes, while their plurals mainly take ordering attributes
• Parents usually set the action in motion, and their children then develop it
• This is particularly true for kings (and, to a lesser extent, queens), who mainly speak and rarely act, while princes and princesses do act
32
Second pattern: family relations
Collocation for king (274 occurrences)
33
Second pattern: family relations
• “Improper” relatives (stepmothers, stepsisters and stepdaughters) are not always explicitly given negative attributes, but they always act as evil characters
• Most relations span two levels (parents-children). Three-level relations are very few and never characterized
• There are no uncles, aunts, nephews or nieces
34
Overall Results
• Relatively little information about the characters can be gained from the immediate context of the word; understanding them clearly requires moving from the text to the search results, back to the text, and so on
• Animal characters are frequent but were largely left out of the study because they do not follow the human relational scheme
35
Overall Results
• No clear scheme was identified for preservation/evolution in the narration of events. More data and better statistical tools are needed
36
Possible Direction for Further Study
• A database could be built, for example, to relate characters, track their actions, and draw clear parallels between tales
• More text (i.e. all of the KHM) could be marked up and put in a database, then analysed and compared with tales from other traditions to find common patterns and influences (from French tales, in this case), and perhaps be used as a reference for worldwide folk studies
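One possible shape for such a database is sketched below with Python's built-in sqlite3 module. Everything in it is hypothetical: the table names (`tale`, `tale_character`, `relation`), the columns, and the `attitude` labels are illustrative choices, and the few rows only encode the Cinderella pattern described earlier (two bad stepsisters set against a third daughter).

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.executescript("""
CREATE TABLE tale (id INTEGER PRIMARY KEY, title TEXT NOT NULL);
CREATE TABLE tale_character (
    id INTEGER PRIMARY KEY,
    tale_id INTEGER NOT NULL REFERENCES tale(id),
    label TEXT NOT NULL,
    attitude TEXT              -- hypothetical coding: 'good', 'bad', 'neutral'
);
CREATE TABLE relation (
    from_id INTEGER NOT NULL REFERENCES tale_character(id),
    to_id INTEGER NOT NULL REFERENCES tale_character(id),
    kind TEXT NOT NULL         -- e.g. 'stepsister', 'father', 'brother'
);
""")

conn.execute("INSERT INTO tale (id, title) VALUES (1, 'Cinderella')")
conn.executemany(
    "INSERT INTO tale_character (id, tale_id, label, attitude) VALUES (?, ?, ?, ?)",
    [(1, 1, "Cinderella", "good"),
     (2, 1, "first stepsister", "bad"),
     (3, 1, "second stepsister", "bad")],
)
conn.executemany(
    "INSERT INTO relation (from_id, to_id, kind) VALUES (?, ?, ?)",
    [(2, 1, "stepsister"), (3, 1, "stepsister")],
)

# How many 'bad' characters stand in each kind of relation, per tale?
rows = conn.execute("""
    SELECT t.title, r.kind, COUNT(*)
    FROM relation r
    JOIN tale_character c ON c.id = r.from_id
    JOIN tale t ON t.id = c.tale_id
    WHERE c.attitude = 'bad'
    GROUP BY t.title, r.kind
""").fetchall()
print(rows)
```

Once many tales are coded this way, the same query would make it possible to compare, for instance, how often stepsisters or brothers are coded as bad across the whole collection.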