Top Banner
DATA IN THE DIGITAL HUMANITIES DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges http:// methodologicalchallenges.group.shef.ac.uk/
7

DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

Jan 08, 2018

Download

Documents

Mary McDaniel

Data acquisition: 1.Most of the evidence base is pre- digital. Very little is ‘born digital’. 2.Data acquisition is a question of translation, representation and interpretation. 3.The methods we use either enable or inhibit research. 4.But, the process also develops intimate knowledge of the evidence.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

DATA IN THE DIGITAL HUMANITIESDATA IN THE DIGITAL HUMANITIES

Michael Pidd

26th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

http://methodologicalchallenges.group.shef.ac.uk/

Page 2: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

The data lifecycle in a typical digital humanities project:

1.Acquisition (e.g. digitisation)2.Processing (adding value)3.Analysis (and dissemination)

Data in the humanities is usually:

1. Small (discrete sources created by individuals).

2. Broad (many different types of sources have to be assembled).

3. Complex (because humans are not spreadsheets).

Rarely ‘Big’.

http://hridigital.shef.ac.uk@hridigital

Page 3: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

Data acquisition:

1. Most of the evidence base is pre-digital. Very little is ‘born digital’.

2. Data acquisition is a question of translation, representation and interpretation.

3. The methods we use either enable or inhibit research.

4. But, the process also develops intimate knowledge of the evidence.

Page 4: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

British Library NewspapersKeyword search for “pidd” gives 2,730 results…

Page 5: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

Data processing:

1. Metadata can be complex, reflecting the complexity of the data.

2. Metadata can be very specialised, limiting re-use.

3. When processed at scale, computational methods are a trade-off between through-put and accuracy.

Page 6: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

• Nominal record linkage using computational means to trace the lives of 90,000 people.• Record linkage across 45 separate datasets (some public, some commercial, all in different

formats and with different data models).• And most people have common names.

http://www.digitalpanopticon.org

Page 7: DATA IN THE DIGITAL HUMANITIES Michael Pidd 26 th November 2014, ICOSS, University of Sheffield. NatCen Seminar Series on Methodological Challenges

Analysing data:

Do data visualisations tell us anything that we do not already know?

Data visualisation is only as good as the data.

Data visualisation should reveal trends and anomalies, directing us to deeper readings of the evidence.