Implementing a media archive - lessons learned

Post on 04-Aug-2015

Category:

Technology

FOUNDED 21 Dec. 2012

Not (only) a Broadcast Archive

CULTURAL HERITAGE

We don’t own collections

We provide services to 50+ organisations

BROADCAST CITY

ARCHIVES

SERVICE PROVIDER

STRONGER TOGETHER

Collectie Huis Van Alijn | © Huis Van Alijn

DIGITISATION ARCHIVING DISSEMINATION

DIGITISATION

HERITAGE + MEDIA

Type of material/format   Carriers (total)   Hours (total)   Heritage: share of carriers   Media: share of carriers
Film                      74,173             26,709          11%                           89%
Video analogue            150,389            228,773         26%                           74%
Video digital             225,578            156,189         5%                            95%
Audio analogue            165,883            78,370          36%                           64%
Audio digital             34,870             18,112          54%                           46%
TOTAL                     650,893            508,153         21%                           79%

INVENTORY ANALOGUE CARRIERS

FIRST DIGITIZATION WAVE

ALL DIGITIZATION WAVES

UP TO 450 TB/MONTH

WW I

Evergem's Yzerblad: komt uit de loopgrachten als ’t past (S.l. 1917 – 1918) Collectie Erfgoed- bibliotheek Hendrik Conscience | © Vlaamse Erfgoedbibliotheek – Foto: Stefan Tavernier

WW I newspaper project

ARCHIVING

Scalable, redundant preservation system

MAM system

DISSEMINATION

QUID PRO QUO

CPs

Education

Libraries

Research

TARGET AUDIENCE

EDUCATION

FROM ARCHIVE

TO CLASSROOM

ARCHIVE SYSTEM PROCUREMENT / INSTALLATION

BUDGET 2013-2014

EUR 11.8 million

TIME CONSTRAINT

Evaluation after 18 months

START JAN 1, 2013

(staff: 1)

Get and keep all customers on board

Start all services within 1 year

VIAA NOW

TEAM: 14 FTE

ARCHIVE: 4 people

March 2013 -

Greenfield!

lessons learned

May 2013 -

Greenfield!

Public Procurement

Surveys of users

UX designed according to their needs

Creates buy-in!

Requirements gathering

Workshops involving users

Users co-wrote our RFP

PRIOR NOTICE

Transparent Procurement Process

DETAILED ALLOTMENT REPORT (debrief)

EVALUATION JURY

Working with a jury

•  Jury consisted of
   •  Stakeholders (future users)
   •  International experts
   •  VIAA staff

•  Why?
   •  A balanced answer / evaluation
   •  Jury members add weight & credibility
   •  Again: user buy-in!

3 EU tenders - Timeline of procurement

•  June 2013: Prior notice
•  August 2: Candidates invited
•  September 12: Candidates selected
•  September 13: RFP published
•  October 17: Quotes received
•  November 6: Allotment
•  November 25: Final allotment
•  April 2014: MAM in production

Three copies (2 MAMs)

Archive system

Zeticon MediaHaven

Main MAM services

Multi-tenant MAM SYSTEM

1. IMPORT WORKFLOWS & INGEST REPORTING

2. MANAGEMENT
   - Transcoding (ffmpeg)
   - Partial retrieve
   - Metadata
   - Time-based annotation
   - Search (~SOLR)
   - Manual QC
   - Storage management

3. EXPORT WORKFLOW (OAI-PMH), EXIT PATH
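As an illustration of the OAI-PMH export path, the sketch below builds a standard `ListRecords` harvesting request. The verb and the `metadataPrefix`, `set` and `from` arguments come from the OAI-PMH protocol itself; the base URL and the set name are placeholders, since the slides do not give the real endpoint or how collections map to sets.

```python
from urllib.parse import urlencode

# Hypothetical endpoint: the slides do not state the real OAI-PMH base URL.
BASE_URL = "https://archive.example.org/oai"


def list_records_url(metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL.

    metadata_prefix: "oai_dc" (Dublin Core) must be supported by every repository.
    set_spec:        optional set to harvest (assumed here to map to one collection).
    from_date:       optional lower bound (YYYY-MM-DD) for selective harvesting.
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    if from_date:
        params["from"] = from_date
    return f"{BASE_URL}?{urlencode(params)}"


# Example: harvest everything in a made-up set changed since April 2014.
print(list_records_url(set_spec="cp-demo", from_date="2014-04-01"))
```

A harvester would fetch this URL and keep following the `resumptionToken` in each response until the list is complete.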

Processing capacity & ramp-up

Processing capacity: => up until 13 TB / day (and still scaling)
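To put the quoted figures side by side, the small calculation below converts 13 TB/day into a sustained transfer rate and a 30-day volume, and converts the 450 TB/month digitisation peak into a daily volume. It assumes decimal terabytes and a 30-day month; the slides specify neither.

```python
TB = 10**12  # decimal terabyte (assumption: the slides do not say TB vs TiB)
SECONDS_PER_DAY = 86_400


def sustained_rate_mb_per_s(tb_per_day):
    """Average throughput in MB/s needed to move tb_per_day every day."""
    return tb_per_day * TB / SECONDS_PER_DAY / 10**6


def monthly_volume_tb(tb_per_day, days=30):
    """Volume processed in a month at a constant daily rate."""
    return tb_per_day * days


def daily_volume_tb(tb_per_month, days=30):
    """Daily volume implied by a monthly total."""
    return tb_per_month / days


print(f"{sustained_rate_mb_per_s(13):.1f} MB/s sustained")
print(f"{monthly_volume_tb(13)} TB in 30 days")
print(f"{daily_volume_tb(450):.1f} TB/day at 450 TB/month")
```

Under these assumptions, 13 TB/day is roughly 150 MB/s around the clock and 390 TB in 30 days, while a full 450 TB month averages 15 TB/day.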

WORKFLOWS

Practical MAM use

•  We work along the lines of OAIS for defining
   •  Ingest process (SIP definition)
   •  Long term preservation (AIP)
   •  Dissemination (DIP)

•  Definition in a service agreement
   •  Practical agreement between VIAA and CP
   •  Usually for one collection (e.g. audio digitization)

Ingest workflow from digitisation

•  We use PREMIS for provenance
•  Every step in the process is recorded
   •  Agent: digitization firm, CP, VIAA, …
   •  Event: registration, carrier inspection, digitisation, encoding, …
   •  Each event has a time, an outcome and notes
   •  Object: analogue carrier or digital equivalent
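The recording of each step can be pictured as an append-only event log. The sketch below is loosely modelled on the PREMIS idea of events linked to an object and an agent; the class and field names are illustrative and are not the PREMIS schema or the MAM's actual data model.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class ProvenanceEvent:
    """One recorded step, loosely modelled on a PREMIS event (illustrative names)."""
    object_id: str   # analogue carrier or its digital equivalent
    event_type: str  # e.g. registration, carrier inspection, digitisation, encoding
    agent: str       # e.g. digitization firm, CP, VIAA
    outcome: str     # e.g. "success" or "failure"
    note: str = ""
    timestamp: datetime = field(default_factory=lambda: datetime.now(timezone.utc))


class ProvenanceLog:
    """Append-only list of events that can be queried per object."""

    def __init__(self):
        self._events = []

    def record(self, object_id, event_type, agent, outcome, note=""):
        event = ProvenanceEvent(object_id, event_type, agent, outcome, note)
        self._events.append(event)
        return event

    def history(self, object_id):
        """All events for one object, in the order they were recorded."""
        return [e for e in self._events if e.object_id == object_id]


# Made-up identifiers and notes, for illustration only.
log = ProvenanceLog()
log.record("carrier-0001", "registration", "CP", "success")
log.record("carrier-0001", "digitisation", "digitization firm", "success", "hypothetical note")
log.record("carrier-0002", "registration", "CP", "success")
print([e.event_type for e in log.history("carrier-0001")])
```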

Complete flow => more than 1 MAM

1. Registration of the carrier / AMS

AMS functions:
•  Analogue carrier registration
•  PID creation
•  Support the logistics process
•  Technical characteristics
•  Metadata from digitization

Persistent identifier creation

1 analogue carrier = 1 intellectual entity = 1 PID
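The one-to-one rule can be sketched as a registry that mints a PID the first time a carrier is registered and returns the same PID on every later registration. The slides do not describe how PIDs are actually generated, so the random UUID used here, the class name and the carrier barcodes are all assumptions.

```python
import uuid


class PidRegistry:
    """Hypothetical registry enforcing 1 analogue carrier = 1 PID."""

    def __init__(self):
        self._pid_by_carrier = {}

    def register(self, carrier_barcode):
        """Return the carrier's PID, minting one only on first registration."""
        if carrier_barcode not in self._pid_by_carrier:
            # Assumption: a random UUID stands in for the real PID scheme.
            self._pid_by_carrier[carrier_barcode] = uuid.uuid4().hex
        return self._pid_by_carrier[carrier_barcode]


registry = PidRegistry()
first = registry.register("BARCODE-0001")   # made-up barcode
again = registry.register("BARCODE-0001")   # re-registration of the same carrier
other = registry.register("BARCODE-0002")
print(first == again, first != other)
```

Making registration idempotent means a carrier that passes through the logistics process twice cannot end up with two identifiers.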

Persistent identifier

The PID is the key for keeping track of all events that have an impact on the digital object. We monitor the complete lifecycle of the digital object, from registration to dissemination.

2. Ingest validation

SIP contains
•  Essence (MXF)
•  Metadata (XML)
•  QC report (XML)

2a. Transfer validation

2b. SIP validation

2c. Storage

All metadata generated during the ingest process is stored as PREMIS.
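Steps 2a and 2b can be sketched as two small checks: a checksum comparison for the transfer, and a completeness and well-formedness check for the SIP. The slides do not say which checksum is used or how the files are named, so MD5 and the `<pid>.mxf`, `<pid>.xml`, `<pid>_qc.xml` naming are assumptions made for this example.

```python
import hashlib
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path


def file_md5(path):
    """Checksum a file in 1 MiB chunks so large essence files fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def validate_transfer(path, expected_md5):
    """2a: the received file must match the checksum supplied by the sender."""
    return file_md5(path) == expected_md5.lower()


def validate_sip(sip_dir, pid):
    """2b: essence, metadata and QC report must exist; XML must be well-formed."""
    sip_dir = Path(sip_dir)
    expected = {  # hypothetical naming convention
        "essence": f"{pid}.mxf",
        "metadata": f"{pid}.xml",
        "qc_report": f"{pid}_qc.xml",
    }
    problems = []
    for role, name in expected.items():
        path = sip_dir / name
        if not path.is_file():
            problems.append(f"missing {role}: {name}")
        elif path.suffix == ".xml":
            try:
                ET.parse(str(path))
            except ET.ParseError as exc:
                problems.append(f"{role} is not well-formed XML: {exc}")
    return problems


# Demo with a throwaway SIP that lacks its QC report.
with tempfile.TemporaryDirectory() as tmp:
    essence = Path(tmp) / "demo-pid.mxf"
    essence.write_bytes(b"not real MXF data")
    (Path(tmp) / "demo-pid.xml").write_text("<metadata><title>demo</title></metadata>")
    demo_transfer_ok = validate_transfer(essence, hashlib.md5(b"not real MXF data").hexdigest())
    demo_problems = validate_sip(tmp, "demo-pid")

print(demo_transfer_ok, demo_problems)
```

Returning a list of problems rather than a single boolean fits the ingest reporting described above, since each problem could be written out as its own event.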

MAM workflow-Ingest monitoring using PREMIS

Error handling

Error handling (Tableau report)

Step 3: Archived

 

How big is the VIAA archive?
Which MIME types do we have in the archive?
How many items do we archive per CP?

(Funny photo of the VIAA team and dashboard?)

Items Archived

•  539.6 TB
•  47,273 items

Size of the VIAA archive

•  539 TB
•  50,000 items

TB archived per CP
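The dashboard questions (archive size, MIME types, items and TB per CP) are all simple aggregations over the item records. The sketch below shows the idea on three made-up records; the field names, CP names and sizes are invented for illustration and are not VIAA data.

```python
from collections import Counter, defaultdict

# Made-up sample records; real figures would come from the MAM.
items = [
    {"cp": "CP A", "mime_type": "application/mxf", "size_bytes": 40 * 10**9},
    {"cp": "CP A", "mime_type": "audio/x-wav", "size_bytes": 2 * 10**9},
    {"cp": "CP B", "mime_type": "application/mxf", "size_bytes": 55 * 10**9},
]


def archive_stats(items):
    """Aggregate total size, MIME type counts, and items and TB per CP."""
    bytes_per_cp = defaultdict(int)
    for item in items:
        bytes_per_cp[item["cp"]] += item["size_bytes"]
    return {
        "total_tb": sum(bytes_per_cp.values()) / 10**12,
        "mime_types": Counter(item["mime_type"] for item in items),
        "items_per_cp": Counter(item["cp"] for item in items),
        "tb_per_cp": {cp: size / 10**12 for cp, size in bytes_per_cp.items()},
    }


stats = archive_stats(items)
print(stats["items_per_cp"])
print(stats["mime_types"].most_common(1))
```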

Step 4: Reuse / interaction

Road Ahead

•  Ingest of born-digital material
   •  Complex and many (50+) data sources
   •  Need for a ‘SIP creator’
   •  Realisation through integration with an enterprise service bus

•  Should we be a TDR?
   •  Building along those lines, looking into certification

CONCLUSIONS

•  MAM system
   •  Took a while to understand business needs
   •  Found a very flexible partner in Zeticon

•  Perfect MAM system?
   •  Flexibility through well-documented APIs
   •  Pluggable!
   •  Interface: usability, HTML5
   •  Standards: support for and keep up with PREMIS, RDF, DC:TERMS, OAI-PMH, …

THANKS!
