The Nordic Gene Bank, NGB, Alnarp, Sweden The Nordic Gene Bank, NGB, Alnarp, Sweden Sharing of germplasm data with Web Services February 20, 2006 FAO, Rome, Italy Dag Endresen The Nordic Gene Bank, IPGRI
Dec 06, 2014
The Nordic Gene Bank, NGB, Alnarp, SwedenThe Nordic Gene Bank, NGB, Alnarp, Sweden
Sharing of germplasm data with Web Services
February 20, 2006FAO, Rome, Italy
Dag EndresenThe Nordic Gene Bank, IPGRI
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 2
TOPICSTOPICS
Germplasm data
Data Standards Data exchange
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 3
Germplasm dataGermplasm data
The Germplasm data describe very similar data objects as the natural history museums and the botanical gardens.
Preserved reference collections, such as those in museums and herbaria.
Living collections, like botanical and zoological gardens, aquaria, seed banks, microbial strain cultures and tissue collections.
Data collections, from surveys of objects in the field, such as observations.
These collections have most of their attributes in common, although the terminology used to describe them may differ substantially.[http://www.bgbm.org/TDWG/CODATA/ABCD-Evolution.htm]
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 4
Data Standards
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 5
MCPDMCPD MMulti ulti CCrop rop PPassport assport DDescriptorsescriptors
The MCPD is designed to be compatible with the IPGRI crop specific descriptor lists and the FAO World Information and Early Warning System (WIEWS).
The MCPD descriptor list is compatible with ABCD (2.06).
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 6
IPGRI Crop Specific IPGRI Crop Specific DescriptorsDescriptors
The IPGRI crop descriptors (as well as other networks) expand the MCPD List to meet specific needs for these crops.
The International Union for the Protection of New Varieties of Plants (UPOV) maintains crop descriptors for protection of intellectual property right (since 1961).
The COMECON descriptor lists came even earlier, and was the result of a cooperation of the Eastern European Genebanks in PGR documentation (1949 –1999).
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 7
Taxonomic Database Working Taxonomic Database Working GroupGroup
Standards development and Standards development and maintenancemaintenance
Darwin Core 2 - Element definitions designed to support the sharing and integration of primary biodiversity data". [http://darwincore.calacademy.org/]
Access to Biological Collection Data (ABCD) 2.06 - An evolving comprehensive standard for the access to and exchange of data about specimens and observations (a.k.a. primary biodiversity data)“.[http://www.bgbm.org/TDWG/CODATA/Schema/]
ABCD 2.06 is compatible with MCPD.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 8
PGR sub-unit of ABCDPGR sub-unit of ABCD
PGR
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 9
Generation Challenge Generation Challenge ProgrammeProgramme
In the context of the GCP (Generation Challenge Programme), the GCP_Passport data exchange schema was developed.
Similar XML schema are under development for Phenotype (trait data) and Genotype
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 10
Biodiversity informatics
data exchange tools
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 11
Data Provider SoftwareData Provider Software
Distributed network of data providers retrieving structured data from multiple, distributed, heterogeneous databases across the Internet.
DiGIR, Distributed Generic Information Retrieval. [http://digir.net]
BioCASE, The Biological Collection Access Service for Europe.
[http://www.biocase.org/]
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 12
BioCASE data provider software has been implemented at (almost) all the CGIAR germplasm centers during the autumn of 2005.
Several genebanks have installed the GBIF web service technology. Nordic Gene Bank, IPK Gatersleben, IHAR (DiGIR), USDA GRIN, CGN, more to follow soon.
BioCASEBioCASEBioBiological logical CCollection ollection AAccess for ccess for EEuropeurope
[www.biocase.org/]
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 13
BioMOBYBioMOBY
BioMOBY is an international research project on methodologies for biological data representation, distribution, and discovery.
BioMOBY is chosen as the web service framework for the Generation Challenge Program[http://www.biomoby.org/]
Work is in progress to develop BioMOBY and BioCASE interoperability.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 14
Biodiversity informatics workflow
tools
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 15
WorkbenchWorkbench
Bioinformatics analyses often involve combining the use of databases and analysis programs which are linked in a specific order to form a workflow process.
Flow of data from one analytical step to another can be captured in a formal workflow language.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 16
Taverna workflowTaverna workflow
The Taverna Workbench allows users to construct complex analysis workflows from components located on both remote and local machines, run these workflows on their own data and visualize the results.
BioMOBY objects can be connected in a workflow.
[http://taverna.sourceforge.net/]
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 17
Web service
technology
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 18
Some web service keywordsSome web service keywords
Application-to-application
Platform independent
Programming language independent
Object model independent
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 19
Example of a service callExample of a service call
All exchanged data is formatted with XML tags.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 20
Example of a service Example of a service responseresponse
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 21
Data warehouse modelData warehouse model(Slide by Samy Gaiji, IPGRI)(Slide by Samy Gaiji, IPGRI)
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 22
Decentralized modelDecentralized model(Slide by Samy Gaiji, IPGRI)(Slide by Samy Gaiji, IPGRI)
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 23
Data flow from genebanks to Data flow from genebanks to EURISCO and ECCDBs EURISCO and ECCDBs
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 24
Decentralized modelDecentralized model
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 25
Demo Data PortalDemo Data Portal
A demo data portal was developed, providing live access to the BioCASE data providers.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 26
Germplasm data harvestGermplasm data harvest
We are now building data harvest methodologies for access to global germplasm data.
This is planned to build a Germplasm Clearing House Mechanism.
In cooperation with GBIF, which themselves harvest global biodiversity data from a similar approach.
Sharing of germplasm data, February 20, 2006, FAO, RomeSharing of germplasm data, February 20, 2006, FAO, Rome 27
Thank you for listening!