The Importance and Future Trends of Sharing Biodiversity Data Chau Chin Lin Taiwan Forestry Research Institute
Chris King 2011 Genotype Biodiersity: Tree of Life on the Planet
Vernacular (FR): Pyrale du maïs Vernacular (ES): Piral del maíz Vernacular (DE): Maiszünsler Diagnosis: Wingspan 26-30mm; sexually dimorphic;male: forewings ochreous to dark brown; female: forewings pale yellow; … Foodplant: Zea mais L Species: Ostrinia nubilalis (Hübner, 1796) Family: Pyralidae Order: Lepidoptera Class: Insecta Genus: Ostrinia Hübner, 1825 Vernacular (EN): European Corn-borer Family: Gramineae Taxonomic Names Collection:DGH Lepidoptera Record id:DGHEUR_ Country:France Coordinates:03.047˚E ˚N Date:28 June 2003 Collector:Donald Hobern Individuals: 3 Richness: Spatial /Temporal Observations Biotic Interactions Locus:AAL35331 Definition:acyl-CoA Z/E11 desaturase 1 mvpyattadg hpekdecfed... Sequence Data Average Rainfall Location: 48.82°N 2.29°E Jan Feb Mar Apr Abiotic Taxonomic Descriptions Pheromones of Ostrinia /pheronet/phlist/ostrinia.html Digital Literature and Web Resources Synonym: Pyralis nubilalis Hübner, 1796 Biodiversity: Information of Life
Challenges and Opportunities Scientific innovation has been called on to spur economic recovery and to inform sustainability. Data collection,curation, and access are central to all of these issues Science 311: , 2011
Annual Cumulative Worm et al., Science 2006 Data Informs the Lost of Biodiviersity
Data Provides Actions Making
Data Enhances Understanding of The Real World Understanding this disease requires knowledge of epidemiology, genetics, and transmission modes, along with their ecological contexts. Integrating ecologically pertinent data into the chain of information from the gene to the biosphere will significantly enhance our understanding of the natural world. Whitfield J Ape populations decimated by hunting and Ebola virus. Nature 422:551
All about Data
Observations/experiments the real world Data/Raw data/Dataset information Data Comes from Research of the Real World
Planning Problem Analysis and modeling The Traditional Paradigm of Data Data Collection Publications Raw Data
(Michener et al. 1997) Data Entropy Information Content Time Time of publication Specific details General details Accident Retirement or career change Death
Planning Problem Definition (Research Objectives) Analysis and modeling Collection Original Observations Publications Planning Selection and extraction Secondary Observations used data New Paradigm of Data
Data Cyberinstrstructure The Real World Synthesis Hubs and Nodes Fundamental Research Observatory Networks Collaboratories A Data-intensive Approach Information Resources
Breaking Spatial and Temporal Barriers
Integrating Heterogeneous Data
Data collecting Data Preserving and managing Adapting Cutting Edge Technologies Data transferring Data discovering, Integrating,analyzin g,visualizing
Providing Good Quality Data Available Online
Dealing with Data Flow Change
Interpret a pattern 1,000 x daily Interpret a number 10 x daily Dealing with Data Collecting Change
Dealing with Data Deluge
Raw data Information Knowledge Management, Archiving, & Curation Discovery, Retrieval Integrating, Analysis & Visualization Towards Automation of Data Processing
Metadata ? Metadata Is the Key to New Paradigm
Metadata Date (YYYYMMDD) Temp (°C) Precip. (mm) Obs. #1 Obs. #2 Obs. # Data What Is Metadata-An Example
Metadata Is Data about Data … and All Other Facet of Dataset ! Who? What? When? Where? How?
Standards for Metadata ISO is a geo-spatial metadata standard developed by ISO/TC 211. ISO defines a comprehensive metadata model for geographic objects. ISO/TC 211also defined a smaller set of core metadata elements (shown on example slide.) This core contains the minimum elements that satisfy the requirements of an ISO conformant metadata record. The ISO standard does not specify storage format, but XML schemas are under development for an XML encoding of it (in full or for specialized profiles). CSDGM/FGDC (Content Standard for Digital Geospatial Metadata) is a standard for metadata for geographic objects developed by FGDC (Federal Geographic Data Committee). However, this standard is not limited to spatial data. FGDC enables development of profiles, i.e. customization of the standard to suit the needs of a particular application domain (while staying within the framework of the standard). SPOT imagery FGDC example c-html.xsl¤tTab=¤tItem=&portal=gcmd EML (Ecological Metadata Language) Darwin Core The Darwin Core (sometimes abbreviated as DwC) is a standard designed to facilitate the exchange of information about the geographic occurrence of species and the existence of specimens in collections. Many Standards Can be Chosen
Ecological Metadata Language is… an ecological metadata standard very extensible; it can be used to describe many different types of data comprehensive and supports a rich set of constructs to fully describe data XML and is defined by an XML Schema exploitable by different computer applications What Is EML?
Early ecological metadata work in LTER and elsewhere FLED report Michener et al. paper EML XML 1.0 released EML 1.4.xEML beta1-9EML 2.0.0rc1-3EML 2.0.0EML BDP approved Second EML workshop EML ASU meeting KNB Tools Workshop EML FGDC CSDGM NBII created ‘91‘92‘93‘94 ‘96‘97‘98‘99 ‘01‘02‘03‘04 ISO GML 3.0 FGDC CSDGM 2.0 FGDC CSDGM RS FGDC created NCEAS formed First EML workshop EML History
EML Modules
Darwin Core Ratified in the Year of Darwin!!
taxonRank higherClassification taxonConceptID associatedSequences geodeticDatum specificEpithet coordinatePosition associatedSequences: A list (concatenated and separated) of identifiers (publication, global unique identifier, URI) of genetic sequence information associated with the Occurrence. Darwin Core – a glossary of terms
Data Archiving Provides Opportunities for Auto Analysis
Species Distribution Prediction (Abies Kawakamii)
Metadata list Key words query Metadata provides data source Retrieval of dataset Data analysis An Example of Biodiversity and Ecological Data
Thank You!