Presentation is loading. Please wait.

Presentation is loading. Please wait.

Database Publishing at Nature Timo Hannay Nature Publishing Group 7 October 2005.

Similar presentations


Presentation on theme: "Database Publishing at Nature Timo Hannay Nature Publishing Group 7 October 2005."— Presentation transcript:

1 Database Publishing at Nature Timo Hannay Nature Publishing Group 7 October 2005

2 Overview Publishing collaborations: Making databases more like journals NPG New Technology: Making journals more like databases Tagging and social bookmarking: New methods of annotation and navigation

3 Database publishing at NPG The AfCS-Nature Signaling Gateway (http://www.signaling-gateway.org/) The CMC-Nature Cell Migration Gateway (http://www.cellmigration.org/) Forthcoming collaborations with NCI and several other groups

4 The AfCS-Nature Signaling Gateway A freely available online resource for anyone interested in cellular signalling A collaboration with the research community through the Alliance for Cellular Signaling An experiment in the next generation of online, database-driven scientific publications

5 The Signaling Gateway Hardware & software hosted at San Diego Supercomputer Center Molecule Pages AfCS Data Center Signaling Update Home, Info & News Facts and figures on major cell signaling proteins (3,700+) Continually updated by selected experts (~1000) Peer-review run by NPG News & comment written and commissioned by NPG editors Repository for raw experimental data from AfCS Tools for viewing and analyzing data (online & offline)

6

7

8

9

10

11

12 The Molecule Pages Comprehensive, structured data for 3,700+ proteins involved in cellular signalling Some information automatically fed in from other online databases and updated monthly Other information entered by selected expert authors and updated annually Author-entered data peer-reviewed by NPG Fully citable using digital object identifiers (DOIs)

13

14

15

16

17

18

19 Using Digital Object Identifiers Nature 409, 860 - 921 (2001) doi:10.1038/35057062 Allows unambiguous identification of paper Allows readers to find the paper online Allows publishers to cross-link reference lists Guaranteed not to change (even if the publisher changes) http://dx.doi.org/10.1038/35057062 IDF/CrossRef databases Correct URL at publisher’s website

20 The Molecule Pages: A scientific publication CharacteristicTraditional journal Traditional database Molecule Pages Recognised serial publication with an ISSN  Authored by recognised scientific experts  ?  Subjected to full anonymous peer review  Maintained indefinitely (with errata and addenda)  Formerly citable and fully integrated into CrossRef  Structured and highly queryable  The Molecule Pages has the same features as a traditional journal, except that the information it contains is more highly structured and queryable.

21 Overview Publishing collaborations: Making databases more like journals NPG New Technology: Making journals more like databases Tagging and social bookmarking: New methods of annotation and navigation

22 Great underestimated technologies of our age Alternating current (1880s) Executing criminals The electrically powered society Web-based scientific publishing (2004) A new charging model for scientific papers Redefining the concept the scientific paper Steam engines (early 1700s) Pumping water from coal mines The Industrial Revolution Technology Purported useEventual impact

23 Scientific papers as structured data objects Print journal Online facsimile circa 2000 Article metadata database Structured data sets circa 2006 Structured, interactive and queryable figures and text

24 Experimental article metadata database Initial data to be included: Author and institute details Scientific:  Molecules (InChI)  Genes (Entrez Gene)  Proteins (UniProt)  Cellular processes, functions, locations (GO)  Species (NCBI) Citation annotations (controlled vocabulary)

25

26 Support for structured data sets Preview in browserDownload to desktop software Search for more data Developing support for: Systems Biology Markup Language CellML Chemical Markup Language Others

27 SVG: Figures as interactive data objects Plot graph on axes of choiceOverlay data sets of choice Click to download raw data Zoom and pan to view detail

28

29 Automated scientific markup and linking

30 Increasing structure in text markup (1) The old way (no semantic markup): “...gp120 binding to CXCR4 or CCR5 activates PYK2 and FAK… ” Now (key entities and concepts marked up): “... gp120 binding to CXCR4 or CCR5 activates PYK2 and FAK … ”

31 Increasing structure in text markup (2) The new way (full RDF/XML):... gp120 binding to CXCR4 or CCR5 activates PYK2 and FAK … With RDF markup, the article XML itself literally becomes a relational database

32 Why go to all this effort? Discoverability and recontextualisation “Show me statements about the hedgehog gene.” “Find claims that disagree with this.” Transparency and flexibility“Plot this graph on a different scale, with error bars added and with these two extra data sets overlaid.” Specificity and completeness“Give me a full description of this mathematical model that I can run on my own computer.” Reuse and interoperability“Provide the raw data set used in this analysis in a form that allows me to merge it with my own data.”

33 Views from the database side “Before the end of the next decade, pathway databases will become scientific journals and journals will become databases. Biologists will be greatly empowered, and bioinformatics will continue its long evolution.” Lincoln Stein (Reactome) “Is a biological database any different than a biological journal? I am working toward reaching an answer of, no, there is no difference.” Phil Bourne (Protein Data Bank)

34 Overview Publishing collaborations: Making databases more like journals NPG New Technology: Making journals more like databases Tagging and social bookmarking: New methods of annotation and navigation

35

36

37

38

39

40

41

42

43

44

45

46

47

48

49 A few uses for Connotea Keeping bookmarks and references in order Sharing links and ideas within a team (perhaps geographically dispersed) Providing readers with a (dynamic) list of further or related reading Encouraging readers to share relevant links with the author and with each other


Download ppt "Database Publishing at Nature Timo Hannay Nature Publishing Group 7 October 2005."

Similar presentations


Ads by Google