Presentation is loading. Please wait.

Presentation is loading. Please wait.

The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009.

Similar presentations


Presentation on theme: "The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009."— Presentation transcript:

1 The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009

2 Use Case #1 NeuroCommons Project: Science Commons project using Semantic Web to link massive amounts of data

3

4 27,266 papers 4,563 papers 41,985 papers 10,365 papers 128,437 papers

5 NeuronDB BAMS Literature Homologene SWAN Entrez Gene Gene Ontology Mammalian Phenotype PDSPki BrainPharm AlzGene Antibodies PubChem MESH Reactome Allen Brain Atlas credit: W3C HCLS

6 NeuronDB BAMS Literature Homologene SWAN Entrez Gene Gene Ontology Mammalian Phenotype PDSPki BrainPharm AlzGene Antibodies PubChem MESH Reactome Allen Brain Atlas

7 Web page links to making computers understand linkages (the WWW)

8 receptorCell membrane is located in http://ontology.foo.org/receptor directed, contextual links

9 receptorCell membrane is located in “URI” (unique names for things on the web) http://ontology.foo.org/receptorhttp://ontology.foo.org/compartmenthttp://ontology.foo.org/receptor http://ontology.foo.org/is_located_in

10 receptorCell membrane is located in channelCell membrane is located in neuronCell membrane has

11 Cell membrane “compartment” “container” “doohickey” http://ontology.foo.org/compartment using the web to integrate data and databases

12

13 prefix go: http://purl.org/obo/owl/GO# prefix rdfs: df-schema#> prefix owl: <http://www.wprefix owl: prefix mesh: mmons/record/mesh/> prefix sc: prefix ro: <http://www.obofoundry.oprefix ro: select ?genename ?processname wheree { graph <http://purl.org/commons/hcls/pu{ graph { ?paper ?p mesh:D017966. ?article sc:identified_by_pmid ?paper.dentified_by_pmid ?paper. ?gen ?gene sc:describes_gene_or_gene_product_mentioned_by ?article. } graph.org/commons/hcls/goa> { ?protei { ?protein rdfs:subClassOf ?res. ?res owl:onProperty ro:has_function. ?res owl:someValuesFrom ?res2. ?res2 owl:onProperty ro:realized_as. ?res2 owl:someValuesFrom ?process. graph ttp://purl.org/commons/hcls/2007 {{?process go:GO_0007166} union {?process rdfs:subClassOf go:GO_0007166 }} ?protein rdfs:subClassOf ?parent. ?parent owl:equivalentClass ?res3. ?res3 owl:hasValue ?gene.owl:hasValue ?gene. } graph <http://purl.org/ graph { ?gene rdfs:label ?genename } graph purl.org/commons/hcls/20070416> { ?process rdfs:label ?processname} } Mesh: Pyramidal Neurons Pubmed: Journal Articles Entrez Gene: Genes GO: Signal Transduction better answers through better formats:

14 reformat what we already have reformat into a commons, not a closed system get the materials into the emerging research web

15 What data sharing protocol (legal and policy) best enables use of Web technology?

16 “Licensing” Archetypes Public Domain: No restrictions on use or distribution, no contracts, copyright waived. Community Licenses: standard “open access” licenses, a range of rights, some rights reserved, available to all Private Licenses: custom agreements, varies by institution, privately negotiated, may be offered only to some

17 Goals Interoperable: data from many sources can be combined without restriction Reusable: data can be repurposed into new and interesting contexts Administrative Burden: low transaction costs and administrative costs over time Legal Certainty: users can rely on legal usability of the data Community Norms: consistent with community expectations and usages

18 Interoperability Public Domain **** –Can be combined with other data sources with ease Community Licenses *** / ** –Depends on type of license: share-alike or copyleft are unsuitable, but attribution-only licenses are less problematic Private Licenses * / ** –Depends on restrictions, but not scalable; permutations too large

19 Reusable Public Domain **** –No restrictions on subsequent use Community Licenses *** –Depends on license, but some licenses such as NC / ND can be restrictive Private Licenses ** –Depends on license, but typically restrictive

20 Administrative Burden Public Domain **** –No paperwork or legal review needed Community License *** –Little paperwork, but some legal review needed (attribution stacking issues) Private Licenses * –Large amounts of paperwork, frequent legal review needed

21 Legal Certainty Public Domain **** / *** –Clear rights; generally irrevocable; (copyright should be addressed) Community Licenses *** –Generally credible, good track record with open access and open source licenses Private Licenses ** –Must be considered individually; few private licenses tested by time

22 Community Norms Public Domain *** –Traditional method for scientific data sharing (citation) Community Licenses *** –Relatively new, but familiar to computer scientists and open source community (attribution) Private Licenses ** –tendency to emphasize private / individual interests rather than community norms

23 Overall Grade Public Domain *** –Easiest and least restrictive form of sharing Community Licenses ** –Can be used to implement community expectations, but can be burdensome / restrictive Private Licenses * –High transaction costs, burdensome, unpredictable

24 Convergence

25 CC0 Released by Creative Commons in 2009 Result of a 3-year policy exploration process Not a license but a waiver of copyright

26 Why is it needed “Borderline” copyright European sui generis database rights Varying legal standards for copyright protection in different countries

27 CC0 [deed]

28 CC0 Waiver of copyright Waiver of sui generis database rights Waiver of “neighboring rights” Does not affect trademarks or patents Only affects rights of person making assertion

29 Use Case #2 Coordination and Sustainability of International Mouse Informatics Resources (CASIMIR) (EU Project) Commentary in Letter to Nature (Sept 2009) recommends PD and use of CC0 for sharing mouse genomic data Recommendations endorsed by scientists, NIH representatives, Jackson Labs, and editors of top scientific journals

30 Use Case #3 Personal Genome Project - personalized medicine project from George Church lab Adopted CC0 to release sequence and medical data collected from volunteers

31 Summary Solving some bioinformatics problems require ability to integrate massive quantities of data from diverse sources Public Domain sharing best fits this need CC0 waiver can be used to enrich public domain and provide clarity

32 Thank You Thinh Nguyen (thinh@creativecommons.org)thinh@creativecommons.org On the Web: http://www.sciencecommons.org http://www.sciencecommons.org


Download ppt "The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009."

Similar presentations


Ads by Google