
1 Licence to Share Research and Collaboration through Go-Geo! and ShareGeo. Guy McGarva, Geoservices Support, Project Manager for ShareGeo; Nicola Osborne, Social Media Officer for EDINA; David Medyckyj-Scott, Research & Geo-data Services Team Manager. UK e-Science All Hands Meeting 2009: Sharing & Collaboration, Wednesday 9th December 2009

2 Outline of this talk Introduction to ShareGeo & Go-Geo! The ShareGeo and Go-Geo! Community. The benefits and challenges of sharing geospatial data. Our experiences of enabling sharing of geospatial data sets (so far). Challenges and opportunities for the future.

3 Introduction to ShareGeo & Go-Geo!

4 What is the community? 40,000 users in 160 institutions. Scientists, researchers and students who use geospatial data. Those with data to share. Individuals looking for existing data and relevant resources. Those seeking visibility or reputation gains through building an active Depositor Profile. Engagement via the Digimap Blog, emails, RSS feeds, website updates, newsletters and Go-Geo! Twitter stream. Large natural overlap between the ShareGeo and Go-Geo! user communities. Digimap Users

5 Why Share Geospatial data? Significant collections of geospatial data have already been created. High cost (time and money) associated with collecting data. Existing data can form useful components of new data sets. Research, and so the use of data, can extend over long time periods. Increase visibility and/or create a record of data. Benefit from and work with derived licensed data.

6 Examples of types of Geospatial data in ShareGeo: Grids, Boundaries, DTM, Derived OS data, GPS, Land-use, Imagery.

7 Challenges of sharing Geospatial Data Sharing Licensed Data, particularly derived data, requires compliance with often complex licenses. Increased sharing of data is useful only if data has integrity and is of consistent quality 1. Commercial arrangements and third party data are subject to additional restrictions. There are technical issues such as format. Any shared data will be subject to a distributed trust network – you must trust any potential downloader not to expose or misuse data.

8 Derived Data & Licenses

9 Publish Metadata in Go-Geo! using GeoDoc –Private (Create Metadata in GeoDoc): No one will be able to discover your metadata. –Institutional Node (Publish metadata to an Institution Node): Only members of your institution will be able to discover your metadata. –Public (Publish metadata as Public): Anyone searching Go-Geo! or the www will be able to discover your metadata. –Research Cluster Node, planned (Publish metadata as Cluster Node): Anyone registered as part of your cluster will be able to discover your metadata. You can export metadata to XML to save locally.
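The four visibility levels on this slide can be sketched as a small discovery check. This is an illustrative model only, not GeoDoc's or Go-Geo!'s actual implementation: the names `Visibility`, `MetadataRecord`, `Searcher` and `can_discover` are all hypothetical, and the cluster level is described on the slide as planned.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Visibility(Enum):
    """Hypothetical labels for the four publishing levels on the slide."""
    PRIVATE = "private"          # created in GeoDoc, not published
    INSTITUTION = "institution"  # published to an Institution Node
    CLUSTER = "cluster"          # Research Cluster Node (planned)
    PUBLIC = "public"            # published as Public


@dataclass
class MetadataRecord:
    title: str
    visibility: Visibility
    institution: Optional[str] = None  # owning institution, if any
    cluster: Optional[str] = None      # owning research cluster, if any


@dataclass
class Searcher:
    institution: Optional[str] = None
    clusters: frozenset = frozenset()  # clusters the searcher is registered with


def can_discover(record: MetadataRecord, searcher: Searcher) -> bool:
    """Return True if the searcher should be able to find the record."""
    if record.visibility is Visibility.PUBLIC:
        return True
    if record.visibility is Visibility.INSTITUTION:
        return (record.institution is not None
                and searcher.institution == record.institution)
    if record.visibility is Visibility.CLUSTER:
        return record.cluster is not None and record.cluster in searcher.clusters
    # PRIVATE: no one can discover the record through search.
    return False
```

For example, a record published to an institution node is found by a searcher from that institution but not by an anonymous web visitor, while a private record is found by no one.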

10 ShareGeo & Go-Geo! ShareGeo is intended for when: –You are willing or able to share your data. –You want to reuse existing datasets. –You are sharing within a known community. Go-Geo! provides an alternative sharing mechanism for those that need to: –Publicise the existence of data. –Share metadata about ongoing work where the collected data may still be changing. –Describe complex data combinations (grey data) for which they cannot trace all licenses and which they therefore cannot share. –Share metadata publicly OR with peers.

11 Our contribution experiences Usage of both services is steadily growing, but most of our users are consumers, not creators: –Around 0.5% of ShareGeo users upload data, but 22% of ShareGeo users download data. –1.5% of Go-Geo! users create metadata records; 18% of all Go-Geo! visitors access these.
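To make the imbalance concrete, the percentages above can be turned into a rough consumer-to-creator ratio. This is a back-of-envelope sketch: the function name `consumers_per_creator` is hypothetical, and the Go-Geo! figure is only indicative because the slide quotes creators as a share of users but consumers as a share of visitors.

```python
def consumers_per_creator(creator_pct: float, consumer_pct: float) -> float:
    """Ratio of consuming users to contributing users, from two percentages."""
    if creator_pct <= 0:
        raise ValueError("creator_pct must be positive")
    return consumer_pct / creator_pct


# Figures taken from the slide.
sharegeo_ratio = consumers_per_creator(0.5, 22.0)  # downloaders per uploader
gogeo_ratio = consumers_per_creator(1.5, 18.0)     # record viewers per record creator

print(f"ShareGeo: about {sharegeo_ratio:.0f} downloaders per uploader")
print(f"Go-Geo!:  about {gogeo_ratio:.0f} viewers per metadata creator")
```

On these figures there are roughly 44 downloaders for every ShareGeo uploader and roughly 12 viewers for every Go-Geo! metadata creator.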

12 Our contribution experiences compared to others 1:9:90 2 is the often-quoted rule for online participation. –Under 0.0001% of Firefox users contribute to development or testing 3, 4. –0.02% of Wikipedia users are (active) editors/contributors 5. –7% of OpenStreetMap users make some type of edit each month 6. –10% of Twitter users author 90% of all Tweets 7. –24% of the British public have voted for a reality TV show 8. –49% of active UK Internet users have a profile on a social networking site 9. –62% of registered UK voters voted in the 2005 General Election 10.

13 Our Experiences to Date Far more downloads than uploads. Comments back from ShareGeo users include: –"I am not sure if my data would be of interest to others." –"I have often considered adding data to ShareGeo, given how often individuals must reproduce the work of obtaining and pre-processing datasets; however, the license agreements for each dataset prohibit such action." –"If I had any data to share I would definitely use ShareGeo."

14 Sharing Behaviours & Cultures Academic culture: –Funding is competitive. –Access can be very restricted (especially pre-publication). –Commercial restrictions may apply. –Collaboration is rarely directly rewarded. –It is hard to trust the reliability of others' data. –As a contributor there are concerns that: data could be misused, maliciously or not (data is often viewed through a tribal prism 11, 12); you might somehow be liable for what others do with or derive from your data; you could receive time-consuming questions about your data. Licensing culture: –Perceived as complex and litigious. –Can be intimidating even if data is licensed for sharing. Personal vs. community benefit: –Greatest benefit to the community comes, paradoxically, when a contributor is exiting it (e.g. graduating students). –A selfless attitude and a strong sense of community are rare.

15 Challenges for the Future Raise awareness and increase impact of ShareGeo and Go-Geo! Increase the number of both passive users and proactive creators. Define and publicize benefits to depositors, particularly around: –Community benefits (continuity, reuse & saved costs). –Personal reputation benefits (e.g. citation). Engage in the Making Public Data Public initiative.

16 Opportunities Making Public Data Public initiative 13: The Prime Minister announced in November 2009 14 that Ordnance Survey is going to make some data available for free: –Electoral and local authority boundaries. –Postcode areas. –Mid-scale mapping. Data from other agencies, including crime, transport, health and education, is to be included. This should reduce barriers to sharing data.

17 Short Term Technical Improvements Integrate with standard desktop apps for one-touch submission e.g. using SWORD. Visualize data with plug-in applications. Option to expose metadata either to search engines (Google) directly or via Go-Geo. Provide data in alternative formats, including web-services. Add more social features such as annotations, tagging and ratings. Gicentre, City University London - demo by Jo Wood:

18 Long Term Policy Improvements Source more open data (especially as more types of data become open). Create open access ShareGeo for unlicensed and/or less restrictively licensed materials. Measure, and display, the impact (re/use) of data more effectively. Improve visibility of data reuse and of the impact of ShareGeo (e.g. through citations). Seamless interoperability (around policy, licensing, access levels etc.) with the Go-Geo! metadata portal.

19 Thank You If you have any questions we would be very happy to answer them. Or email us: Or if you have any general comments about ShareGeo or Go-Geo! Email: Links ShareGeo: Go-Geo!: EDINA:

