Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”

Slides:



Advertisements
Similar presentations
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Advertisements

Moving Forward With Digital Preservation at the Library of Congress Laura Campbell Associate Librarian for Strategic Initiatives Library of Congress.
Maines Sustainability Solutions Initiative (SSI) Focuses on research of the coupled dynamics of social- ecological systems (SES) and the translation of.
Spatial Data Infrastructure: Concepts and Components Geog 458: Map Sources and Errors March 6, 2006.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements Workforce Demand and Career Opportunities From.
NSF and Environmental Cyberinfrastructure Margaret Leinen Environmental Cyberinfrastructure Workshop, NCAR 2002.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
1 The Australian Partnership for Sustainable Repositories Margaret Henty Digital Futures Industry Briefing November 8, 2006.
1 IS112 – Chapter 1 Notes Computer Organization and Programming Professor Catherine Dwyer Fall 2005.
Institutional Perspective on Credit Systems for Research Data MacKenzie Smith Research Director, MIT Libraries.
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Data Management Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition.
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Tutorial 1: Getting Started with Adobe Dreamweaver CS4.
Effective User Services for High Performance Computing A White Paper by the TeraGrid Science Advisory Board May 2009.
1 Common Challenges Across Scientific Disciplines Laurence Field CERN 18 th November 2013.
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
1 Guidelines For The Future Sharing Best Practice For National Bibliographies In The Digital Era Neil Wilson Information Coordinator IFLA Bibliography.
Database System Concepts and Architecture
2 Systems Architecture, Fifth Edition Chapter Goals Describe the activities of information systems professionals Describe the technical knowledge of computer.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
SCSC 311 Information Systems: hardware and software.
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Digital Preservation: Lessons learned through national action Digital Preservation Interoperability Framework Workshop April 2010.
CSCI 5980: From GPS and Google Earth to Spatial Computing Fall 2012 Midterm Presentation Chapter 7: Architectures Team 9: Thao Nguyen, Nathan Poole October.
Planning for Arctic GIS and Geographic Information Infrastructure Sponsored by the Arctic Research Support and Logistics Program 30 October 2003 Seattle,
Providing Access to Your Data: Access Mechanisms Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Sharing Research Data Globally Alan Blatecky National Science Foundation Board on Research Data and Information.
Dr. Fran Berman, RPI Feedback from BRDI Sponsor Forum 11/11 January 29, 2012 Fran Berman.
DOE 2000, March 8, 1999 The IT 2 Initiative and NSF Stephen Elbert program director NSF/CISE/ACIR/PACI.
NanoHUB.org and HUBzero™ Platform for Reproducible Computational Experiments Michael McLennan Director and Chief Architect, Hub Technology Group and George.
National Center for Supercomputing Applications Barbara S. Minsker, Ph.D. Associate Professor National Center for Supercomputing Applications and Department.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
CyberInfrastructure for Network Analysis Importance of, contributions by network analysis Transformation of NA Support needed for NA.
Symposium on Global Scientific Data Infrastructures Panel Two: Stakeholder Communities in the DWF Ann Wolpert, Massachusetts Institute of Technology Board.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Marv Adams Chief Information Officer November 29, 2001.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
29 March 2004 Steven Worley, NSF/NCAR/SCD 1 Research Data Stewardship and Access Steven Worley, CISL/SCD Cyberinfrastructure meeting with Priscilla Nelson.
SAON Data Policy, activities, services, and deliveries Jan René Larsen SAON Board Meeting, 29 April 2013, Vancouver, Canada,
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
IT and Network Organization Ecommerce. IT and Network Organization OPTIMIZING INTERNAL COLLABORATIONS IN NETWORK ORGANIZATIONS.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
System Development & Operations NSF DataNet site visit to MIT February 8, /8/20101NSF Site Visit to MIT DataSpace DataSpace.
1 Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in.
Margret Plank 17th International Conference on Grey Literature 1st and 2nd December 2015, Amsterdam (Netherlands) Move beyond text – How TIB manages the.
Infrastructure Breakout What capacities should we build now to manage data and migrate it over the future generations of technologies, standards, formats,
Working with your archive organization: Broadening your user community Robert R. Downs, PhD Socioeconomic Data and Applications Center (SEDAC) Center for.
Working with Your Archive : Broadening Your User Community Robert R. Downs, PhD NASA Socioeconomic Data and Applications Center (SEDAC) Center for International.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Chapter 1 Computer Technology: Your Need to Know
IOT – Firefighting Example
Strategies for NIS Development
Green IT CHAPTER 3: PROGRAMMATIC AND INSTITUTIONAL OPPORTUNITIES TO ENHANCE COMPUTER SCIENCE RESEARCH FOR SUSTAINABILITY.
Summit 2017 Breakout Group 1: Advanced Research Computing (ARC)
Overview What is Multimedia? Characteristics of multimedia
DOE 2000 PI Retreat Breakout C-1
DIGITAL LIBRARY.
Bird of Feather Session
Presentation transcript:

Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks” Characteristics (give some numbers) Dimension Format (analogue and digital) – video, physical, numerical Media types (text, numeric, images, audio, video) – print, samples Attributes Historical and contemporary data Availability Timeliness Quality Metadata - Describes the data Security Wed, Oct 30, 2002

Special Properties and Aspects of Environmental Sciences Complexity – data from multiple disciplines Time series - long term monitoring, historical data, endangered data Geospatially referenced (GIS not handle time series well) Scale – vast time (seconds to 4.6 B yrs) and space (microns to kilo km) scales Marriage of observation and computation Real time data collection, distribution, use, response Societal impacts Emergent properties Education Data accessibility – physically and intellectually available to scientists and broader audiences (public, educators, decisions makers) Mostly open and free access of data

Needs Categories – “4 A’s” (define roles and professions who should be engaged in the processes) Acquisition – sensors, legacy, transmission, connectivity Archive - storage, managing, cataloguing, preservation Application – manipulation, interpretation Accessibility – discovery and outreach (make data known, marketing) Documentation Standards for metadata information Integration/compatibility with existing initiatives – e.g. NSDI Content standards process, separate from IT standards Data and cooperation from multidisciplinary sources 4-D observation, analyses, visualization Metadata needs to be linked to data p. 1

Needs Toolboxes and tutorials data mining (including semantic web, web services) data subsetting, distribution, and extraction tools 4-D search engine and filtering Longevity of data Rewards system for database contributions Distribution services for data generators and providers – libraries, value-added (data providers are also data users) People and education infrastructure Ready access to IT support No one can do it all or by themselves Training to work in teams Process for repurposing, reusability of data p. 2

Needs System Bandwidth – locally sufficient except for ‘last mile’ Ability to limit resolution to deal with ‘last mile’ problem Global connectivity - address 3 rd world limitations High performance transmission Data Interoperability/interchange Ease of use of system, hardware, software Maturation of technology system Internet connectivity all the way to the sensor – common protocols Systems engineering approach Services “Ship time” computational resources allocated automatically for funded projects Educated students; cross-trained Content Adequate quality data in digital form – computational capability may outstrip data resources

Challenges Across the board support for archiving data and populating data bases Global interconnectivity and interoperability – rapidly evolving formats and standards Reward structure – tenure track, proposals, citations, investigators, ‘meta-citations’ hits Citations for continuously evolving data sets Establishing metrics for data use There are only limited models to apply in developing the system Role of federal agencies (NSF and others) in establishing, managing, maintaining, sensor networks and observing systems (what is the “exit strategy”?, unfunded mandates) End to end plan from federal agencies in acquiring, populating, managing, maintaining, archiving, and distributing data sets (“exit strategies” and unfunded mandates) p. 1

Challenges Conversion of IT to sustainable scientific infrastructure Stewardship of the CI longterm Prioritization and criteria for preservation of legacy data Technological limitations – e.g. high speed wireless, IT tools, terascale operations (but data is coming in that fast) Security – data-, system-, national- Scalability Middleware and operating systems maintenance and support – no incentive to modify original version Building community consensus p. 2

Principles: Creation of the Cyberinfrastructure should: Be process oriented Respond to changing/evolving needs Enable innovation Think broadly, don’t limit approaches Be open sourced Advance scientific research Facilitate integration of research and education System needs to evaluate its own utility (metrics) Feedback system from users