RDA Plenary 5 Big Data (Analytics) IG Session

1 RDA Plenary 5 Big Data (Analytics) IG Session
10 March 2015 Paradise Point San Diego, California

2 Next Joint Session Canceled
The planned breakout joint session with the Reproducibility IG has been canceled:
10 March, 16:00-17:30, Sunset Ballroom Salon V
The responsible co-chairs from both IGs are unable to attend this Plenary.
3/10/15 RDA P5, San Diego, California

3 Agenda (Final Version, Promise!)
Time | Presenter/Moderator | Presentation/Discussion
14:00-14:05 | Kuo | Session Introduction (Actually we are waiting for people to come in…)
14:05-14:25 | Markus Götz | Smart Data Analytics – 3 use cases in different domains
14:25-14:45 | Line Pouchard | Issues in Big Data Curation
14:45-15:05 | Peter Baumann | Use Case Advances Report: N-D Arrays, Spatiotemporal Earth Data
14:45-15:00 | | The Case for IG Name Change – Big Data Analytics IG
15:00-15:30 | Members and Participants | BD IG Outcome and Deliverable Discussion; WG Creation and Coordination Discussion

4 Presentations Start

6 Draft NBD-PWG Reference Architecture

7 Mission
The ultimate goal of the RDA Big Data Interest Group is to produce a set of recommendation documents to advise diverse research communities on:
- How to select an appropriate Big Data solution for a particular science application with optimal value. Important: Need to connect with various science/research domains!
- What the best practices are for dealing with the various data and computing issues associated with such a solution.

8 Objectives
- Clarifying, and sometimes defining, terminologies related to Big Data, leveraging:
  - ISO/IEC JTC 1 Terms and Definitions,
  - NIST Big Data PWG (NBD-PWG) Definitions and Taxonomies documents, and
  - RDA Terminologies WG.
- Characterizing leading Big Data technologies. Important: Need to collaborate with relevant RDA IGs and initiate Working Groups. Example characteristics include: performance, resource utilization, scalability, usability, flexibility, extensibility, propensity for supporting scientific collaborations, etc.
- Collaborating with external entities through IG member involvement, including: ISO, NIST, INCITS, OGC, NBD-PWG, EarthCube, EarthServer, etc.
- Producing a set of recommendation documents based on results obtained from the activities undertaken to attain the above objectives, including:
  - A systematic classification of algorithms pertinent to the characterization of Big Data technologies,
  - Characterizations of the Big Data technologies investigated, especially their value characteristics in each category of use cases,
  - Frequency of each class of algorithms and/or queries used by workflows in various use cases, delineated by science domains/subdomains, and
  - Feasible combinations of analysis algorithms, analytical tools, data and resource characteristics, and scientific queries.

9 Participation
- Domain scientists wishing to utilize Big Data solutions for their research and/or applications,
- Data specialists with experience in data production, curation, analysis, and management, especially involving large volumes and varieties of data,
- Computational scientists or software engineers with special interests in data analysis techniques and algorithm analysis, especially pertaining to Big Data-relevant technologies and tools,
- Experts, or aspiring experts, in various Big Data technologies and tools,
- Computational infrastructure and architecture experts in fields such as distributed computing, high-performance computing, and database systems,
- Data scientists with a blended interest involving some subset of the activities mentioned above, in particular the sharing, use, and reuse of open scientific datasets, and
- Managers involved in any combination of the activities mentioned above.

10 Interaction Mechanism
- Monthly teleconference with a planned agenda to discuss specific issues.
  - Proposing 10 AM US Eastern Time (4 PM Central European Time) every 1st or 2nd Thursday of each month.
  - We will use GoToMeeting instead of the default RDA means for teleconferencing.
  - The agenda should be available 1 week before the meeting.
  - Meeting minutes should be available within 1 week after the meeting.
- Asynchronous collaboration using RDA Wiki, Google Docs, and lists.
- Semiannual RDA Plenary meetings to hold sessions for progress reports and face-to-face interactions amongst interested parties.

11 Schedule
Year Qr. | Task
2015 Q1 | Revise BDA IG (original group name) charter to suit the broadened scope of proposed IG name change to “Big Data Interest Group”. Prepare RDA 5th Plenary.
Q2 | Start the planning and organization of studies into the characterization of various popular Big Data technologies. Evaluation of potential WG spinoffs based on characterization work.
Q3 | Progress reports on characterization studies. Prepare RDA 6th Plenary. Created spinoff WGs on detailed big data studies.
Q4 2016 | Produce a report on characterization studies. Prepare RDA 7th Plenary. Initial results of spinoff WGs and their findings.

