Peter Lee Head, Computer Science Department Carnegie Mellon University.

Slides:



Advertisements
Similar presentations
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Advertisements

"Een nieuwe blik op sterrenkunde" "Panoramische camera's en het Astro-Wise systeem" Gijs Verdoes Kleijn Kapteyn Astronomical Institute, University of Groningen.
Space …. are big. Really big. You just won't believe how vastly, hugely, mindbogglingly big they are. Massive data streams Douglas Adams – Hitchhiker’s.
Current NIST Definition NIST Big data consists of advanced techniques that harness independent resources for building scalable data systems when the characteristics.
Human- Computer Interfaces HUMAN COMPUTATION.  Humans helping solve large problems  Using humans WITH computers to solve problems not solvable be either.
Luis von Ahn Carnegie Mellon University. Verification technology developed in collaboration with Carnegie Mellon University “CAPTCHA”
AN IMPROVED AUDIO Jenn Tam Computer Science Dept. Carnegie Mellon University SOAPS 2008, Pittsburgh, PA.
1 SLAC National Accelerator Laboratory Amber Boehnlein October 18, 2011.
A 100,000 Ways to Fa Al Geist Computer Science and Mathematics Division Oak Ridge National Laboratory July 9, 2002 Fast-OS Workshop Advanced Scientific.
1 Lecture-I CSIT-120 Spring 2001 Introducing the Course Syllabus Introduction to Computers Introduction to Computer Science Information, Algorithms and.
_______________ RIT Observatory Data Pipeline & Automation Project: Summer Research ______________________ Presented by: Kevin Beaulieu & Dustin Crabtree.
Telling Humans and Computers Apart (Automatically) Or How Lazy Cryptographers do AI Luis von Ahn The Aladdin Center Carnegie Mellon University.
Detecting and Exploiting Narrow Bitwidth Computations Mihai Budiu Carnegie Mellon University joint work with Seth Copen Goldstein.
Human Computation CSC4170 Web Intelligence and Social Computing Tutorial 7 Tutor: Tom Chao Zhou
Moments of Discovery Forging connections between the humanities and the sciences.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
The Mind Map of a Data Scientist Rebecca Perry and Carlota Valdivieso, Work Experience Students July 2013 What qualifies Data Science? Many things qualify.
Agile: A Time-Series CCD Photometer to Study Variables Anjum Mukadam, Russell Owen, Ed Mannery University of Washington, Seattle.
U.S. Department of the Interior U.S. Geological Survey David V. Hill, Information Dynamics, Contractor to USGS/EROS 12/08/2011 Satellite Image Processing.
ICS 499 Projects Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 12/7/20111Lipyeow Lim -- University of.
Mrs. Beth Cueni Carnegie Mellon
intelligence study and design of intelligent agentsis the intelligence of machines and the branch of computer science that aims to create it. AI textbooks.
Tennessee Technological University1 The Scientific Importance of Big Data Xia Li Tennessee Technological University.
Hopkins Storage Systems Lab, Department of Computer Science A Workload-Driven Unit of Cache Replacement for Mid-Tier Database Caching Xiaodan Wang, Tanu.
Big Data in Science (Lessons from astrophysics) Michael Drinkwater, UQ & CAASTRO 1.Preface Contributions by Jim Grey Astronomy data flow 2.Past Glories.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
The Cluster Computing Project Robert L. Tureman Paul D. Camp Community College.
Lecture on Computer Science as a Discipline. 2 Computer “Science” some people argue that computer science is not a science in the same sense that biology.
VESL-Career & life planning Career Presentation April 13, 2011 Mt.SAC.
Department of Mathematics, Statistics, and Computer Science An Experimental Laboratory Environment for Teaching Embedded Hardware Systems Dennis Brylow.
Desktop Video. Basics Desktop Video Desktop Video Frame Rate Frame Rate.
MARC: Developing Bioinformatics Programs July 2009 Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist:
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
Big Data Vs. (Traditional) HPC Gagan Agrawal Ohio State ICPP Big Data Panel (09/12/2012)
Agenda Motion Imagery Challenges Overview of our Cloud Activities -Big Data -Large Data Implementation Lessons Learned Summary.
ESR 2 / ER 2 Testing Campaign Review A. CrivellaroY. Verdie.
The KB e-Depot long-term preservation of scientific publications in practice Marcel Ras, National library of The Netherlands.
1.1 – What is Science?. What is Science? Science is … Knowledge – what we know A process – how we discover new things Driven by curiosity Asking questions.
1 Structure of Aalborg University Welcome to Aalborg University.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
CSE 102 Introduction to Computer Engineering What is Computer Engineering?
Edinburgh e-Science MSc Bob Mann Institute for Astronomy & NeSC University of Edinburgh.
What is Astronomy? An overview..
Sponsored by the U.S. Department of Defense © 2008 by Carnegie Mellon University page 1 Pittsburgh, PA The Implications of a Single Mobile Computing.
Commentary on: The Virtual Observatory G. Jogesh Babu Center for Astrostatistics
European open science cloud (EOSC) visions and impact on DARIAH roadmap Eveline Wandl-Vogt, Maarten Hoogerwerf, Jakub Szprot.
Capstone design Remote manless probe Kim Jinkwang Han Seongkyu.
1 This Changes Everything: Accelerating Scientific Discovery through High Performance Digital Infrastructure CANARIE’s Research Software.
MARC: Developing Bioinformatics Programs Alex Ropelewski PSC-NRBSC Bienvenido Vélez UPR Mayaguez Reference: How to Think Like a Computer Scientist: Learning.
Presented by SciDAC-2 Petascale Data Storage Institute Philip C. Roth Computer Science and Mathematics Future Technologies Group.
A U.S. Department of Energy laboratory managed by UChicago Argonne, LLC. Introduction APS Engineering Support Division –Beamline Controls and Data Acquisition.
Billy Vivian Dr. Oblitey COSC  What is CAPTCHA?  History  Uses  Artificial Intelligence Relationship  reCAPTCHA  Works Cited.
CSC322 OPERATING SYSTEM Mr. Dilawar Lecturer, Department of Computer Science, Jahan University Kabul, Afghanistan.
Parallel programs Inf-2202 Concurrent and Data-intensive Programming Fall 2016 Lars Ailo Bongo
Geoffrey Fox Panel Talk: February
CSCI 161: Introduction to Programming Course Introduction
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Rocky K. C. Chang September 4, 2017
Unit 1: Science, Technology and Engineering
COMPUTING BTEC LEVEL /17.
Web Programming Week 11 Old Dominion University
Mrs. Beth Cueni Carnegie Mellon
CSC Classes Required for TCC CS Degree
What is Astronomy? An overview..
Human computation, and the wisdom of crowds
What is this and how can I use it?
What is Astronomy? An overview..
Presented By Vibhute J.B. Class : M.Sc. (CS)
What is Astronomy? An overview..
Presentation transcript:

Peter Lee Head, Computer Science Department Carnegie Mellon University

The dark laboratory?

DataComputingPh.D.s Crowds

Predicted alignment for known B-helices on cross-family validation [Carbonell, et al] [Tom Mitchell, Marcel Just]

Fluid dynamics model reduction [AdrienTreuille, et al] “Universe in a box” [TizianadiMatteo, et al]

30M reCAPTCHAs are being solved per day, allowing the equivalent of 125 books to be digitized daily. Some people solve them simply to help digitize books. Over 750M different people have solved a CAPTCHA. Luis von Ahn, et al.

The deluge of data makes data-intensive computing mandatory for many future scientific endeavors Future students (and, ultimately, scientists) will find it natural for the data (and computing) to be “in the cloud” There are many big technology problems to be solved But there are also organizational and cultural gaps

A wide field-of-view telescope with 3.2Gpx camera, to be operational in 2014 Will provide full-sky survey (15sec exposure) every three nights In support of basic astronomy research, as well as change detection (eg, for asteroids) On the order of 16TB of image data per day A major focus on the hardware and pipes

In the 2014 time frame, such data storage requirements may not be extraordinary But deriving knowledge from such data streams efficiently will be hard Astronomers don’t yet know what questions they will want to ask It’s all about the software, and in this case the software is research

Virtually all major research instruments of the future will be software-driven, but with big lead times and unknowns Funding programs need to address and tolerate this Who gets to read? Who gets to write? Who gets to write code? What are the ownership, allocation, and trust models? Are there reward structures for researchers to develop the software architectures, APIs, languages, and algorithms?

Microsoft Research Faculty Summit 2008