David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing.

Presentation transcript:

David P. Anderson Space Sciences Laboratory University of California – Berkeley Volunteer Computing

Outline ● Volunteer computing ● BOINC: an OS for volunteer computing ● Applications ● Challenges and research directions

Where's the power? ● 2010: 1 billion Internet-connected PCs, 55% privately owned ● If 100M people participate: – 100 PetaFLOPs, 1 Exabyte (10^18 bytes) storage ● Consumer products drive technology – GPUs (NVIDIA, Sony Cell) ● [Diagram labels: your computers, academic, business, home PCs]
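
The aggregate figures on this slide imply a per-participant contribution, which the short sketch below works out. It assumes one host per participant and decimal units (1 PetaFLOPs = 10^15 FLOPS, 1 GB = 10^9 bytes); the variable and function names are illustrative, not from the talk.

```python
# Back-of-the-envelope check of the slide's aggregate figures.
PARTICIPANTS = 100_000_000        # "100M people", from the slide
TOTAL_FLOPS = 100e15              # "100 PetaFLOPs", from the slide
TOTAL_STORAGE_BYTES = 1e18        # "1 Exabyte (10^18)", from the slide


def per_host(total: float, hosts: int) -> float:
    """Average share of an aggregate resource per participating host."""
    return total / hosts


flops_per_host = per_host(TOTAL_FLOPS, PARTICIPANTS)
storage_per_host = per_host(TOTAL_STORAGE_BYTES, PARTICIPANTS)

print(f"{flops_per_host / 1e9:.0f} GFLOPS per host")   # prints "1 GFLOPS per host"
print(f"{storage_per_host / 1e9:.0f} GB per host")     # prints "10 GB per host"
```

In other words, the totals correspond to roughly 1 GFLOPS and 10 GB contributed per host under these assumptions.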

Volunteer computing history ● Projects: GIMPS, distributed.net, climateprediction.net ● Terms: volunteer computing, public [resource] computing, Internet computing, screensaver computing, global computing, peer-to-peer computing, Grid computing ● BOINC

Volunteer/Grid differences

Save money! ● Suppose processing 1 GB of data takes X computer days ● Cluster/Grid: – computing: $1 per CPU/day – network: free – cost per GB: $X ● Volunteer computing: – computing: free – network: $1 per 20 GB – cost per GB: $1/20 ● So volunteer computing is cheaper if X > 1/20 (X = 1,000)
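
The cost comparison on this slide can be sketched as a small calculation. The prices ($1 per CPU-day, $1 per 20 GB of network traffic) are the slide's figures; the function names are hypothetical.

```python
def cluster_cost_per_gb(x_cpu_days: float, dollars_per_cpu_day: float = 1.0) -> float:
    """Cluster/Grid: the network is free, so the cost is CPU time only."""
    return x_cpu_days * dollars_per_cpu_day


def volunteer_cost_per_gb(gb_per_dollar: float = 20.0) -> float:
    """Volunteer computing: CPU is free, so the cost is network transfer only."""
    return 1.0 / gb_per_dollar


def volunteer_is_cheaper(x_cpu_days: float) -> bool:
    """True when processing 1 GB costs less with volunteers than on a cluster."""
    return volunteer_cost_per_gb() < cluster_cost_per_gb(x_cpu_days)


if __name__ == "__main__":
    for x in (0.01, 0.05, 1000.0):
        print(x, cluster_cost_per_gb(x), volunteer_cost_per_gb(), volunteer_is_cheaper(x))
```

With the slide's value X = 1,000, the cluster costs $1,000 per GB against $0.05 per GB for volunteer computing; the break-even point is X = 1/20.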

Educational discount ● [Diagram: UCB; Internet2 (free, underutilized); commodity Internet ($$); partner institutions (UCLA, UIUC); underutilized flat-rate ISP connections] ● ... so bandwidth may be effectively free also

Infrastructure software ● Roll your own ● XtremWeb, Cosm – not complete/robust ● United Devices, Entropia – not free ● Grid (Globus/Condor), JXTA – solve a different problem ● BOINC (Berkeley Open Infrastructure for Network Computing)

Projects and participants ● [Diagram: projects (SETI, physics, climate, biomedical) linked to participants (Joe, Alice, Jens); labels: diversity, autonomy, heterogeneity, allocation, trust]

Encourage participation in >1 project ● Better long-term resource utilization – project A works while project B thinks ● Better short-term resource utilization – communicate/compute in parallel – match applications to resources ● [Figure: project computing needs over time, alternating "think" and "work" phases]

Creating a BOINC project ● Install BOINC server software on Unix box ● Adapt or develop application – compile for various platforms ● Write scripts/programs to: – generate tasks – validate results – handle results ● Develop web site ● Get media coverage

Structure of a BOINC project ● Scheduling server (C++) ● BOINC DB (MySQL) ● Data server (HTTP) ● Web interfaces (PHP) ● Work generation ● Retry generation ● Result validation ● Result processing ● Garbage collection ● Ongoing tasks: – monitor server correctness – monitor server performance – develop and maintain applications

Redundant computing ● Addresses hardware errors, hackers ● Issue 2 or more copies of each task – don't send to same host or user – timed retry up to a limit ● Result comparison approaches – Application-specific “fuzzy comparison” – Homogeneous redundancy ● send copies only to numerically equivalent hosts – Develop platform-independent app
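
As a rough illustration of redundant computing with an application-specific "fuzzy comparison", the sketch below accepts a result once enough replicas from different users agree within a tolerance. This is not BOINC's actual validator interface; the `Result` fields, the quorum of 2, and the relative tolerance are assumptions chosen for the example.

```python
from dataclasses import dataclass
from math import isclose
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Result:
    host_id: int                  # hypothetical field names
    user_id: int
    values: Tuple[float, ...]     # numerical output of one task replica


def fuzzy_equal(a: Result, b: Result, rel_tol: float = 1e-6) -> bool:
    """Two results agree if all outputs match within a relative tolerance."""
    return len(a.values) == len(b.values) and all(
        isclose(x, y, rel_tol=rel_tol) for x, y in zip(a.values, b.values)
    )


def find_canonical(results: Sequence[Result], quorum: int = 2,
                   rel_tol: float = 1e-6) -> Optional[Result]:
    """Return a result that at least `quorum` distinct users agree on, else None."""
    for candidate in results:
        agreeing = [r for r in results if fuzzy_equal(candidate, r, rel_tol)]
        # Count distinct users so one participant cannot confirm their own result.
        if len({r.user_id for r in agreeing}) >= quorum:
            return candidate
    return None
```

If `find_canonical` returns `None`, a project would issue another copy of the task (the "timed retry up to a limit" on the slide) and compare again once it returns.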

What do participants want? ● Incentives – contribute to science – get acknowledgement – community – screensaver graphics ● Invisibility, control of resource usage ● Involvement – translation, porting etc.

Credit accounting ● Credit is granted for – computation (CPU time x benchmark) – storage – network communication ● Cheat-resistance ● Accounting – user, host, team ● Credit DB export for 3rd-party web sites – cross-project identification
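
The credit scheme outlined on this slide might be sketched as follows. Only the "CPU time x benchmark" product and the user/host/team accounting come from the slide; the rule of granting the lowest claim among agreeing replicas is an assumed cheat-resistance policy for illustration, and all names and units are hypothetical.

```python
from collections import defaultdict
from typing import Iterable


def claimed_credit(cpu_seconds: float, benchmark_score: float) -> float:
    """Credit claimed for computation: CPU time x benchmark (illustrative units)."""
    return cpu_seconds * benchmark_score


def granted_credit(claims: Iterable[float]) -> float:
    """Assumed policy: grant the lowest claim, so inflating a claim gains nothing."""
    return min(claims)


class CreditLedger:
    """Keeps running credit totals per user, host, and team."""

    def __init__(self) -> None:
        self.totals = {
            "user": defaultdict(float),
            "host": defaultdict(float),
            "team": defaultdict(float),
        }

    def grant(self, user: str, host: str, team: str, credit: float) -> None:
        self.totals["user"][user] += credit
        self.totals["host"][host] += credit
        self.totals["team"][team] += credit


if __name__ == "__main__":
    ledger = CreditLedger()
    claims = [claimed_credit(3600, 2.0), claimed_credit(3600, 5.0)]
    ledger.grant("alice", "host-1", "team-a", granted_credit(claims))
    print(dict(ledger.totals["user"]))
```

Totals kept in this shape are also straightforward to export for third-party statistics sites.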

Participating ● Select project(s) ● Create account(s) ● Download/install BOINC client software ● Interact via web: – preferences – leaderboards – profile – teams – message boards, dynamic FAQ

Anonymous platform mechanism ● Participant compiles software from source ● Scheduler RPC: platform is “anonymous” ● Purposes: – support obscure platforms – security-conscious participants – performance tuning of applications

Client structure ● [Diagram: app, core client, screensaver, BOINC Manager, servers]

Applications ● Computation model – Workunits, results – Deadlines, resource estimates ● Data model – files, file references ● Mostly existing apps (FORTRAN, C) ● Categories – Physical simulation – Data processing – Distribution for its own sake
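
One way to picture the computation and data models is as plain data records. The sketch below is an assumption-laden illustration, not BOINC's schema: every class and field name is hypothetical, and the feasibility check is a simplified stand-in for what a scheduler would do with deadlines and resource estimates.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FileRef:
    """Reference to an input or output file (data model)."""
    name: str
    nbytes: int


@dataclass
class Workunit:
    """A unit of computation with its resource estimates."""
    name: str
    input_files: List[FileRef]
    est_flops: float        # estimated floating-point operations
    est_disk_bytes: int     # estimated disk space needed on the client
    delay_bound_s: float    # time allowed between dispatch and report


@dataclass
class ResultInstance:
    """One dispatched instance of a workunit."""
    workunit: Workunit
    sent_time_s: float

    @property
    def deadline_s(self) -> float:
        return self.sent_time_s + self.workunit.delay_bound_s


def fits_host(wu: Workunit, host_flops: float, free_disk_bytes: int) -> bool:
    """True if the host can likely finish before the deadline and has the disk space."""
    est_runtime_s = wu.est_flops / host_flops
    return est_runtime_s <= wu.delay_bound_s and wu.est_disk_bytes <= free_disk_bytes
```

For example, a workunit estimated at 1e13 operations fits a 1 GFLOPS host only if its delay bound is at least 10,000 seconds.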

● Analysis of radio telescope data from Arecibo – SETI: search for narrowband signals – Astropulse: search for short broadband signals ● 0.3 MB in, ~4 CPU hours, 10 KB out ● Enhancements under BOINC: – data archival on clients – direct data distribution from observatory

Climateprediction.net ● Climate change study (Oxford University) – Met Office model (FORTRAN, 1M lines) ● Input: ~10MB executable, 1MB data ● Output per workunit: – 10 MB summary (always upload) – 1 GB detail file (archive on client, may upload) ● CPU time: 2-3 months (can't migrate) – trickle messages – preemptive scheduling

Biology projects ● Protein folding – (Scripps Institute) – (Stanford) ● Virtual drug discovery – ● Gene sequence analysis – NTT projects – Lattice (U. Maryland)

● Gravitational wave detection; LIGO ● UW Milwaukee/CalTech/Max Planck Inst. ● 30, MB data sets ● Each data set is analyzed w/ 40,000 different parameter sets; each takes ~6 hrs CPU ● Locality scheduling – minimize data transfer, client disk usage – minimize credit-granting delay
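
Locality scheduling, as described here, prefers tasks whose data set is already on the requesting client. A minimal sketch of that idea, assuming a simple in-memory task list (the function name and tuple layout are hypothetical, not the project's scheduler):

```python
from typing import List, Optional, Set, Tuple

Task = Tuple[str, str]  # (task_id, data_file) -- illustrative layout


def pick_task(pending: List[Task], files_on_client: Set[str]) -> Optional[Tuple[str, bool]]:
    """Choose a task for a client; returns (task_id, needs_download) or None.

    Tasks whose data file the client already holds are preferred, which
    minimizes data transfer and client disk usage.
    """
    for task_id, data_file in pending:
        if data_file in files_on_client:
            return task_id, False
    if pending:
        # No local match: fall back to the first task and download its file.
        return pending[0][0], True
    return None


if __name__ == "__main__":
    pending = [("t1", "set_A"), ("t2", "set_B"), ("t3", "set_B")]
    print(pick_task(pending, {"set_B"}))   # ('t2', False)
    print(pick_task(pending, set()))       # ('t1', True)
```

Because each data set is analyzed with many parameter sets, a client that already holds one file can work through many tasks without further downloads.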

CERN projects ● – accelerator simulation (Sixtrack) ● – collision data analysis

Others ● UCB Internet measurement – map/measure the Internet and home PCs ● BURP (Big Ugly Rendering Project) – ray-tracing ● PlanetQuest – image analysis for planetary transit detection

Challenges and questions ● Get 100 million participants – simplified account management ● Get more projects ● Distributed file system support ● Use peer-to-peer communication – BitTorrent integration ● Use GPUs and other resources ● Integrate with Grid (Lattice, CERN)

Volunteer computing ● A new high-performance computing paradigm ● Benefits to projects: – enables otherwise infeasible computational research – economic advantage even for small projects ● Benefits to participants: – increase public scientific knowledge/interest – catalyze virtual communities – democratize resource allocation