Presentation is loading. Please wait.

Presentation is loading. Please wait.

Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation.

Similar presentations


Presentation on theme: "Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation."— Presentation transcript:

1 Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation

2 CMSC 838T – Presentation Overview u Overview of talk  Motivation  Challenge  Methods l Ensemble Dynamics l Folding@Home  Evaluation  Observations

3 CMSC 838T – Presentation Motivation u Atomistic simulation of protein folding  understand dynamics of folding  real-time folding in full atomic detail  large-scale parallelization methods u Benefits  protein folding & disease l protein self-assemble to function l proteins misfold  diseases  nanotechnology l nanomachines l self-assemble on the nanoscale

4 CMSC 838T – Presentation Challenge u Difficulties  limited by current computational techniques l fastest folding in microseconds l one CPU: 1ns/day, 30 years l 10,000 fold computational gap u 1,000 CPUs, 1 microsecond / day  traditional parallelization scheme l hard to scale to a large amount of processors l extremely fast communication l complexity of coordination l expensive supercomputers u cost u time-sharing

5 CMSC 838T – Presentation Method u ensemble dynamics  a new simulation algorithm  parallel simulation u Folding@Home  heterogeneous network, Internet  large-scale distributed platform

6 CMSC 838T – Presentation Simulation of Dynamics u free energy barrier  progress from one state to another: transition  thermal fluctuations to push system over free energy barrier u previous approaches: sampling  maybe stuck in meta-stable free energy minima  expensive computational cost of sampling

7 CMSC 838T – Presentation Ensemble Dynamics u application scenario  waiting time of transitions dominates total time  protein folding l transition: free energy barrier crossing  coupled simulations: transition coupling u Algorithm  M independent simulations from a initial condition  first simulation to cross free energy barrier l M times less to cross barrier than average time  restart M simulations with the new location after transition u Near linear speed up in #processors  exponential kinetics: f(t) = 1 – exp(-k t)  If k * t is small, f(t) = k * t  M simulations  M * f(t) = M * k * t folding events

8 CMSC 838T – Presentation Limitations u barrier crossing probability  exponential assumptions u correct transition detection  transition: free energy barrier crossing  a large variance in energy: threshold  correct detection is not guaranteed u multiple possible transition  not addressed  selection of the first transition

9 CMSC 838T – Presentation Distributed Computing u Distributed simulations  M processors for each run  simulate folding in atomic detail on each processor  restart once a crossing barrier event occurs u Implementation: Folding@Home  worldwide distributed computing: Internet  started in October 2000 l more than 200,000 participants l 10,000 CPU-years in the first 12 months

10 CMSC 838T – Presentation Folding@Home

11 CMSC 838T – Presentation Folding@Home u client-server architecture  server assign jobs(work unit) to client  client sends back results after computation  ~100K data transfer between client and server u why is ensemble dynamics good for Folding@Home?  CPU intensive job: a few hours, often days  connection speed: modem, good enough  suitable for Folding@Home

12 CMSC 838T – Presentation Other@Home Work u SETI@Home  search for intelligent life outside Earth  data analysis of signals u FightAids@Home  find drug therapy for HIV  how drugs interact with various HIV virus mutations u distributed projects  Divide-and-Conquer  CPU intensive jobs  small pieces of data(kilobytes) transfer  communication not a major concern

13 CMSC 838T – Presentation Evaluation u Folding@Home  based on Tinker molecular dynamics code  voluntary participants worldwide, over 400,000 CPUs u simulate folding and unfolding  folding rates  simulations on small proteins

14 CMSC 838T – Presentation Folding Rates

15 CMSC 838T – Presentation Folding & Unfolding

16 CMSC 838T – Presentation Observations u Sampling  too expensive to run for a long timescales  waste too much time lingering in local energy minima u Ensemble dynamics  speed up simulations of dynamics  biological meaning of simulations results?  results on large protein folding?  limitations: correct transition detection, transition probability u Folding@Home  cheap way to achieve super computation power  huge distributed computing platform: over 400,000 CPUs  an efficient approach for CPU intensive job u Complexity of problems and size of data increase rapidly  find better algorithm is preferable to buying supercomputers


Download ppt "Atomistic Protein Folding Simulations on the Submillisecond Timescale Using Worldwide Distributed Computing Qing Lu CMSC 838 Presentation."

Similar presentations


Ads by Google