Efficient Implementation of Complex Interventions in Large Scale Epidemic Simulations Network Dynamics & Simulation Science Laboratory Jiangzhuo Chen Joint.

Slides:



Advertisements
Similar presentations
High level QA strategy for SQL Server enforcer
Advertisements

1 Scoped and Approximate Queries in a Relational Grid Information Service Dong Lu, Peter A. Dinda, Jason A. Skicewicz Prescience Lab, Dept. of Computer.
The State of the Art in Distributed Query Processing by Donald Kossmann Presented by Chris Gianfrancesco.
An Efficient Multi-Dimensional Index for Cloud Data Management Xiangyu Zhang Jing Ai Zhongyuan Wang Jiaheng Lu Xiaofeng Meng School of Information Renmin.
Modeling and Analysis of Random Walk Search Algorithms in P2P Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE, Rensselaer Polytechnic Institute.
A Java Architecture for the Internet of Things Noel Poore, Architect Pete St. Pierre, Product Manager Java Platform Group, Internet of Things September.
Objektorienteret Middleware Presentation 2: Distributed Systems – A brush up, and relations to Middleware, Heterogeneity & Transparency.
Project 4 U-Pick – A Project of Your Own Design Proposal Due: April 14 th (earlier ok) Project Due: April 25 th.
Chapter 6: Database Evolution Title: AutoAdmin “What-if” Index Analysis Utility Authors: Surajit Chaudhuri, Vivek Narasayya ACM SIGMOD 1998.
Communication-Efficient Distributed Monitoring of Thresholded Counts Ram Keralapura, UC-Davis Graham Cormode, Bell Labs Jai Ramamirtham, Bell Labs.
Unicenter Desktop & Server Management Scaling Options - SQL -Latest Revision June Read the notes pages.
Internet Quarantine: Requirements for Containing Self-Propagating Code David Moore et. al. University of California, San Diego.
Comparison of Private vs. Public Interventions for Controlling Influenza Epidemics Joint work with Chris Barrett, Jiangzhuo Chen, Stephen Eubank, Bryan.
Concepts and value of TI-Nspire™ Technology Module A.
Students: Nadia Goshmir, Yulia Koretsky Supervisor: Shai Rozenrauch Industrial Project Advanced Tool for Automatic Testing Final Presentation.
Impact Analysis of Database Schema Changes Andy Maule, Wolfgang Emmerich and David S. Rosenblum London Software Systems Dept. of Computer Science, University.
1 Web Database Processing. Web Database Applications Static Report Publishing a report is prepared from a database application and exported to HTML DB.
Ekrem Kocaguneli 11/29/2010. Introduction CLISSPE and its background Application to be Modeled Steps of the Model Assessment of Performance Interpretation.
ADLB Update Recent and Current Adventures with the Asynchronous Dynamic Load Balancing Library Rusty Lusk Mathematics and Computer Science Division Argonne.
Multiple testing correction
Managing the development and purchase of information systems (Part 1)
Case Study : Morcom Trading – P BSC 21
Zhiyong Wang In cooperation with Sisi Zlatanova
Winrunner Usage - Best Practices S.A.Christopher.
Lecture Set 14 B new Introduction to Databases - Database Processing: The Connected Model (Using DataReaders)
DBSQL 14-1 Copyright © Genetic Computer School 2009 Chapter 14 Microsoft SQL Server.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
Interaction-Based HPC Modeling of Social, Biological, and Economic Contagions Over Large Networks Network Dynamics & Simulation Science Laboratory Jiangzhuo.
Comparing Effectiveness of Top- Down and Bottom-Up Strategies in Containing Influenza Achla Marathe, Bryan Lewis, Christopher Barrett, Jiangzhuo Chen,
1 Wenguang WangRichard B. Bunt Department of Computer Science University of Saskatchewan November 14, 2000 Simulating DB2 Buffer Pool Management.
EpiFast: A Fast Algorithm for Large Scale Realistic Epidemic Simulations on Distributed Memory Systems Keith R. Bisset, Jiangzhuo Chen, Xizhou Feng, V.S.
DFTL: A flash translation layer employing demand-based selective caching of page-level address mappings A. gupta, Y. Kim, B. Urgaonkar, Penn State ASPLOS.
Parallel Execution Plans Joe Chang
Lecture Set 14 B new Introduction to Databases - Database Processing: The Connected Model (Using DataReaders)
Systems Analysis and Design in a Changing World, Fourth Edition
Scaling Agent-based Simulation of Contagion Diffusion over Dynamic Networks on Petascale Machines Keith Bisset Jae-Seung Yeom, Ashwin Aji
Network theory 101 Temporal effects What we are interested in What kind of relevant temporal /topological structures are there? Why? How does.
Ian Bird GDB; CERN, 8 th May 2013 March 6, 2013
1 Admission Control and Request Scheduling in E-Commerce Web Sites Sameh Elnikety, EPFL Erich Nahum, IBM Watson John Tracey, IBM Watson Willy Zwaenepoel,
Making Watson Fast Daniel Brown HON111. Need for Watson to be fast to play Jeopardy successfully – All computations have to be done in a few seconds –
CIS/SUSL1 Fundamentals of DBMS S.V. Priyan Head/Department of Computing & Information Systems.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Simulating Diffusion Processes on Very Large Complex networks Joint work with Keith Bisset, Xizhou Feng, Madhav Marathe, and Anil Vullikanti Jiangzhuo.
Coevolution of Epidemics, Social Networks, and Individual Behavior: A Case Study Joint work with Achla Marathe, and Madhav Marathe Jiangzhuo Chen Network.
Query Processing – Query Trees. Evaluation of SQL Conceptual order of evaluation – Cartesian product of all tables in from clause – Rows not satisfying.
WMT The CSDMS Web Modeling Tool Mark Piper Eric Hutton Irina Overeem
Optimal Interventions in Infectious Disease Epidemics: A Simulation Methodology Jiangzhuo Chen Network Dynamics & Simulation Science Laboratory INFORMS.
Comparison of Individual Behavioral Interventions and Public Mitigation Strategies for Containing Influenza Epidemic Joint work with Chris Barrett, Stephen.
Fast Parallel Algorithms for Edge-Switching to Achieve a Target Visit Rate in Heterogeneous Graphs Maleq Khan September 9, 2014 Joint work with: Hasanuzzaman.
Level 1-2 Trigger Data Base development Current status and overview Myron Campbell, Alexei Varganov, Stephen Miller University of Michigan August 17, 2000.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
Written By: Presented By: Swarup Acharya,Amr Elkhatib Phillip B. Gibbons, Viswanath Poosala, Sridhar Ramaswamy Join Synopses for Approximate Query Answering.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
A web based tool for estimation of Gage R&R and Measurement Uncertainty Siva Venkatachalam & Dr. Jay Raja Center for Precision Metrology The University.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
ADO .NET from. ADO .NET from “ADO .Net” Evolution/History of ADO.NET MICROSOFT .NET “ADO .Net” Evolution/History of ADO.NET History: Most applications.
Efficient Data Management Tools for the Heterogeneous Big Data Warehouse Autors: Aleksandr Alekseev (Programmer), Victoria Osipova (Associate professor),
INTRODUCTION TO WIRELESS SENSOR NETWORKS
Internet Quarantine: Requirements for Containing Self-Propagating Code
Business System Development
Simulation Production System
Ruslan Fomkin and Tore Risch Uppsala DataBase Laboratory
Drum: A Rhythmic Approach to Interactive Analytics on Large Data
Performance Load Testing Case Study – Agilent Technologies
Predictive Performance
Admission Control and Request Scheduling in E-Commerce Web Sites
Chapter 5 Architectural Design.
Performance And Scalability In Oracle9i And SQL Server 2000
Storing and Processing Sensor Networks Data in Public Clouds
Presentation transcript:

Efficient Implementation of Complex Interventions in Large Scale Epidemic Simulations Network Dynamics & Simulation Science Laboratory Jiangzhuo Chen Joint work with Yifei Ma, Keith Bisset, Suruchi Deodhar, and Madhav Marathe Winter Simulation Conference December 14, 2011

Network Dynamics & Simulation Science Laboratory Talk Outline Background –Interventions in large scale epidemic simulations –Indemics simulation framework Productivity Enhancement with Indemics –Efficient intervention implementation –Comparison results from real experiences Performance Modeling and Prediction –Methodology: explained by examples –Experiment results Summary

Network Dynamics & Simulation Science Laboratory Large Scale Agent-Based Epidemic Simulation Disease diffusion in a population (millions of agents) through agent-agent contacts (billions) Real world intervention policy to epidemics can be very complex and difficult to predefine/code. Many possible interventions with multiple configurable parameters: large (factorial) simulation design Ideally we would like: Fast simulation Capability to represent complicated realistic interventions Appropriate experiment design for given study with a deadline

Network Dynamics & Simulation Science Laboratory Complex Interventions Vaccinate randomly chosen people Vaccinate people with high degrees in contact network Keep all school age children home for 2 weeks Each county decides to close its schools if number of diagnosed students in the county exceeds threshold; students from closed schools stay home. Same as above, plus for each student that stays home, if age<12 then a guardian must stay home too not so realistic more realistic

Network Dynamics & Simulation Science Laboratory Indemics: System Architecture HPC Epidemic Simulator (e.g. EpiFast) Indemics Adapter Indemics Server New Interventions New epidemic dynamics Indemics Adapter Queries & Interventions Interactive Client Batch Client Indemics web-interface client on PC Analyst sees only this module Indemics database running on a data server Indemics Server, running on head node of HPC Relational database Temporal database Semi- structured database

Network Dynamics & Simulation Science Laboratory Indemics: System Architecture HPC Epidemic Simulator (e.g. EpiFast) Indemics Adapter Indemics Server New Interventions New epidemic dynamics Indemics Adapter Queries & Interventions Interactive Client Batch Client Indemics web-interface client on PC Analyst sees only this module Indemics database running on a data server Indemics Server, running on head node of HPC Relational database Temporal database Semi- structured database

Network Dynamics & Simulation Science Laboratory Indemics: System Architecture HPC Epidemic Simulator (e.g. EpiFast) Indemics Adapter Indemics Server New Interventions New epidemic dynamics Indemics Adapter Queries & Interventions Interactive Client Batch Client Indemics web-interface client on PC Analyst sees only this module Indemics database running on a data server Indemics Server, running on head node of HPC Temporal database Semi- structured database Relational database

Network Dynamics & Simulation Science Laboratory Indemics: System Architecture HPC Epidemic Simulator (e.g. EpiFast) Indemics Adapter Indemics Server New Interventions New epidemic dynamics Indemics Adapter Queries & Interventions Interactive Client Batch Client Indemics web-interface client on PC Analyst sees only this module Indemics database running on a data server Indemics Server, running on head node of HPC Relational database Temporal database Semi- structured database

Network Dynamics & Simulation Science Laboratory Epidemic Intervention Implementation diffusion code (C++) intervention code (C++) EpiFast diffusion code (C++) Intervention script Indemics EpiFast DBMS framework code (Java)

Network Dynamics & Simulation Science Laboratory Scenario 1: Benefit of Indemics EpiFast is a fast epidemic simulation tool in our lab It can represent intervention in the form: –if predefined global conditions and local conditions are satisfied for a predefined set of nodes, then change node properties and/or labels of edges incident on them –(a) antiviral prophylaxis to randomly chosen people –(b) keep all primary school students home It took too much coding effort (weeks) to implement –(a’) antiviral treatment to sick people –(b’) keep all primary school students home and let their guardians stay home too With Indemics, it took only hours to script (a’) or (b’)

Network Dynamics & Simulation Science Laboratory Performance and Productivity We have been concerned about performance of HPC simulation tools Human effort starts to be the bottleneck –Understand, implement, and verify new intervention strategies designed by epidemiologists –Set up simulations; run simulations –Post simulation analysis Indemics: improve human productivity while maintaining simulation performance –Development cost reduction

Network Dynamics & Simulation Science Laboratory Compare Different Ways to Implement Interventions Development cost Triggered intervention: when fraction of diagnosed school-age children exceeds 20% close all schools Targeted intervention: treat diagnosed school-age children with antiviral School (block) intervention: vaccinate all people in any school (census block) if over 5% in that school (block) are diagnosed

Network Dynamics & Simulation Science Laboratory Improvement in Study Total Turnaround Time

Network Dynamics & Simulation Science Laboratory Big Saving in Human Effort

Network Dynamics & Simulation Science Laboratory Reasonable Performance Overhead

Network Dynamics & Simulation Science Laboratory Scenario 2: Motivation for Performance Modeling Epidemiologist in our lab wanted to run a large experiment (factorial design) with complex interventions Simulation results needed in a week Decided to use Indemics. But could simulation finish in time? We applied performance model and predicted two weeks running time Epidemiologist revised experiment design (cut half) Simulation done in one week

Network Dynamics & Simulation Science Laboratory Example of Indemics Intervention Script School intervention: provide vaccines to all students in any school where more than 5% students are sick initialization; define School_Trigger as SCHOOL_DIAGNOSED_TOTAL.persons > 0.05 * SCHOOL_INTERVENED.size; reset table SCHOOL_INTERVENED intervened_day = NULL; for Day = 1 to 10 do count new_diagnosed each school save_to SCHOOL_DIAGNOSED_TODAY; count accum_diagnosed each school save_to SCHOOL_DIAGNOSED_TOTAL; set SCHOOL_INTERVENED intervened_day = Day if intervened_day = NULL and School_Trigger = true; apply Vaccination to school in SCHOOL_INTERVENED where intervened_day = Day; done

Network Dynamics & Simulation Science Laboratory Translated into SQL initialization; define School_Trigger as SCHOOL_DIAGNOSED_TOTAL.persons > 0.05 * SCHOOL_INTERVENED.size; reset table SCHOOL_INTERVENED intervened_day = NULL; update SCHOOL_INTERVENED set intervened_day = -1; for Day = 1 to 10 do count new_diagnosed each school save_to SCHOOL_DIAGNOSED_TODAY; insert into SCHOOL_DIAGNOSED_TODAY select school, count(pid) as persons, Day as diag_time from STUDENT, DIAGNOSED where pid = diagnosed_pid and diagnosed_time = Day; count accum_diagnosed each school save_to SCHOOL_DIAGNOSED_TOTAL; set SCHOOL_INTERVENED intervened_day = Day if intervened_day = NULL and School_Trigger = true; apply Vaccination to school in SCHOOL_INTERVENED where intervened_day = Day; select pid from STUDENT, SCHOOL_INTERVENED where STUDENT.school = SCHOOL_INTERVENED.school and intervened_day = Day; done

Network Dynamics & Simulation Science Laboratory Performance Prediction Prepare SQL atom statement performance lookup table Given Indemics intervention script (S) for a study Generate corresponding SQL query statements (Q 1, Q 2, …, Q i ) Decompose query statements into atoms Estimate configurations of the atoms (size of table and result) Look up atom running time AP(a) For each query compute query time QP(Q)=sum of AP(a) Compute script running time SP(S) = sum of query time QP(Q) Predicted running time is only a rough estimate.

Network Dynamics & Simulation Science Laboratory Atomic SQL Statements Examples of basic SQL statement in Indemics script

Network Dynamics & Simulation Science Laboratory Example of Performance Lookup Table Data collected for Oracle 10g on server with 4 quad-core 2.4GHz Xeon processors and 64GB memory

Network Dynamics & Simulation Science Laboratory School Intervention in Relational Algebra

Network Dynamics & Simulation Science Laboratory DB Query Time for School Intervention: Predicted vs. Actual

Network Dynamics & Simulation Science Laboratory Summary Indemics is a database-driven high-performance high-productivity epidemic simulation framework It enables realistic representation and efficient implementation of complex intervention strategies We provide performance modeling for predicting simulation running time before running it.