1 Recent Enhancements to Quality Assurance and Case Management within the Emissions Modeling Framework
Alison Eyth, R. Partheepan, Q. He
Carolina Environmental Program, University of North Carolina at Chapel Hill
Marc Houyoux
Emissions Inventory and Analysis Group, U.S. EPA OAQPS

2 EMF Goals
- Improve timeliness and quality of data used in emissions modeling
- Provide transparency and tracking of:
  - Data (by using versions and metadata)
  - Quality assurance steps on the data
  - Usage of data for emissions modeling applications
- Create tools that can be used by EPA and others
- Support criteria and toxics modeling

3 EMF Components
- Data Management with Versioning
- Quality Assurance
  - Tracking, automating of QA procedures
- Case Management
  - Running SMOKE and other programs
- Control Strategy Development
- Problem Tracking System
- Surrogate and Speciation Tools

4 EMF Project Timeline
- October 2004: Design process began
- June 2005: Implementation began
  - Client-server Java-based system
- Spring 2006: Data Management and Quality Assurance Tracking deployed
- September 2006 version included:
  - Running SQL Quality Assurance Steps
  - First version of Case Management
  - First version of Strategy Development

5 EMF Architecture at EPA
[Diagram labels: Clients; 4 CPU Application & Database Server (Data Management, Case Management, Quality Assurance, Strategy Devel.); Compute Cluster; Shared Disk; SMOKE input files; "imports & exports data"; "starts and tracks runs"]

6 Case Management
- A Case stores information about SMOKE (and other model) runs:
  - Summary attributes (i.e., metadata)
  - Inputs to programs
  - Program/Model Parameters
  - Programs to run
  - Outputs from programs
  - History and results of the runs
- A Case should have all information needed to run SMOKE programs and track results
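The kinds of information a Case holds can be pictured as a simple record. The sketch below is only an illustration in Python, not the EMF's actual Java data model: the class names, field names, the `missing_sections` helper, and the example values are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class CaseInput:
    """One input to a program: a named Dataset pinned to a version (hypothetical)."""
    dataset_name: str
    version: int


@dataclass
class Case:
    """Hypothetical record mirroring the categories of information a Case stores."""
    name: str
    summary: dict = field(default_factory=dict)     # summary attributes (metadata)
    inputs: list = field(default_factory=list)      # CaseInput entries
    parameters: dict = field(default_factory=dict)  # program/model parameters
    programs: list = field(default_factory=list)    # programs to run
    outputs: list = field(default_factory=list)     # outputs from programs
    history: list = field(default_factory=list)     # history and results of runs

    def missing_sections(self):
        """Return the names of run-critical sections that are still empty."""
        required = ("inputs", "parameters", "programs")
        return [name for name in required if not getattr(self, name)]


# Illustrative values only; the case name and metadata are made up.
case = Case(name="example_case", summary={"year": 2002})
case.inputs.append(CaseInput("NC 2002 NEI Point Inv.", version=1))
case.programs.append("smkinven")
print(case.missing_sections())
```

A check like `missing_sections` reflects the idea that a Case should carry everything needed to run the programs; here it reports that no parameters have been set yet.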

8 Case Manager
- Cases are created, edited, copied, and removed from the Case Manager
- Summary attributes assist with selection

10 Case Editor – Parameters Tab
- FY07 Parameters table: Parameter Name, Sector, Program, Envt. Var., Type, Required?

11 Case Editor – Inputs Tab
- Specifies Input Datasets and Versions

13 Summary of FY06 Features
- Case Manager can create, edit, copy, and delete Cases
- Summary information (metadata) can be specified for a Case
- Inputs to a Case can be specified, including choosing specific versions of Datasets to use in the Case
- Specified versions of input Datasets can be exported for use by SMOKE

14 Planned FY07 Case Management Enhancements
- Finish the Parameters, Programs, Outputs, and History tabs of the Case Editor
- Support writing scripts to run SMOKE programs on compute server
- Manage runs of SMOKE on compute server
- Add problem tracking for Cases
- Implement user/group/world permissions
- Register outputs of Cases as Datasets

15 Goals for Quality Assurance in EMF
- Support QA of Datasets prior to their use in SMOKE
- Integrate with EMF data management
- Specify a list of QA Steps to be performed on each dataset type (i.e., type of data)
- Track the QA steps and their results for multiple versions of Datasets
- Track information about the progress of the steps: status, who, when, etc.
- Automate (speed up) the QA process

16 Tracking QA Steps in EMF
- First, set up “QA Step Templates” for EMF Dataset Types
- Create “QA Steps” using the Templates by copying into the Dataset properties
- Add any ad-hoc QA Steps (not from templates) to Dataset properties
- Record results of the steps for each version of a Dataset
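The template-then-copy workflow above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the EMF implementation: the class names, the status strings, the template names, and the user name and date are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class QAStepTemplate:
    """A reusable step definition attached to a Dataset Type (hypothetical)."""
    name: str
    required: bool


@dataclass
class QAStep:
    """A step tracked for one version of one Dataset (hypothetical)."""
    name: str
    required: bool
    dataset: str
    version: int
    status: str = "Not Started"
    who: Optional[str] = None
    when: Optional[str] = None
    comment: str = ""


def steps_from_templates(templates, dataset, version):
    """Copy each template into a fresh, independently tracked QA Step."""
    return [QAStep(t.name, t.required, dataset, version) for t in templates]


# Templates are registered per Dataset Type; these step names are made up.
templates_by_type = {
    "ORL Point Inventory": [
        QAStepTemplate("Summarize by pollutant", required=True),
        QAStepTemplate("Check for duplicates", required=False),
    ]
}

dataset = "NC 2002 NEI Point Inv."
steps = steps_from_templates(templates_by_type["ORL Point Inventory"], dataset, 0)

# An ad-hoc step is added directly, without a template.
steps.append(QAStep("Spot-check largest sources", False, dataset, 0))

# Record the result of one step for this version of the Dataset.
steps[0].status = "Complete"
steps[0].who = "analyst1"
steps[0].when = "2006-09-15"
steps[0].comment = "Totals look reasonable"
```

Because each step is a copy rather than a reference to the template, results recorded for one Dataset version do not affect the template or other Datasets of the same type.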

17 Dataset Type Manager

19 Set Up a QA Step Template
- Enter a SQL query; the $TABLE[#] syntax allows the query to be generic across multiple datasets
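One way to picture how a `$TABLE[#]` placeholder keeps a query generic is a simple substitution step before the SQL is run. The Python sketch below is an assumption-laden illustration, not the EMF's actual expansion logic: it assumes the number is a 1-based index into a list of table names, uses an in-memory SQLite database as a stand-in for PostgreSQL, and the `expand_tables` function, table name, and column names are all hypothetical.

```python
import re
import sqlite3


def expand_tables(query, tables):
    """Replace each $TABLE[n] placeholder with the n-th table name (assumed 1-based)."""
    def substitute(match):
        return tables[int(match.group(1)) - 1]
    return re.sub(r"\$TABLE\[(\d+)\]", substitute, query)


# A generic template query; column names are illustrative only.
template = (
    "SELECT poll, SUM(ann_emis) AS total "
    "FROM $TABLE[1] GROUP BY poll ORDER BY poll"
)

# Stand-in data for one version of one dataset.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE nc_point_v0 (poll TEXT, ann_emis REAL)")
conn.executemany(
    "INSERT INTO nc_point_v0 VALUES (?, ?)",
    [("NOX", 10.0), ("NOX", 5.0), ("SO2", 2.5)],
)

# The same template can be pointed at any dataset's table.
rows = conn.execute(expand_tables(template, ["nc_point_v0"])).fetchall()
print(rows)  # [('NOX', 15.0), ('SO2', 2.5)]
```

Since the table name is spliced into the SQL text, a real system would need to take table names only from its own trusted catalog rather than from user input.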

20 QA Step Templates vs. QA Steps
[Diagram: the Dataset Type "ORL Point Inventory" has a list of point-specific QA Step Templates; the Dataset "NC 2002 NEI Point Inv." has a list of QA Steps with result, who, when, comment. Connector labels: "has a", "Get list of templates", "Copy templates to QA Steps"]

21 Summary of QA Steps for all Versions of a Dataset
- "Add from Template" adds steps from the dataset type; "Add Custom" is for ad-hoc steps

23 Summary of FY06 QA Features
- Can define required and optional steps for each type of dataset (codifies the QA process)
- For a particular dataset, steps can be quickly copied from templates, or custom steps can be added
- Tracking is performed for each step: who did it, when, status, comment
- SQL steps can be run and results exported

24 FY07 Quality Assurance Plans
- View results of steps
- Analyze results of steps using Analysis Engine (e.g., create plots)
- Enhance the SQL syntax to support referencing other steps and Datasets
- Support comparison of the results of two similar QA steps (e.g., compare old and new totals)
- Support running more types of steps
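The planned comparison of two similar QA step results (old versus new totals) could look roughly like the following Python sketch. It is not the EMF's design: the `compare_totals` function, the percentage tolerance, and the sample totals are assumptions chosen only to make the idea concrete.

```python
def compare_totals(old, new, tolerance_pct=1.0):
    """Flag keys whose totals changed by more than tolerance_pct percent.

    Returns (key, old_total, new_total, pct_change) tuples; pct_change is
    None when the key had no old total to compare against.
    """
    flagged = []
    for key in sorted(set(old) | set(new)):
        before = old.get(key, 0.0)
        after = new.get(key, 0.0)
        if before == 0.0:
            pct = None
            changed = after != 0.0
        else:
            pct = 100.0 * (after - before) / before
            changed = abs(pct) > tolerance_pct
        if changed:
            flagged.append((key, before, after, pct))
    return flagged


# Made-up pollutant totals from two runs of the same kind of QA step.
old_totals = {"NOX": 100.0, "SO2": 50.0, "VOC": 20.0}
new_totals = {"NOX": 100.5, "SO2": 40.0, "CO": 5.0}

flagged = compare_totals(old_totals, new_totals)
for row in flagged:
    print(row)
```

With these sample values, NOX changes by only 0.5 percent and is not flagged, while SO2, the newly appearing CO, and the missing VOC are reported.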

25 EMF Software Requirements
- Java 1.4 or 1.5
- PostgreSQL 8.1
- Apache Tomcat
- Tested on Linux and Windows
- Should run on other operating systems that support Java and other required software
- Deployment configuration is flexible: runs on a single computer or several

26 Availability of EMF
- Public EMF release is not yet funded, but direct arrangements can be made
- Source code can be downloaded from SourceForge
- May hold training class at 2007 Emissions Inventory Conference

28 Case Editor – Programs Tab (FY07)
- Shows the Sector, Program Name, Program Version, Arguments, Run Order, Whether to Run?, Run Status, and Path
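The Programs tab columns suggest how a run list might be derived: keep the entries marked to run and sort them by run order. The Python sketch below illustrates that idea only; the `ProgramEntry` class, the `programs_to_run` helper, and the sector, version, and run-order values are hypothetical and not taken from the EMF.

```python
from dataclasses import dataclass


@dataclass
class ProgramEntry:
    """One row of a Programs table, with fields named after the tab's columns."""
    sector: str
    name: str
    version: str
    arguments: str
    run_order: int
    run: bool
    status: str = "Not Started"
    path: str = ""


def programs_to_run(entries):
    """Return the entries flagged to run, sorted by their run order."""
    return sorted((e for e in entries if e.run), key=lambda e: e.run_order)


# Illustrative rows; sector and version strings are placeholders.
entries = [
    ProgramEntry("point", "smkmerge", "v1", "", run_order=4, run=True),
    ProgramEntry("point", "smkinven", "v1", "", run_order=1, run=True),
    ProgramEntry("point", "temporal", "v1", "", run_order=3, run=False),
    ProgramEntry("point", "grdmat", "v1", "", run_order=2, run=True),
]

run_list = [e.name for e in programs_to_run(entries)]
print(run_list)  # ['smkinven', 'grdmat', 'smkmerge']
```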

29 Case Editor – Outputs Tab (FY07)
- Shows Output Name, Sector, Program, Dataset Name, Dataset Type, Environment Variable, whether it is Required or Available, and whether it should be Registered in the EMF as a Dataset

