Reinventing Chemical Information Management at the Environmental Protection Agency (EPA) Presented to the Chemical Information Division American Chemical Society March 30, 2000 Lois E. Fritts, Tommie G. Curtis Scientific Applications International Corporation SDC LF- 2043
Background to EPA 1948smogDonora, PA 1940sCuyahoga RiverCleveland, OH 1962pesticidesSilent Spring 1970sKeponeHopewell, VA 1970sPCBsHudson River 1970sPBBsMichigan 1970sValley of DrumsWest Point, KY 1970sLove Canal Niagara Falls
SDC LF EPA Formed in 1970 Single independent regulatory authority formed from federal organizations including: Department of Interior Food and Drug Administration Health Education and Welfare Department of Agriculture Atomic Energy Commission Council of Environmental Quality
SDC LF EPA Legislative Authority CAA NEPA CWA FIFRA ODA SDWA TSCA RCRA ERDDA CERCLA EPCRA PPA
SDC LF Chemical Information Management Problems EPA formed from many separate organizations EPA programs funded independently by Congress No EPA standards to ensure consistent identification of chemical substances and groupings
SDC LF Chemical Data Problems Chemical substances are identified and represented inconsistently in Agency data systems and in environmental regulations EPA is interested in chemical classes and categories that have not been registered by Chemical Abstracts Service (CAS) CAS Registry Numbers are not always validated or used appropriately
SDC LF Lindane Identifiers Hazardous Waste Codes D013 U129 Parameter Codes CAS Registry Numbers Pollutant Codes HCCH
SDC LF Lindane Names R-BHC (Lindane) gamma gamma-BHC BHC-gamma Lindane 1,2,3,4,5,6-Hexachlorocyclohexane, gamma Benzene hexachloride, gamma
SDC LF Interim Attempts to Resolve Problems EPA CAS Registry Number Data Standard Required CAS Registry Number for all data records about chemical substances Envirofacts Master Chemical Integrator (EMCI) Provided a cross reference to chemical identifiers across EPA data systems
SDC LF More is Needed EPA CAS Registry Number Data Standard does not go far enough to establish accuracy and consistency EMCI is limited to the data systems in the Envirofacts warehouse; it does not address the scope of Agency needs for integrated data sharing
SDC LF Reinventing Environmental Information (REI) Program Need for standards to represent key identifiers in six essential areas: Date Facility Latitude/longitude Business classifications Biological taxonomy Chemical identification
SDC LF Data Standards Process Establish workgroups of knowledgeable EPA staff Define exact data requirements for the standard Determine business rules for implementation Record data elements and rules in the EPA Environmental Data Registry (EDR)
SDC LF Interim Chemical Identification Data Standard CAS Registry Number CAS systematic chemical name (preferably using 9th CI nomenclature) EPA Chemical Registry Name EPA Identification Number for groupings of chemicals of interest to the Agency
SDC LF Optional Data Elements for Chemical Identification Molecular formula Molecular weight Sources of chemical names and synonyms Linear structural formula Graphic structural formula Definition, to further explain Comments, to add information
SDC LF Business Rules for Implementation Overview of the standard Definitions Applicability Data requirements Processing Roles and responsibilities Implementation Maintenance
SDC LF Special Requirements Determine the EPA Chemical Registry Name Establish a central EPA Chemical Registry System (CRS) to support the standard