Skip to main content
Log in

An automated framework for compiling and integrating chemical hazard data

  • Original Paper
  • Published:
Clean Technologies and Environmental Policy Aims and scope Submit manuscript

Abstract

Comparative chemical hazard assessment, which compares hazards for several endpoints across several chemicals, can be used for a variety of purposes including alternatives assessment and the prioritization of chemicals for further assessment. A new framework was developed to compile and integrate chemical hazard data for several human health and ecotoxicity endpoints from public online sources including hazardous chemical lists, Globally Harmonized System hazard codes (H-codes) or hazard categories from government health agencies, experimental quantitative toxicity values, and predicted values using Quantitative Structure–Activity Relationship (QSAR) models. QSAR model predictions were obtained using EPA’s Toxicity Estimation Software Tool. Java programming was used to download hazard data, convert data from each source into a consistent score record format, and store the data in a database. Scoring criteria based on the EPA’s Design for the Environment Program Alternatives Assessment Criteria for Hazard Evaluation were used to determine ordinal hazard scores (i.e., low, medium, high, or very high) for each score record. Different methodologies were assessed for integrating data from multiple sources into one score for each hazard endpoint for each chemical. The chemical hazard assessment (CHA) Database developed in this study currently contains more than 990,000 score records for more than 85,000 chemicals. The CHA Database and the methods used in its development may contribute to several cheminformatics, public health, and environmental activities.

Graphic abstract

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

Download references

Acknowledgements

The authors thank Valery Tkachenko and Jonathan Fox for their assistance with code development and data mining. The authors thank Michael Gonzalez, William Barrett, Richard Judson, and Maureen Gwinn for helpful feedback on earlier drafts of the manuscript, and Sudhakar Takkellapati, Kidus Tadele, Paul Harten, Jane Bare, Grace Patlewicz, and Antony Williams for helpful suggestions and insights on comparative chemical hazard assessment. Dr. Vegosen gratefully acknowledges support by an appointment to the Research Participation Program at the U.S. Environmental Protection Agency (EPA), administered by the Oak Ridge Institute for Science and Education (ORISE) through an interagency agreement between the U.S. Department of Energy and EPA.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Todd M. Martin.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The views expressed in this journal article are those of the authors and do not necessarily represent the views or policies of the U.S. Environmental Protection Agency.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (XLSX 81161 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vegosen, L., Martin, T.M. An automated framework for compiling and integrating chemical hazard data. Clean Techn Environ Policy 22, 441–458 (2020). https://doi.org/10.1007/s10098-019-01795-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10098-019-01795-w

Keywords

Navigation