ToxicDocs: A database of once-secret chemical industry documents

 

October 14, 2025
2:00 pm US Eastern Time

The ToxicDocs dataset and website contain millions of pages of industry documents about lead, asbestos, silica, PCBs, and other toxic substances. This collection includes internal memoranda, emails, slides, board minutes, unpublished scientific studies, and other documents that became publicly available through toxic tort litigation. 

The resource has been tapped by researchers, journalists, and others exploring a new world of environmental health risk and how it came to be.

In this webinar, one of the ToxicDocs founders, Dr. Merlin Chowkwanyun, gave an overview of this continuously growing dataset, introducing the interface, explaining the technology behind it, and offering a tour of the searchable content. 

The collection is curated by data scientists and researchers at Columbia University's Center for the History and Ethics of Public Health and the City University of New York's Graduate Center. Recent innovations in parallel and cloud computing have made it easier to convert these documents into machine-readable, searchable text.

The discussion was moderated by CHE Science Communications Fellow Haleigh Cavalier.

This webinar is one in a series of conversations related to publicly available industry documents. These conversations may include a variety of opinions and perspectives. Any opinions expressed in these webinars are those of the speakers and do not necessarily reflect the views of CHE or its partner organizations.
