UL Safety Data Lake Introduces Enhancements
April 14, 2020
Data Science has introduced new data sources and functional enhancements to the UL Safety Data Lake to better serve its diverse audience. The Data Lake, publicly available at https://opendata.ul.org, consolidates open source data from multiple safety-focused databases into a single site environment. The resource enables product safety professionals to easily access multiple safety data sources—curated for content and quality—to support their product safety analysis and research.
Newly added data sets include the OECD Global Recalls database and Health Canada’s Consumer Product Recalls. Updated functionality includes natural language processing of incident narratives based on user-defined subjects, comprehensive and simultaneous search capability across all data sets, and recommendation engines that sort through data to find incidents of a similar nature to the ones currently being viewed.
Introduced in 2018, the Data Lake consolidated data sources from the U.S. Consumer Product Safety Commission (CPSC) SaferProducts.gov website and the National Electronic Injury Surveillance System (NEISS). Additional data sets include recall data from the EU’s Safety Gate, the FDA’s Medical Device incident database (Manufacturer and User Device Experience – MAUDE), and Pipeline and Hazardous Materials Safety Administration (PHMSA) incident data.
The site continues to provide data visualizations and summary statistics for each of the data files, with the ability to search and query the data in straightforward ways (keyword search, for example). Site visitors can also use an API to bring the data into Excel spreadsheets and analysis tools for further exploration.