Catalyst Cooperative Documentation Index#

The Public Utility Data Liberation Project (PUDL)#

PUDL (pronounced puddle) is a data processing pipeline created by Catalyst Cooperative that cleans, integrates, and standardizes some of the most widely used public energy datasets in the US. The data serves researchers, activists, journalists, and policy makers that might not have the technical expertise to access it in its raw form, the time to clean and prepare the data for bulk analysis, or the means to purchase it from existing commercial providers.

PUDL Examples#

A collection of example notebooks that work with PUDL data.

PUDL Data Archivers#

This repo implements data archivers for The Public Utility Data Liberation Project (PUDL). It is responsible for downloading raw data from multiple sources, and create Zenodo archives containing that data.

Open Energy Data For All#

A two-day, 16-hour course on foundational software and data engineering skills aimed at energy systems graduate students looking to generate more robust, replicable energy analyses. Developed using The Carpentries pedagogical framework with financial support from the Alfred P. Sloan Foundation.

Catalyst Agent Skills#

A collection of agent skills that help LLM-based agents work with the PUDL data, metadata, and codebase.

FERC XBRL Extractor#

A Python package for extracting FERC Form 1, 2, 6, 60, & 714 data from the XML-based XBRL format it’s published in, into file-based SQLite and DuckDB databases for analytical use.

CAMD-EIA Crosswalk#

This repository is a fork of the original EPA-EIA crosswalk repo, and updates the system to be able to use any recent year of data and to build a crosswalk that covers multiple years of data.

Eel Hole#

Eel-hole is the web-app that runs the PUDL Data Viewer

Member Handbook#

The Catalyst handbook contains the cooperative’s bylaws, policies and general operating information.