pudl.extract.eia923#

Retrieves data from EIA Form 923 spreadsheets for analysis.

This module pulls data from archived copies of EIA’s published Excel spreadsheets.

Attributes#

Classes#

Extractor

Extractor for EIA form 923.

Functions#

extract_eia923(context, raw_eia923__all_dfs)

Extract raw EIA-923 data from excel sheets into dataframes.

Module Contents#

pudl.extract.eia923.logger[source]#
class pudl.extract.eia923.Extractor(*args, **kwargs)[source]#

Bases: pudl.extract.excel.ExcelExtractor

Extractor for EIA form 923.

METADATA[source]#

Instance of metadata object to use with this extractor.

BLACKLISTED_PAGES = ['plant_frame'][source]#

List of supported pages that should not be extracted.

cols_added = [][source]#
process_raw(df, page, **partition)[source]#

Prepare raw table for extraction.

Check extraction configuration is sensible, drop reserved columns, switch to standardized column names, and perform other broadly-applicable cleanup of data formats, types, and missingness.

static process_renamed(df, page, **partition)[source]#

Cleans up unnamed_0 column in stocks page, drops invalid plan_id_eia rows.

process_final_page(df, page)[source]#

Removes reserved columns from the final dataframe.

static get_dtypes(page, **partition)[source]#

Returns dtypes for plant id columns and county FIPS column.

pudl.extract.eia923.raw_eia923__all_dfs[source]#
pudl.extract.eia923.extract_eia923(context, raw_eia923__all_dfs)[source]#

Extract raw EIA-923 data from excel sheets into dataframes.