pudl.extract.eia176

Extract EIA Form 176 data from CSVs.

Attributes

Classes

Extractor

Extractor for EIA form 176.

Functions

raw_eia176_asset_factory(in_page[, out_page])

Create raw EIA 176 asset for a specific page.

Module Contents

class pudl.extract.eia176.Extractor(*args, **kwargs)[source]

Bases: pudl.extract.csv.CsvExtractor

Extractor for EIA form 176.

METADATA[source]

Instance of metadata object to use with this extractor.

source_filename(page: str, **partition: pudl.extract.extractor.PartitionSelection) str[source]

Produce the source file name as it will appear in the ZIP archive.

For this archive in particular, we control the naming of the CSV files because they are created by scraping EIA’s natural gas query viewer interface. Rather than creating a file_map.csv like the other EIA extractors, we handle the one the missing company_list file in certain years here, returning “-1” since that mirrors the behavior of the other extractors that do rely on file_map.csv.

Parameters:
  • page – the name of the “page” within the dataset to extract. For EIA-176 this is the descriptive portion of the name of one of the CSV files in the ZIP archive, e.g. “natural_gas_deliveries”.

  • partition – a dictionary uniquely identifying a partition to extract, e.g. {“year”: “2019”, “format”: “by_report”}

Returns:

Full name of the CSV file within the ZIP archive as a string.

process_raw(df: pandas.DataFrame, page: str, **partition: pudl.extract.extractor.PartitionSelection) pandas.DataFrame[source]

Append report year to df to distinguish data from other years.

pudl.extract.eia176.raw_eia176__all_dfs[source]
pudl.extract.eia176.raw_eia176_asset_factory(in_page: str, out_page: str | None = None)[source]

Create raw EIA 176 asset for a specific page.

pudl.extract.eia176.raw_eia176_assets[source]