pudl.extract.vcerare
====================

.. py:module:: pudl.extract.vcerare

.. autoapi-nested-parse::

   Extract VCE Resource Adequacy Renewable Energy (RARE) Power Dataset.

   This dataset has 1,000s of columns, so we don't want to manually specify a rename on
   import because we'll pivot these to a column in the transform step. We adapt the
   standard extraction infrastructure to simply read in the data.

   Each annual zip folder contains a folder with three files:
   Wind_Power_140m_Offshore_county.csv
   Wind_Power_100m_Onshore_county.csv
   Fixed_SolarPV_Lat_UPV_county.csv

   The drive also contains one more CSV file: vce_county_lat_long_fips_table.csv. This gets
   read in when the fips partition is set to True.


Attributes
----------

.. autoapisummary::

   pudl.extract.vcerare.logger
   pudl.extract.vcerare.VCERARE_PAGES


Functions
---------

.. autoapisummary::

   pudl.extract.vcerare._clean_column_names
   pudl.extract.vcerare.extract_vcerare
   pudl.extract.vcerare.raw_vcerare__lat_lon_fips


Module Contents
---------------

.. py:data:: logger

.. py:data:: VCERARE_PAGES

.. py:function:: _clean_column_names(table_relation: duckdb.DuckDBPyRelation) -> duckdb.DuckDBPyRelation

   Apply basic cleaning to column names.


.. py:function:: extract_vcerare(context) -> tuple[dict[int, pudl.helpers.ParquetData], dict[int, pudl.helpers.ParquetData], dict[int, pudl.helpers.ParquetData]]

   Extract data from all vcerare pages and write to parquet files.


.. py:function:: raw_vcerare__lat_lon_fips(context) -> pandas.DataFrame

   Extract lat/lon to FIPS and county mapping CSV.

   This dataframe is static, so it has a distinct partition from the other datasets and
   its extraction is controlled by a boolean in the ETL run.