pudl.extract.eia860

Retrieve data from EIA Form 860 spreadsheets for analysis.

This modules pulls data from EIA’s published Excel spreadsheets.

This code is for use analyzing EIA Form 860 data.

Module Contents

Classes

Extractor

Extractor for the excel dataset EIA860.

Attributes

logger

pudl.extract.eia860.logger[source]
class pudl.extract.eia860.Extractor(*args, **kwargs)[source]

Bases: pudl.extract.excel.GenericExtractor

Extractor for the excel dataset EIA860.

process_raw(self, df, page, **partition)[source]

Apply necessary pre-processing to the dataframe.

  • Rename columns based on our compiled spreadsheet metadata

  • Add report_year if it is missing

  • Add a flag indicating if record came from EIA 860, or EIA 860M

  • Fix any generator_id values with leading zeroes.

extract(self, settings: pudl.settings.Eia860Settings = Eia860Settings())[source]

Extracts dataframes.

Returns dict where keys are page names and values are DataFrames containing data across given years.

Parameters

settings – Object containing validated settings relevant to EIA 860. Contains the tables and years to be loaded into PUDL.

static get_dtypes(page, **partition)[source]

Returns dtypes for plant id columns.