pudl.extract.eia923#

Retrieves data from EIA Form 923 spreadsheets for analysis.

This modules pulls data from EIA’s published Excel spreadsheets.

This code is for use analyzing EIA Form 923 data. Currenly only years 2009-2016 work, as they share nearly identical file formatting.

Module Contents#

Classes#

Extractor

Extractor for EIA form 923.

Functions#

extract_eia923(context, raw_eia923__all_dfs)

Extract raw EIA-923 data from excel sheets into dataframes.

Attributes#

pudl.extract.eia923.logger[source]#
class pudl.extract.eia923.Extractor(*args, **kwargs)[source]#

Bases: pudl.extract.excel.ExcelExtractor

Extractor for EIA form 923.

process_raw(df, page, **partition)[source]#

Drops reserved columns.

static process_renamed(df, page, **partition)[source]#

Cleans up unnamed_0 column in stocks page, drops invalid plan_id_eia rows.

process_final_page(df, page)[source]#

Removes reserved columns from the final dataframe.

static get_dtypes(page, **partition)[source]#

Returns dtypes for plant id columns and county FIPS column.

pudl.extract.eia923.eia_raw_table_names = ('raw_eia923__boiler_fuel', 'raw_eia923__fuel_receipts_costs', 'raw_eia923__generation_fuel',...[source]#
pudl.extract.eia923.raw_eia923__all_dfs[source]#
pudl.extract.eia923.extract_eia923(context, raw_eia923__all_dfs)[source]#

Extract raw EIA-923 data from excel sheets into dataframes.

Parameters:

context – dagster keyword that provides access to resources and config.

Returns:

A tuple of extracted EIA dataframes.