Extract FERC Form 6 data from DBF archives.

Module Contents




class pudl.extract.ferc6.Ferc6DbfExtractor(datastore: pudl.workspace.datastore.Datastore, settings: pudl.settings.FercToSqliteSettings, output_path: pathlib.Path, clobber: bool = False)

Bases: pudl.extract.dbf.FercDbfExtractor

Extracts FERC Form 6 data from the legacy DBF archives.

DATASET = 'ferc6'
DATABASE_NAME = 'ferc6_dbf.sqlite'
get_settings(global_settings: pudl.settings.FercToSqliteSettings) → pudl.settings.GenericDatasetSettings

Returns the settings for the FERC Form 6 DBF dataset.

finalize_schema(meta: sqlalchemy.MetaData) → sqlalchemy.MetaData

Add primary and foreign keys for respondent_id.
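As a rough illustration of what adding keys for respondent_id can look like in SQLAlchemy, here is a minimal sketch. It is not the PUDL implementation. It assumes that the primary key belongs on f6_s0_respondent_id and that every other table with a respondent_id column should reference it. The helper name add_respondent_keys, the table f6_example_data, and the columns respondent_name and amount are hypothetical.

```python
import sqlalchemy as sa

# Assumption: the respondent table named elsewhere in these docs holds the primary key.
RESPONDENT_TABLE = "f6_s0_respondent_id"


def add_respondent_keys(meta: sa.MetaData) -> sa.MetaData:
    """Hypothetical helper: attach key constraints on respondent_id."""
    for table in meta.tables.values():
        if "respondent_id" not in table.columns:
            continue  # tables without the column are left untouched
        if table.name == RESPONDENT_TABLE:
            # The respondent table owns the key.
            table.append_constraint(sa.PrimaryKeyConstraint("respondent_id"))
        else:
            # Every other table points back at the respondent table.
            table.append_constraint(
                sa.ForeignKeyConstraint(
                    ["respondent_id"], [f"{RESPONDENT_TABLE}.respondent_id"]
                )
            )
    return meta


meta = sa.MetaData()
sa.Table(
    RESPONDENT_TABLE,
    meta,
    sa.Column("respondent_id", sa.Integer),
    sa.Column("respondent_name", sa.String),  # hypothetical column
)
sa.Table(
    "f6_example_data",  # hypothetical table
    meta,
    sa.Column("respondent_id", sa.Integer),
    sa.Column("amount", sa.Float),  # hypothetical column
)
add_respondent_keys(meta)
```

Declaring the constraints on the MetaData before any tables are created means they end up in the generated CREATE TABLE statements, so no later schema alteration is needed.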

aggregate_table_frames(table_name: str, dfs: list[pudl.extract.dbf.PartitionedDataFrame]) → pandas.DataFrame | None

Deduplicates the f6_s0_respondent_id table.

Other tables are aggregated as usual, meaning that the partial frames are simply concatenated.
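The difference between the two paths can be sketched with plain pandas. This is only an illustration, not the PUDL code. The YearFrame tuple is a hypothetical stand-in for a partitioned frame, the column respondent_name is invented, and the deduplication rule (keep the row from the most recent year for each respondent_id) is an assumption, since the rule is not stated here.

```python
from __future__ import annotations

from typing import NamedTuple

import pandas as pd


class YearFrame(NamedTuple):
    """Hypothetical stand-in for a partitioned frame: one year's rows."""

    year: int
    df: pd.DataFrame


def aggregate(table_name: str, frames: list[YearFrame]) -> pd.DataFrame | None:
    """Concatenate partial frames; deduplicate only the respondent table."""
    if not frames:
        return None
    if table_name != "f6_s0_respondent_id":
        # Default path: the partial frames are simply concatenated.
        return pd.concat([f.df for f in frames], ignore_index=True)
    # Assumed rule: keep the row from the latest year for each respondent_id.
    tagged = pd.concat(
        [f.df.assign(_year=f.year) for f in frames], ignore_index=True
    )
    return (
        tagged.sort_values("_year", kind="stable")
        .drop_duplicates(subset="respondent_id", keep="last")
        .drop(columns="_year")
        .sort_values("respondent_id")
        .reset_index(drop=True)
    )


frames = [
    YearFrame(2019, pd.DataFrame({
        "respondent_id": [1, 2],
        "respondent_name": ["Alpha Pipeline", "Beta Oil"],
    })),
    YearFrame(2020, pd.DataFrame({
        "respondent_id": [1, 3],
        "respondent_name": ["Alpha Pipeline LLC", "Gamma Transport"],
    })),
]

respondents = aggregate("f6_s0_respondent_id", frames)  # one row per respondent
other = aggregate("some_other_table", frames)  # all four rows kept
```

In this sketch respondent 1 appears in both years, so only its 2020 row survives in respondents, while other keeps every row from both partial frames.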