pudl.extract.ferc6
Extract FERC Form 6 data from DBF archives.
Module Contents
Classes
Ferc6DbfExtractor | Extracts FERC Form 6 data from the legacy DBF archives.
- class pudl.extract.ferc6.Ferc6DbfExtractor(datastore: pudl.workspace.datastore.Datastore, settings: pudl.settings.FercToSqliteSettings, output_path: pathlib.Path, clobber: bool = False)
Bases: pudl.extract.dbf.FercDbfExtractor
Extracts FERC Form 6 data from the legacy DBF archives.
- get_settings(global_settings: pudl.settings.FercToSqliteSettings) → pudl.settings.GenericDatasetSettings
Returns the settings for the FERC Form 6 DBF dataset.
- finalize_schema(meta: sqlalchemy.MetaData) → sqlalchemy.MetaData
Add primary and foreign keys for respondent_id.
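As a rough illustration of what adding these keys could look like, here is a minimal SQLAlchemy sketch. It is not the library's implementation: the helper name add_respondent_keys, the table some_data_table, and the column layout are hypothetical, and it assumes that the respondent table holds the primary key while every other table with a respondent_id column refers back to it.

```python
import sqlalchemy as sa


def add_respondent_keys(meta: sa.MetaData, respondent_table: str) -> sa.MetaData:
    """Attach respondent_id key constraints to the tables in ``meta``.

    Hypothetical helper: the respondent table gets a primary key on
    respondent_id; any other table carrying that column gets a foreign key.
    """
    for table in meta.tables.values():
        if "respondent_id" not in table.columns:
            continue  # Tables without the column are left untouched.
        if table.name == respondent_table:
            table.append_constraint(sa.PrimaryKeyConstraint("respondent_id"))
        else:
            table.append_constraint(
                sa.ForeignKeyConstraint(
                    ["respondent_id"], [f"{respondent_table}.respondent_id"]
                )
            )
    return meta


# Usage with two made-up tables (column sets are illustrative only).
meta = sa.MetaData()
sa.Table(
    "f6_s0_respondent_id",
    meta,
    sa.Column("respondent_id", sa.Integer),
    sa.Column("respondent_name", sa.String),
)
sa.Table(
    "some_data_table",
    meta,
    sa.Column("respondent_id", sa.Integer),
    sa.Column("value", sa.Float),
)
add_respondent_keys(meta, "f6_s0_respondent_id")
```

With the constraints attached to the metadata, creating the tables in SQLite would emit the keys as part of the schema.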
- aggregate_table_frames(table_name: str, dfs: list[pudl.extract.dbf.PartitionedDataFrame]) → pandas.DataFrame | None
Deduplicates the f6_s0_respondent_id table.
Other tables are aggregated as usual, meaning that the partial frames are simply concatenated.
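The two aggregation paths can be sketched with plain pandas. This is a simplified illustration under stated assumptions, not the actual method: the function aggregate_frames is hypothetical, a dict of year to DataFrame stands in for the list of PartitionedDataFrame objects, and the rule of keeping the most recent record per respondent_id is an assumption about how the deduplication works.

```python
import pandas as pd

RESPONDENT_TABLE = "f6_s0_respondent_id"


def aggregate_frames(table_name, frames_by_year):
    """Combine per-year partial frames for one table (hypothetical helper).

    frames_by_year maps a report year to the partial DataFrame for that
    year. Returns None when there is nothing to aggregate.
    """
    if not frames_by_year:
        return None
    # Tag each partial frame with its year, then concatenate in year order.
    tagged = [df.assign(_year=year) for year, df in sorted(frames_by_year.items())]
    combined = pd.concat(tagged, ignore_index=True)
    if table_name == RESPONDENT_TABLE:
        # Assumed rule: keep only the latest record for each respondent_id.
        combined = (
            combined.sort_values("_year", kind="stable")
            .drop_duplicates(subset="respondent_id", keep="last")
            .reset_index(drop=True)
        )
    # Drop the helper column so the output matches the input schema.
    return combined.drop(columns="_year")


# Usage with made-up data: respondent 2 appears in both years.
frames = {
    2019: pd.DataFrame({"respondent_id": [1, 2], "respondent_name": ["A", "B"]}),
    2020: pd.DataFrame({"respondent_id": [2, 3], "respondent_name": ["B2", "C"]}),
}
respondents = aggregate_frames(RESPONDENT_TABLE, frames)
other = aggregate_frames("some_other_table", frames)
```

In this sketch the respondent table ends up with one row per respondent_id, while any other table keeps every row from every partial frame.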