pudl.extract.ferc2

Extract FERC Form 2 data from SQLite DBs derived from original DBF files.

The Form No. 2 is a compilation of financial and operational information from major interstate natural gas pipelines subject to the jurisdiction of the FERC. The form contains data for a calendar year. Among other things, the form contains a Comparative Balance Sheet, Statement of Income, Statement of Retained Earnings, Statement of Cash Flows, and Notes to Financial Statements.

Major is defined as having combined gas transported or stored for a fee that exceeds 50 million dekatherms.

Module Contents

Classes

Ferc2DbfExtractor

Wrapper for running the foxpro to sqlite conversion of FERC1 dataset.

Attributes

pudl.extract.ferc2.logger[source]
class pudl.extract.ferc2.Ferc2DbfExtractor(datastore: pudl.workspace.datastore.Datastore, settings: pudl.settings.FercToSqliteSettings, output_path: pathlib.Path, clobber: bool = False)[source]

Bases: pudl.extract.dbf.FercDbfExtractor

Wrapper for running the foxpro to sqlite conversion of FERC1 dataset.

DATASET = 'ferc2'[source]
DATABASE_NAME = 'ferc2_dbf.sqlite'[source]
get_settings(global_settings: pudl.settings.FercToSqliteSettings) pudl.settings.GenericDatasetSettings[source]

Returns settings for FERC Form 1 DBF dataset.

finalize_schema(meta: sqlalchemy.MetaData) sqlalchemy.MetaData[source]

Add primary and foreign keys for respondent_id.

static is_valid_partition(fl: dict[str, Any])[source]

Drops partition with non-empty part fields.

aggregate_table_frames(table_name: str, dfs: list[pudl.extract.dbf.PartitionedDataFrame]) pandas.DataFrame | None[source]

Runs the deduplication on f2_s0_respondent_id table.

Other tables are aggregated as usual, meaning that the partial frames are simply concatenated.