PUDL Data Dictionary#

The following data tables have been cleaned and transformed by our ETL process.

balancing_authorities_eia#

A coding table describing balancing authorities in EIA-860 and EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

boiler_fuel_eia923#

EIA-923 Monthly Boiler Fuel Consumption and Emissions Time Series. From EIA-923 Schedule 3. Browse or query this table in Datasette.

Field Name

Type

Description

ash_content_pct

number

Ash content percentage by weight to the nearest 0.1 percent.

boiler_id

string

Alphanumeric boiler ID.

energy_source_code

string

The fuel code associated with the fuel receipt. Two or three character alphanumeric.

fuel_consumed_units

number

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

number

Heat content of the fuel in millions of Btus per physical unit.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

date

Date reported.

sulfur_content_pct

number

Sulfur content percentage by weight to the nearest 0.01 percent.

boiler_generator_assn_eia860#

Associations between boilers and generators as reported in EIA-860 Schedule 6, Part A. Augmented with various heuristics within PUDL. Browse or query this table in Datasette.

Field Name

Type

Description

bga_source

string

The source from where the unit_id_pudl is compiled. The unit_id_pudl comes directly from EIA 860, or string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generator), or from a unit connection (where the unit_id_eia is employed to find additional boiler generator connections).

boiler_generator_assn_type_code

string

Indicates whether boiler associations with generator during the year were actual or theoretical. Only available before 2013.

boiler_id

string

Alphanumeric boiler ID.

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

date

Date reported.

steam_plant_type_code

integer

Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.

unit_id_eia

string

EIA-assigned unit identification code.

unit_id_pudl

integer

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

boiler_generator_assn_types_eia#

A coding table describing different types of boiler-generator associations in the EIA-860. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

boilers_entity_eia#

Static boiler attributes compiled from the EIA-860 and EIA-923 data. Browse or query this table in Datasette.

Field Name

Type

Description

boiler_id

string

Alphanumeric boiler ID.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

prime_mover_code

string

Code for the type of prime mover (e.g. CT, CG)

coalmine_eia923#

Coal mine attributes originally reported within the Fuel Receipts and Costs table via EIA-923 Schedule 2, Part C. Browse or query this table in Datasette.

Field Name

Type

Description

county_id_fips

string

County ID from the Federal Information Processing Standard Publication 6-4.

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

mine_id_msha

integer

MSHA issued mine identifier.

mine_id_pudl

integer

Dynamically assigned PUDL mine identifier.

mine_name

string

Coal mine name.

mine_type_code

string

Type of coal mine.

state

string

Two letter US state abbreviation.

coalmine_types_eia#

A coding table describing different types of coalmines reported as fuel sources in the EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

contract_types_eia#

A coding table describing the various types of fuel supply contracts reported in EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

data_maturities#

Level of maturities of data records. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

datasources#

No table description available. Browse or query this table in Datasette.

Field Name

Type

Description

datasource

string

Code identifying a dataset available within PUDL.

doi

string

Unique digitial object identifier of Zenodo archive.

partitions

string

The data parititions used to generate this instance of the database.

pudl_version

string

The version of PUDL used to generate this database.

demand_hourly_pa_ferc714#

Hourly electricity demand by plannting area. FERC Form 714, Part III, Schedule 2a. Browse or query this table in Datasette.

Field Name

Type

Description

demand_mwh

number

N/A

report_date

date

Date reported.

respondent_id_ferc714

integer

N/A

timezone

string

IANA timezone name

utc_datetime

datetime

N/A

electric_energy_dispositions_ferc1#

Electricity utilities unload to end users, internal uses losses, etc. Browse or query this table in Datasette.

Field Name

Type

Description

energy_disposition_type

string

Type of energy disposition reported in the electric_energy_dispositions_ferc1. Dispositions include sales to customers, re-sales of energy, energy used internally, losses, etc.

energy_mwh

number

Sources and uses of energy in MWh.

report_year

integer

Four-digit year in which the data was reported.

row_type_xbrl

string

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

electric_energy_sources_ferc1#

Amount of electricity the utility obtained from each of several sources, by year. Browse or query this table in Datasette.

Field Name

Type

Description

energy_mwh

number

Sources and uses of energy in MWh.

energy_source_type

string

Type of energy source reported in the electric_energy_sources_ferc1 table. There are three groups of energy sources: generation, power exchanges and transmission.

report_year

integer

Four-digit year in which the data was reported.

row_type_xbrl

string

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

energy_sources_eia#

Codes and metadata pertaining to energy sources reported to EIA. Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

fuel_derived_from

string

Original fuel from which this refined fuel was derived.

fuel_group_eia

string

High level fuel group defined in the 2021-2023 EIA Form 860 instructions, Table 28.

fuel_phase

string

Physical phase of matter of the fuel.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

fuel_units

string

Reported units of measure for fuel.

label

string

Longer human-readable code using snake_case

max_fuel_mmbtu_per_unit

number

Maximum heat content per physical unit of fuel in MMBtu.

min_fuel_mmbtu_per_unit

number

Minimum heat content per physical unit of fuel in MMBtu.

entity_types_eia#

Descriptive labels for EIA entity type and ownership codes, taken from the EIA-861 form instructions, valid through 2023-05-31. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

epacamd_eia#

No table description available. Browse or query this table in Datasette.

Field Name

Type

Description

boiler_id

string

Alphanumeric boiler ID.

emissions_unit_id_epa

string

Emissions (smokestack) unit monitored by EPA CEMS.

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

generator_id_epa

string

Generator ID used by the EPA.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_epa

integer

The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.

ferc_accounts#

Account numbers from the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Browse or query this table in Datasette.

Field Name

Type

Description

ferc_account_description

string

N/A

ferc_account_id

string

Account identifier from FERC's Uniform System of Accounts for Electric Plant. Includes higher level labeled categories.

ferc_depreciation_lines#

PUDL assigned FERC Form 1 line identifiers and long descriptions from FERC Form 1 page 219, Accumulated Provision for Depreciation of Electric Utility Plant (Account 108). Browse or query this table in Datasette.

Field Name

Type

Description

ferc_account_description

string

N/A

line_id

string

A human readable string uniquely identifying the FERC depreciation account. Used in lieu of the actual line number, as those numbers are not guaranteed to be consistent from year to year.

fuel_ferc1#

Annual fuel cost and quanitiy for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants. As reported on page 402 of FERC Form 1 and extracted from the f1_fuel table in FERC's FoxPro Database. Browse or query this table in Datasette.

Field Name

Type

Description

fuel_consumed_units

number

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_cost_per_mmbtu

number

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_unit_burned

number

Average cost of fuel consumed in the report year per reported fuel unit (USD).

fuel_cost_per_unit_delivered

number

Average cost of fuel delivered in the report year per reported fuel unit (USD).

fuel_mmbtu_per_unit

number

Heat content of the fuel in millions of Btus per physical unit.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

fuel_units

string

Reported units of measure for fuel.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

fuel_receipts_costs_aggs_eia#

Aggregated fuel receipts and costs from the EIA bulk electricty data. Browse or query this table in Datasette.

Field Name

Type

Description

fuel_agg

string

Category of fuel aggregation in EIA bulk electricity data.

fuel_cost_per_mmbtu

number

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_received_mmbtu

number

Aggregated fuel receipts, in MMBtu, in EIA bulk electricity data.

geo_agg

string

Category of geographic aggregation in EIA bulk electricity data.

report_date

date

Date reported.

sector_agg

string

Category of sectoral aggregation in EIA bulk electricity data.

temporal_agg

string

Category of temporal aggregation in EIA bulk electricity data.

fuel_receipts_costs_eia923#

Monthly fuel contract information, purchases, and costs reported in EIA-923 Schedule 2, Part A. Browse or query this table in Datasette.

Field Name

Type

Description

ash_content_pct

number

Ash content percentage by weight to the nearest 0.1 percent.

chlorine_content_ppm

number

N/A

contract_expiration_date

date

Date contract expires.Format: MMYY.

contract_type_code

string

Purchase type under which receipts occurred in the reporting month. C: Contract, NC: New Contract, S: Spot Purchase, T: Tolling Agreement.

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

energy_source_code

string

The fuel code associated with the fuel receipt. Two or three character alphanumeric.

fuel_cost_per_mmbtu

number

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_group_code

string

Fuel groups used in the Electric Power Monthly

fuel_mmbtu_per_unit

number

Heat content of the fuel in millions of Btus per physical unit.

fuel_received_units

number

Quanity of fuel received in tons, barrel, or Mcf.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

mercury_content_ppm

number

Mercury content in parts per million (ppm) to the nearest 0.001 ppm.

mine_id_pudl

integer

Dynamically assigned PUDL mine identifier.

moisture_content_pct

number

N/A

natural_gas_delivery_contract_type_code

string

Contract type for natrual gas delivery service:

natural_gas_transport_code

string

Contract type for natural gas transportation service.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

primary_transportation_mode_code

string

Transportation mode for the longest distance transported.

report_date

date

Date reported.

secondary_transportation_mode_code

string

Transportation mode for the second longest distance transported.

sulfur_content_pct

number

Sulfur content percentage by weight to the nearest 0.01 percent.

supplier_name

string

Company that sold the fuel to the plant or, in the case of Natural Gas, pipline owner.

fuel_transportation_modes_eia#

Long descriptions of the fuel transportation modes reported in the EIA-860 and EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

fuel_types_aer_eia#

Descriptive labels for aggregated fuel types used in the Annual Energy Review. See EIA-923 Fuel Code table for additional information. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

generation_eia923#

EIA-923 Monthly Generating Unit Net Generation Time Series. From EIA-923 Schedule 3. Browse or query this table in Datasette.

Field Name

Type

Description

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

date

Date reported.

generation_fuel_eia923#

EIA-923 Monthly Generation and Fuel Consumption Time Series. From EIA-923 Schedule 3. Monthly electricity generation and fuel consumption reported for each combination of fuel and prime mover within a plant. This table does not include data from nuclear plants as they report at the generation unit level, rather than the plant level. See the generation_fuel_nuclear_eia923 table for nuclear electricity generation and fuel consumption. Browse or query this table in Datasette.

Field Name

Type

Description

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

energy_source_code

string

The fuel code associated with the fuel receipt. Two or three character alphanumeric.

fuel_consumed_for_electricity_mmbtu

number

Total consumption of fuel to produce electricity, in physical units, year to date.

fuel_consumed_for_electricity_units

number

Consumption for electric generation of the fuel type in physical units.

fuel_consumed_mmbtu

number

Total consumption of fuel in physical units, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_units

number

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

number

Heat content of the fuel in millions of Btus per physical unit.

fuel_type_code_aer

string

A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER). Two or three letter alphanumeric.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

prime_mover_code

string

Code for the type of prime mover (e.g. CT, CG)

report_date

date

Date reported.

generation_fuel_nuclear_eia923#

EIA-923 Monthly Generation and Fuel Consumption Time Series. From EIA-923 Schedule 3. Monthly electricity generation and fuel consumption reported for each combination of fuel and prime mover within a nuclear generation unit. Browse or query this table in Datasette.

Field Name

Type

Description

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

energy_source_code

string

The fuel code associated with the fuel receipt. Two or three character alphanumeric.

fuel_consumed_for_electricity_mmbtu

number

Total consumption of fuel to produce electricity, in physical units, year to date.

fuel_consumed_for_electricity_units

number

Consumption for electric generation of the fuel type in physical units.

fuel_consumed_mmbtu

number

Total consumption of fuel in physical units, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_units

number

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

number

Heat content of the fuel in millions of Btus per physical unit.

fuel_type_code_aer

string

A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER). Two or three letter alphanumeric.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

nuclear_unit_id

string

For nuclear plants only, the unit number .One digit numeric. Nuclear plants are the only type of plants for which data are shown explicitly at the generating unit level.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

prime_mover_code

string

Code for the type of prime mover (e.g. CT, CG)

report_date

date

Date reported.

generators_eia860#

Annually varying generator attributes compiled from across EIA-860 and EIA-923 data. Browse or query this table in Datasette.

Field Name

Type

Description

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

carbon_capture

boolean

Indicates whether the generator uses carbon capture technology.

cofire_fuels

boolean

Can the generator co-fire fuels?.

current_planned_operating_date

date

The most recently updated effective date on which the generator is scheduled to start operation

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

deliver_power_transgrid

boolean

Indicate whether the generator can deliver power to the transmission grid.

distributed_generation

boolean

Whether the generator is considered distributed generation

energy_source_1_transport_1

string

Primary mode of transport for energy source 1.

energy_source_1_transport_2

string

Secondary mode of transport for energy source 1.

energy_source_1_transport_3

string

Tertiary mode of transport for energy source 1.

energy_source_2_transport_1

string

Primary mode of transport for energy source 2.

energy_source_2_transport_2

string

Secondary mode of transport for energy source 2.

energy_source_2_transport_3

string

Tertiary mode of transport for energy source 2.

energy_source_code_1

string

The code representing the most predominant type of energy that fuels the generator.

energy_source_code_2

string

The code representing the second most predominant type of energy that fuels the generator

energy_source_code_3

string

The code representing the third most predominant type of energy that fuels the generator

energy_source_code_4

string

The code representing the fourth most predominant type of energy that fuels the generator

energy_source_code_5

string

The code representing the fifth most predominant type of energy that fuels the generator

energy_source_code_6

string

The code representing the sixth most predominant type of energy that fuels the generator

energy_storage_capacity_mwh

number

Energy storage capacity in MWh (e.g. for batteries).

ferc_qualifying_facility

boolean

Indicatates whether or not a generator is a qualifying FERC cogeneation facility.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

minimum_load_mw

number

The minimum load at which the generator can operate at continuosuly.

multiple_fuels

boolean

Can the generator burn multiple fuels?

nameplate_power_factor

number

The nameplate power factor of the generator.

net_capacity_mwdc

number

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

operational_status

string

The operating status of the generator. This is based on which tab the generator was listed in in EIA 860.

operational_status_code

string

The operating status of the generator.

other_modifications_date

date

Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.

other_planned_modifications

boolean

Indicates whether there are there other modifications planned for the generator.

owned_by_non_utility

boolean

Whether any part of generator is owned by a nonutilty

ownership_code

string

Identifies the ownership for each generator.

planned_derate_date

date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_energy_source_code_1

string

New energy source code for the planned repowered generator.

planned_modifications

boolean

Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.

planned_net_summer_capacity_derate_mw

number

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_summer_capacity_uprate_mw

number

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_net_winter_capacity_derate_mw

number

Decrease in winter capacity expected to be realized from the derate modification to the equipment.

planned_net_winter_capacity_uprate_mw

number

Increase in winter capacity expected to be realized from the uprate modification to the equipment.

planned_new_capacity_mw

number

The expected new namplate capacity for the generator.

planned_new_prime_mover_code

string

New prime mover for the planned repowered generator.

planned_repower_date

date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

planned_retirement_date

date

Planned effective date of the scheduled retirement of the generator.

planned_uprate_date

date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

prime_mover_code

string

Code for the type of prime mover (e.g. CT, CG)

reactive_power_output_mvar

number

Reactive Power Output (MVAr)

report_date

date

Date reported.

retirement_date

date

Date of the scheduled or effected retirement of the generator.

startup_source_code_1

string

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_2

string

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_3

string

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_4

string

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

summer_capacity_estimate

boolean

Whether the summer capacity value was an estimate

summer_capacity_mw

number

The net summer capacity.

summer_estimated_capability_mw

number

EIA estimated summer capacity (in MWh).

switch_oil_gas

boolean

Indicates whether the generator switch between oil and natural gas.

syncronized_transmission_grid

boolean

Indicates whether standby generators (SB status) can be synchronized to the grid.

technology_description

string

High level description of the technology used by the generator to produce electricity.

time_cold_shutdown_full_load_code

string

The minimum amount of time required to bring the unit to full load from shutdown.

turbines_inverters_hydrokinetics

integer

Number of wind turbines, or hydrokinetic buoys.

turbines_num

integer

Number of wind turbines, or hydrokinetic buoys.

uprate_derate_completed_date

date

The date when the uprate or derate was completed.

uprate_derate_during_year

boolean

Was an uprate or derate completed on this generator during the reporting year?

utility_id_eia

integer

The EIA Utility Identification number.

winter_capacity_estimate

boolean

Whether the winter capacity value was an estimate

winter_capacity_mw

number

The net winter capacity.

winter_estimated_capability_mw

number

EIA estimated winter capacity (in MWh).

generators_entity_eia#

Static generator attributes compiled from across the EIA-860 and EIA-923 data. Browse or query this table in Datasette.

Field Name

Type

Description

associated_combined_heat_power

boolean

Indicates whether the generator is associated with a combined heat and power system

bypass_heat_recovery

boolean

Can this generator operate while bypassing the heat recovery steam generator?

duct_burners

boolean

Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas

fluidized_bed_tech

boolean

Indicates whether the generator uses fluidized bed technology

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

operating_date

date

Date the generator began commercial operation

operating_switch

string

Indicates whether the fuel switching generator can switch when operating

original_planned_operating_date

date

The date the generator was originally scheduled to be operational

other_combustion_tech

boolean

Indicates whether the generator uses other combustion technologies

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

previously_canceled

boolean

Indicates whether the generator was previously reported as indefinitely postponed or canceled

pulverized_coal_tech

boolean

Indicates whether the generator uses pulverized coal technology

rto_iso_lmp_node_id

string

The designation used to identify the price node in RTO/ISO Locational Marginal Price reports

rto_iso_location_wholesale_reporting_id

string

The designation used to report ths specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report

solid_fuel_gasification

boolean

Indicates whether the generator is part of a solid fuel gasification system

stoker_tech

boolean

Indicates whether the generator uses stoker technology

subcritical_tech

boolean

Indicates whether the generator uses subcritical technology

supercritical_tech

boolean

Indicates whether the generator uses supercritical technology

topping_bottoming_code

string

If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle

ultrasupercritical_tech

boolean

Indicates whether the generator uses ultra-supercritical technology

hourly_emissions_epacems#

Hourly emissions and plant operational data reported via Continuous Emissions Monitoring Systems as required by 40 CFR Part 75. Browse or query this table in Datasette.

Field Name

Type

Description

co2_mass_measurement_code

string

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.

co2_mass_tons

number

Carbon dioxide emissions in short tons.

emissions_unit_id_epa

string

Emissions (smokestack) unit monitored by EPA CEMS.

gross_load_mw

number

Average power in megawatts delivered during time interval measured.

heat_content_mmbtu

number

The energy contained in fuel burned, measured in million BTU.

nox_mass_lbs

number

NOx emissions in pounds.

nox_mass_measurement_code

string

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.

operating_datetime_utc

datetime

Date and time measurement began (UTC).

operating_time_hours

number

Length of time interval measured.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_epa

integer

The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.

so2_mass_lbs

number

Sulfur dioxide emissions in pounds.

so2_mass_measurement_code

string

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.

state

string

Two letter US state abbreviation.

steam_load_1000_lbs

number

Total steam pressure produced by a unit during the reported hour.

year

integer

Year the data was reported in, used for partitioning EPA CEMS.

momentary_interruptions_eia#

A coding table for utility definitions of momentary service interruptions. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

operational_status_eia#

Codes and metadata pertaining to operational status reported to EIA. Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

operational_status

string

The operating status of the generator. This is based on which tab the generator was listed in in EIA 860.

ownership_eia860#

Generator Ownership, reported in EIA-860 Schedule 4. Includes only jointly or third-party owned generators. Browse or query this table in Datasette.

Field Name

Type

Description

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

fraction_owned

number

Proportion of generator ownership.

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

owner_city

string

City of owner.

owner_country

string

Three letter ISO-3166 country code.

owner_name

string

Name of owner.

owner_state

string

Two letter ISO-3166 political subdivision code.

owner_street_address

string

Steet address of owner.

owner_utility_id_eia

integer

EIA-assigned owner's identification number.

owner_zip_code

string

Zip code of owner.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

date

Date reported.

utility_id_eia

integer

The EIA Utility Identification number.

plant_in_service_ferc1#

Balances and changes to FERC Electric Plant in Service accounts, as reported on FERC Form 1. Data originally from the f1_plant_in_srvce table in FERC's FoxPro database. Account numbers correspond to the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Each FERC respondent reports starting and ending balances for each account annually. Balances are organization wide, and are not broken down on a per-plant basis. End of year balance should equal beginning year balance plus the sum of additions, retirements, adjustments, and transfers. Browse or query this table in Datasette.

Field Name

Type

Description

additions

number

Cost of acquisition of items classified within the account.

adjustments

number

Cost of adjustments to the account.

ending_balance

number

Account balance at end of year.

ferc_account

string

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

ferc_account_label

string

Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

retirements

number

Cost of disposal of items classified within the account.

row_type_xbrl

string

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

starting_balance

number

Account balance at beginning of year.

transfers

number

Cost of transfers into (out of) the account.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plant_parts_eia#

Output table with the aggregation of all EIA plant parts. For use with matching to FERC 1. Browse or query this table in Datasette.

Field Name

Type

Description

appro_part_label

string

Plant part of the associated true granularity record.

appro_record_id_eia

string

EIA record ID of the associated true granularity record.

capacity_eoy_mw

number

Total end of year installed (nameplate) capacity for a plant part, in megawatts.

capacity_factor

number

Fraction of potential generation that was actually reported for a plant part.

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

construction_year

integer

Year the plant's oldest still operational unit was built.

energy_source_code_1

string

The code representing the most predominant type of energy that fuels the generator.

ferc_acct_name

string

Name of FERC account, derived from technology description and prime mover code.

fraction_owned

number

Proportion of generator ownership.

fuel_cost_per_mmbtu

number

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_mwh

number

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

fuel_type_code_pudl

string

Simplified fuel type code used in PUDL

generator_id

string

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

heat_rate_mmbtu_mwh

number

Fuel content per unit of electricity generated. Coming from MCOE calculation.

installation_year

integer

Year the plant's most recently built unit was installed.

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

operating_year

integer

Year a generator went into service.

operational_status

string

The operating status of the generator. This is based on which tab the generator was listed in in EIA 860.

operational_status_pudl

string

The operating status of the generator using PUDL categories.

ownership_dupe

boolean

Whether a plant part record has a duplicate record with different ownership status.

ownership_record_type

string

Whether each generator record is for one owner or represents a total of all ownerships.

planned_retirement_date

date

Planned effective date of the scheduled retirement of the generator.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

integer

A manually assigned PUDL plant ID. May not be constant over time.

plant_id_report_year

string

PUDL plant ID and report year of the record.

plant_name_eia

string

Plant name.

plant_name_ppe

string

Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.

plant_part

string

The part of the plant a record corresponds to.

plant_part_id_eia

string

Contains EIA plant ID, plant part, ownership, and EIA utility id

prime_mover_code

string

Code for the type of prime mover (e.g. CT, CG)

record_count

integer

Number of distinct generator IDs that partcipated in the aggregation for a plant part list record.

report_date

date

Date reported.

report_year

integer

Four-digit year in which the data was reported.

retirement_date

date

Date of the scheduled or effected retirement of the generator.

technology_description

string

High level description of the technology used by the generator to produce electricity.

total_fuel_cost

number

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

number

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

true_gran

boolean

Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.

unit_id_pudl

integer

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

utility_id_eia

integer

The EIA Utility Identification number.

utility_id_pudl

integer

A manually assigned PUDL utility ID. May not be stable over time.

plants_eia#

Association between EIA Plant IDs and manually assigned PUDL Plant IDs Browse or query this table in Datasette.

Field Name

Type

Description

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

integer

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

string

Plant name.

plants_eia860#

Annually varying plant attributes, compiled from across all EIA-860 and EIA-923 data. Browse or query this table in Datasette.

Field Name

Type

Description

ash_impoundment

boolean

Is there an ash impoundment (e.g. pond, reservoir) at the plant?

ash_impoundment_lined

boolean

If there is an ash impoundment at the plant, is the impoundment lined?

ash_impoundment_status

string

If there is an ash impoundment at the plant, the ash impoundment status as of December 31 of the reporting year.

balancing_authority_code_eia

string

EIA short code identifying a balancing authority.

balancing_authority_name_eia

string

Name of the balancing authority.

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

datum

string

Geodetic coordinate system identifier (e.g. NAD27, NAD83, or WGS84).

energy_storage

boolean

Indicates if the facility has energy storage capabilities.

ferc_cogen_docket_no

string

The docket number relating to the FERC cogenerator status. See FERC Form 556.

ferc_cogen_status

boolean

Indicates whether the plant has FERC qualifying facility cogenerator status. See FERC Form 556.

ferc_exempt_wholesale_generator

boolean

Indicates whether the plant has FERC qualifying facility exempt wholesale generator status

ferc_exempt_wholesale_generator_docket_no

string

The docket number relating to the FERC qualifying facility exempt wholesale generator status.

ferc_qualifying_facility_docket_no

string

The docket number relating to the FERC qualifying facility cogenerator status. See FERC Form 556.

ferc_small_power_producer

boolean

Indicates whether the plant has FERC qualifying facility small power producer status. See FERC Form 556.

ferc_small_power_producer_docket_no

string

The docket number relating to the FERC qualifying facility small power producer status. See FERC Form 556.

grid_voltage_1_kv

number

Plant's grid voltage at point of interconnection to transmission or distibution facilities

grid_voltage_2_kv

number

Plant's grid voltage at point of interconnection to transmission or distibution facilities

grid_voltage_3_kv

number

Plant's grid voltage at point of interconnection to transmission or distibution facilities

iso_rto_code

string

The code of the plant's ISO or RTO. NA if not reported in that year.

liquefied_natural_gas_storage

boolean

Indicates if the facility have the capability to store the natural gas in the form of liquefied natural gas.

natural_gas_local_distribution_company

string

Names of Local Distribution Company (LDC), connected to natural gas burning power plants.

natural_gas_pipeline_name_1

string

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_2

string

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_3

string

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_storage

boolean

Indicates if the facility have on-site storage of natural gas.

nerc_region

string

NERC region in which the plant is located

net_metering

boolean

Did this plant have a net metering agreement in effect during the reporting year? (Only displayed for facilities that report the sun or wind as an energy source). This field was only reported up until 2015

pipeline_notes

string

Additional owner or operator of natural gas pipeline.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

primary_purpose_id_naics

integer

North American Industry Classification System (NAICS) code that best describes the primary purpose of the reporting plant

regulatory_status_code

string

Indicates whether the plant is regulated or non-regulated.

report_date

date

Date reported.

reporting_frequency_code

string

Code that specifies what time period data has to be reported (i.e. monthly data or annual totals) and how often the power plant reports this data to EIA. See reporting_frequencies_eia for more details.

sector_id_eia

integer

EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

sector_name_eia

string

EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

service_area

string

Service area in which plant is located; for unregulated companies, it's the electric utility with which plant is interconnected

transmission_distribution_owner_id

integer

EIA-assigned code for owner of transmission/distribution system to which the plant is interconnected.

transmission_distribution_owner_name

string

Name of the owner of the transmission or distribution system to which the plant is interconnected.

transmission_distribution_owner_state

string

State location for owner of transmission/distribution system to which the plant is interconnected.

utility_id_eia

integer

The EIA Utility Identification number.

water_source

string

Name of water source associated with the plant.

plants_entity_eia#

Static plant attributes, compiled from across all EIA-860 and EIA-923 data. Browse or query this table in Datasette.

Field Name

Type

Description

city

string

N/A

county

string

County name.

latitude

number

Latitude of the plant's location, in degrees.

longitude

number

Longitude of the plant's location, in degrees.

plant_id_eia

integer

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

string

Plant name.

state

string

Two letter US state abbreviation.

street_address

string

N/A

timezone

string

IANA timezone name

zip_code

string

Five digit US Zip Code.

plants_ferc1#

FERC 1 Plants and their associated manually assigned PUDL Plant IDs Browse or query this table in Datasette.

Field Name

Type

Description

plant_id_pudl

integer

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plants_hydro_ferc1#

Hydroelectric generating plant statistics for large plants. Large plants have an installed nameplate capacity of more than 10 MW. As reported on FERC Form 1, pages 406-407, and extracted from the f1_hydro table in FERC's FoxPro database. Browse or query this table in Datasette.

Field Name

Type

Description

asset_retirement_cost

number

Asset retirement cost (USD).

avg_num_employees

number

N/A

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

capex_equipment

number

Cost of plant: equipment (USD).

capex_facilities

number

Cost of plant: reservoirs, dams, and waterways (USD).

capex_land

number

Cost of plant: land and land rights (USD).

capex_per_mw

number

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_roads

number

Cost of plant: roads, railroads, and bridges (USD).

capex_structures

number

Cost of plant: structures and improvements (USD).

capex_total

number

Total cost of plant (USD).

construction_type

string

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

integer

Year the plant's oldest still operational unit was built.

installation_year

integer

Year the plant's most recently built unit was installed.

net_capacity_adverse_conditions_mw

number

Net plant capability under the least favorable operating conditions, in megawatts.

net_capacity_favorable_conditions_mw

number

Net plant capability under the most favorable operating conditions, in megawatts.

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

opex_dams

number

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_electric

number

Production expenses: electric expenses (USD).

opex_engineering

number

Production expenses: maintenance, supervision, and engineering (USD).

opex_generation_misc

number

Production expenses: miscellaneous power generation expenses (USD).

opex_hydraulic

number

Production expenses: hydraulic expenses (USD).

opex_misc_plant

number

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_operations

number

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

number

Total production expenses (USD per MWh generated).

opex_plant

number

Production expenses: maintenance of electric plant (USD).

opex_rents

number

Production expenses: rents (USD).

opex_structures

number

Production expenses: maintenance of structures (USD).

opex_total

number

Total production expenses, excluding fuel (USD).

opex_water_for_power

number

Production expenses: water for power (USD).

peak_demand_mw

number

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_hours_connected_while_generating

number

Hours the plant was connected to load while generating in the report year.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_type

string

N/A

project_num

integer

FERC Licensed Project Number.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plants_pudl#

Home table for PUDL assigned plant IDs. These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly identified plants are added to the list with a new ID. Each ID maps to a power plant which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx Browse or query this table in Datasette.

Field Name

Type

Description

plant_id_pudl

integer

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_pudl

string

Plant name, chosen arbitrarily from the several possible plant names available in the plant matching process. Included for human readability only.

plants_pumped_storage_ferc1#

Generating plant statistics for hydroelectric pumped storage plants with an installed nameplate capacity of 10+ MW. As reported on page 408 of FERC Form 1 and extracted from the f1_pumped_storage table in FERC's FoxPro Database. Browse or query this table in Datasette.

Field Name

Type

Description

asset_retirement_cost

number

Asset retirement cost (USD).

avg_num_employees

number

N/A

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

capex_equipment_electric

number

Cost of plant: accessory electric equipment (USD).

capex_equipment_misc

number

Cost of plant: miscellaneous power plant equipment (USD).

capex_facilities

number

Cost of plant: reservoirs, dams, and waterways (USD).

capex_land

number

Cost of plant: land and land rights (USD).

capex_per_mw

number

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_roads

number

Cost of plant: roads, railroads, and bridges (USD).

capex_structures

number

Cost of plant: structures and improvements (USD).

capex_total

number

Total cost of plant (USD).

capex_wheels_turbines_generators

number

Cost of plant: water wheels, turbines, and generators (USD).

construction_type

string

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

integer

Year the plant's oldest still operational unit was built.

energy_used_for_pumping_mwh

number

Energy used for pumping, in megawatt-hours.

installation_year

integer

Year the plant's most recently built unit was installed.

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

net_load_mwh

number

Net output for load (net generation - energy used for pumping) in megawatt-hours.

opex_dams

number

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_electric

number

Production expenses: electric expenses (USD).

opex_engineering

number

Production expenses: maintenance, supervision, and engineering (USD).

opex_generation_misc

number

Production expenses: miscellaneous power generation expenses (USD).

opex_misc_plant

number

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_operations

number

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

number

Total production expenses (USD per MWh generated).

opex_plant

number

Production expenses: maintenance of electric plant (USD).

opex_production_before_pumping

number

Total production expenses before pumping (USD).

opex_pumped_storage

number

Production expenses: pumped storage (USD).

opex_pumping

number

Production expenses: We are here to PUMP YOU UP! (USD).

opex_rents

number

Production expenses: rents (USD).

opex_structures

number

Production expenses: maintenance of structures (USD).

opex_total

number

Total production expenses, excluding fuel (USD).

opex_water_for_power

number

Production expenses: water for power (USD).

peak_demand_mw

number

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

number

Net plant capability in megawatts.

plant_hours_connected_while_generating

number

Hours the plant was connected to load while generating in the report year.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

project_num

integer

FERC Licensed Project Number.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plants_small_ferc1#

Generating plant statistics for steam plants with less than 25 MW installed nameplate capacity and internal combustion plants, gas turbine-plants, conventional hydro plants, and pumped storage plants with less than 10 MW installed nameplate capacity. As reported on FERC Form 1 pages 410-411, and extracted from the FERC FoxPro database table f1_gnrt_plant. Browse or query this table in Datasette.

Field Name

Type

Description

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

capex_per_mw

number

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_total

number

Total cost of plant (USD).

construction_year

integer

Year the plant's oldest still operational unit was built.

fuel_cost_per_mmbtu

number

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_type

string

N/A

license_id_ferc1

integer

FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

opex_fuel

number

Production expenses: fuel (USD).

opex_maintenance

number

Production expenses: Maintenance (USD).

opex_operations

number

Production expenses: operations, supervision, and engineering (USD).

peak_demand_mw

number

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_type

string

N/A

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plants_steam_ferc1#

Generating plant statistics for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants. As reported on page 402 of FERC Form 1 and extracted from the f1_gnrt_plant table in FERC's FoxPro Database. Browse or query this table in Datasette.

Field Name

Type

Description

asset_retirement_cost

number

Asset retirement cost (USD).

avg_num_employees

number

N/A

capacity_mw

number

Total installed (nameplate) capacity, in megawatts.

capex_equipment

number

Cost of plant: equipment (USD).

capex_land

number

Cost of plant: land and land rights (USD).

capex_per_mw

number

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_structures

number

Cost of plant: structures and improvements (USD).

capex_total

number

Total cost of plant (USD).

construction_type

string

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

integer

Year the plant's oldest still operational unit was built.

installation_year

integer

Year the plant's most recently built unit was installed.

net_generation_mwh

number

Net electricity generation for the specified period in megawatt-hours (MWh).

not_water_limited_capacity_mw

number

Plant capacity in MW when not limited by condenser water.

opex_allowances

number

Allowances.

opex_boiler

number

Maintenance of boiler (or reactor) plant.

opex_coolants

number

Cost of coolants and water (nuclear plants only)

opex_electric

number

Production expenses: electric expenses (USD).

opex_engineering

number

Production expenses: maintenance, supervision, and engineering (USD).

opex_fuel

number

Production expenses: fuel (USD).

opex_misc_power

number

Miscellaneous steam (or nuclear) expenses.

opex_misc_steam

number

Maintenance of miscellaneous steam (or nuclear) plant.

opex_operations

number

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

number

Total production expenses (USD per MWh generated).

opex_plants

number

Maintenance of electrical plant.

opex_production_total

number

Total operating expenses.

opex_rents

number

Production expenses: rents (USD).

opex_steam

number

Steam expenses.

opex_steam_other

number

Steam from other sources.

opex_structures

number

Production expenses: maintenance of structures (USD).

opex_transfer

number

Steam transferred (Credit).

peak_demand_mw

number

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

number

Net plant capability in megawatts.

plant_hours_connected_while_generating

number

Hours the plant was connected to load while generating in the report year.

plant_id_ferc1

integer

Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.

plant_name_ferc1

string

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_type

string

N/A

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

water_limited_capacity_mw

number

Plant capacity in MW when limited by condenser water.

political_subdivisions#

Various static attributes associated with states, provinces, and other sub- national political subdivisions. Browse or query this table in Datasette.

Field Name

Type

Description

country_code

string

Three letter ISO-3166 country code (e.g. USA or CAN).

country_name

string

Full country name (e.g. United States of America).

division_code_us_census

string

Three-letter US Census division code as it appears in the bulk electricity data published by the EIA. Note that EIA splits the Pacific division into distinct contiguous (CA, OR, WA) and non-contiguous (AK, HI) states. For reference see this US Census region and division map: https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf

division_name_us_census

string

Longer human readable name describing the US Census division.

is_epacems_state

boolean

Indicates whether the associated state reports data within the EPA's Continuous Emissions Monitoring System.

region_name_us_census

string

Human-readable name of a US Census region.

state_id_fips

string

Two digit state FIPS code.

subdivision_code

string

Two-letter ISO-3166 political subdivision code (e.g. US state or Canadian provice abbreviations like CA or AB).

subdivision_name

string

Full name of political subdivision (e.g. US state or Canadian province names like California or Alberta.

subdivision_type

string

ISO-3166 political subdivision type. E.g. state, province, outlying_area.

timezone_approx

string

IANA timezone name of the timezone which encompasses the largest portion of the population in the associated geographic area.

power_purchase_types_ferc1#

Coding table defining different types of electricity power purchases. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

prime_movers_eia#

Long descriptions explaining the short prime mover codes reported in the EIA-860 and EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

purchased_power_ferc1#

Purchased Power (Account 555) including power exchanges (i.e. transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges. Reported on pages 326-327 of FERC Form 1. Extracted from the f1_purchased_pwr table in FERC's FoxPro database. Browse or query this table in Datasette.

Field Name

Type

Description

billing_demand_mw

number

Monthly average billing demand (for requirements purchases, and any transactions involving demand charges). In megawatts.

coincident_peak_demand_mw

number

Average monthly coincident peak (CP) demand (for requirements purchases, and any transactions involving demand charges). Monthly CP demand is the metered demand during the hour (60-minute integration) in which the supplier's system reaches its monthly peak. In megawatts.

delivered_mwh

number

Gross megawatt-hours delivered in power exchanges and used as the basis for settlement.

demand_charges

number

Demand charges (USD).

energy_charges

number

Energy charges (USD).

non_coincident_peak_demand_mw

number

Average monthly non-coincident peak (NCP) demand (for requirements purhcases, and any transactions involving demand charges). Monthly NCP demand is the maximum metered hourly (60-minute integration) demand in a month. In megawatts.

other_charges

number

Other charges, including out-of-period adjustments (USD).

purchase_type_code

string

Categorization based on the original contractual terms and conditions of the service. Must be one of 'requirements', 'long_firm', 'intermediate_firm', 'short_firm', 'long_unit', 'intermediate_unit', 'electricity_exchange', 'other_service', or 'adjustment'. Requirements service is ongoing high reliability service, with load integrated into system resource planning. 'Long term' means 5+ years. 'Intermediate term' is 1-5 years. 'Short term' is less than 1 year. 'Firm' means not interruptible for economic reasons. 'unit' indicates service from a particular designated generating unit. 'exchange' is an in-kind transaction.

purchased_mwh

number

Megawatt-hours shown on bills rendered to the respondent.

received_mwh

number

Gross megawatt-hours received in power exchanges and used as the basis for settlement.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

seller_name

string

Name of the seller, or the other party in an exchange transaction.

tariff

string

FERC Rate Schedule Number or Tariff. (Note: may be incomplete if originally reported on multiple lines.)

total_settlement

number

Sum of demand, energy, and other charges (USD). For power exchanges, the settlement amount for the net receipt of energy. If more energy was delivered than received, this amount is negative.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

reporting_frequencies_eia#

A coding table describing different types of reporting frequencies in plants in the EIA-923. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

respondent_id_ferc714#

Respondent identification. FERC Form 714, Part I, Schedule 1. Browse or query this table in Datasette.

Field Name

Type

Description

eia_code

integer

N/A

respondent_id_ferc714

integer

N/A

respondent_name_ferc714

string

N/A

sector_consolidated_eia#

Long descriptions for the EIA consolidated NAICS sector codes. Codes and descriptions taken from the EIA-923 File Layout spreadsheet. Browse or query this table in Datasette.

Field Name

Type

Description

code

integer

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

steam_plant_types_eia#

A coding table describing different types of steam plants in the EIA-860. Browse or query this table in Datasette.

Field Name

Type

Description

code

string

Originally reported short code.

description

string

Long human-readable description of the meaning of a code/label.

label

string

Longer human-readable code using snake_case

transmission_ferc1#

Information concerning transmission lines, cost of lines, and expenses for year. Browse or query this table in Datasette.

Field Name

Type

Description

capex_land

number

Cost of Land and land rights for the transmission line.

capex_other

number

Construction and other costs for the transmission line.

capex_total

number

Total costs for the transmission line.

conductor_size_and_material

string

Size of transmission conductor and material of the transmission line.

designed_voltage_kv

number

Manufactured (Designed) voltage, expressed in kilo-volts, for three-phase 60 cycle alternative current transmission lines

end_point

string

The end point of a transmission line.

num_transmission_circuits

integer

Number of circuits in a transmission line.

operating_voltage_kv

number

The operating voltage, expressed kilo-volts, for three-phase 60 cycle alternative current transmission lines.

opex_maintenance

number

Maintenance expenses for the transmission line.

opex_operations

number

Operating expenses for the transmission line.

opex_rents

number

Rent expenses for the transmission line.

opex_total

number

Overall expenses for the transmission line.

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

start_point

string

The starting point of a transmission line.

supporting_structure_type

string

Supporting structure of the transmission line.

transmission_line_and_structures_length_miles

number

Length (in pole miles or circuit miles (if transmission lines are underground)) for lines that are agrregated with other lines / structures (whose cost are aggregated and combined with other structures).

transmission_line_length_miles

number

Length (in pole miles or circuit miles (if transmission lines are underground)) for lines that are stand alone structures (whose cost are reported on a stand-alone basis).

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utilities_eia#

Associations between the EIA Utility IDs and the manually assigned PUDL Utility IDs. Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_eia

integer

The EIA Utility Identification number.

utility_id_pudl

integer

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

string

The name of the utility.

utilities_eia860#

Annually varying utility attributes, compiled from all EIA data. Browse or query this table in Datasette.

Field Name

Type

Description

address_2

string

N/A

attention_line

string

N/A

city

string

N/A

contact_firstname

string

First name of utility contact 1.

contact_firstname_2

string

First name of utility contact 2.

contact_lastname

string

Last name of utility contact 1.

contact_lastname_2

string

Last name of utility contact 2.

contact_title

string

Title of of utility contact 1.

contact_title_2

string

Title of utility contact 2.

data_maturity

string

Level of maturity of the data record. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

entity_type

string

Entity type of principal owner.

phone_extension

string

Phone extension for utility contact 1

phone_extension_2

string

Phone extension for utility contact 2

phone_number

string

Phone number for utility contact 1.

phone_number_2

string

Phone number for utility contact 2.

plants_reported_asset_manager

boolean

Is the reporting entity an asset manager of power plants reported on Schedule 2 of the form?

plants_reported_operator

boolean

Is the reporting entity an operator of power plants reported on Schedule 2 of the form?

plants_reported_other_relationship

boolean

Does the reporting entity have any other relationship to the power plants reported on Schedule 2 of the form?

plants_reported_owner

boolean

Is the reporting entity an owner of power plants reported on Schedule 2 of the form?

report_date

date

Date reported.

state

string

Two letter US state abbreviation.

street_address

string

N/A

utility_id_eia

integer

The EIA Utility Identification number.

zip_code

string

Five digit US Zip Code.

zip_code_4

string

Four digit US Zip Code suffix.

utilities_entity_eia#

Static attributes of utilities, compiled from all EIA data. Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_eia

integer

The EIA Utility Identification number.

utility_name_eia

string

The name of the utility.

utilities_ferc1#

This table maps two manually assigned utility IDs: a PUDL ID and a FERC1 ID. The PUDL ID maps EIA and FERC1 utilities. The FERC1 ID maps the older DBF respondent IDs to new XBRL entity IDs. This table is generated from a table stored in the PUDL repository: src/package_data/glue/utility_id_pudl.csv Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_pudl

integer

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

string

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utilities_ferc1_dbf#

This table maps the assign utility ID FERC1 to the native utility ID from the FERC1 DBF inputs - originally reported as respondent_id. Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

integer

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utilities_ferc1_xbrl#

This table maps the assign utility ID FERC1 to the native utility ID from the FERC1 XBRL inputs - originally reported as entity_id. Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_xbrl

string

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utilities_pudl#

Home table for PUDL assigned utility IDs. These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly found utilities are added to the list with a new ID. Each ID maps to a power plant owning or operating entity which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx Browse or query this table in Datasette.

Field Name

Type

Description

utility_id_pudl

integer

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_pudl

string

Utility name, chosen arbitrarily from the several possible utility names available in the utility matching process. Included for human readability only.

utility_plant_assn#

No table description available. Browse or query this table in Datasette.

Field Name

Type

Description

plant_id_pudl

integer

A manually assigned PUDL plant ID. May not be constant over time.

utility_id_pudl

integer

A manually assigned PUDL utility ID. May not be stable over time.

utility_plant_summary_ferc1#

Summary of utility plant and accumulated provisions for depreviation, amortization and depletion of utilty plant assets reported annually at the end of the report year. Browse or query this table in Datasette.

Field Name

Type

Description

record_id

string

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

integer

Four-digit year in which the data was reported.

row_type_xbrl

string

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_id_ferc1

integer

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_plant_asset_type

string

Type of utility plant asset reported in the utility_plant_summary_ferc1 table. Assets include those leased to others, held for future use, construction work-in-progress and details of accumulated depreciation.

utility_plant_value

number

Utility plant value.

utility_type

string

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

utility_type_other

string

Freeform description of type of utility reported in one of the other three other utility_type sections in the utility_plant_summary_ferc1 table. This field is reported only in the DBF reporting years (1994-2020).