Pipeline and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report#
Annual reports submitted to PHMSA by operators of gas distribution, gas gathering, gas transmission, liquefied natural gas, and underground gas storage systems. The reports include information such as total pipeline mileage, facilities, commodities transported, miles by material, and installation dates.
The PHMSA Natural Gas Annual Report, published by the Pipeline and Hazardous Materials Safety Administration (part of the U.S. Department of Transportation), collects data about natural gas gathering, transmission, and distribution systems, including their age, length, diameter, materials, and carrying capacity. PHMSA also collects information about natural gas storage facilities and liquefied natural gas shipping facilities.
There are six different forms used by PHMSA. To begin, PUDL will focus on integrating the transmission and distribution data, which is available from 1970 to the present. For more details, see the official PHMSA data page.
Download the following files for further context:
Documentation for the gas distribution form:
Documentation for the gas transmission and gathering form:
How much of the data is accessible through PUDL?#
PHMSA data goes back to 1970 and is formatted as Microsoft Excel spreadsheets. To begin, PUDL will focus on integrating tables from the distribution and transmission forms, from 1990 to the present. Data prior to 1990 and data from other tables will be integrated as funding allows.
Who is required to fill out the form?#
The Code of Federal Regulations (49 CFR Parts 191, 195) requires operators of gas distribution, gas gathering, gas transmission, hazardous liquid, liquefied natural gas (LNG), and underground natural gas storage (UNGS) systems to submit annual reports to PHMSA. For further details, see PHMSA’s pipeline safety regulations.
What does the original data look like?#
PHMSA typically publishes both CSV files and Excel spreadsheets for each form once a year. The content of the spreadsheets varies from year to year as the questions in the form are updated, with new questions and parts of the form added over time. For some sections of the form, respondents must answer each question once per commodity and/or state of operation, meaning that different form sections have different primary keys. Older data may be revised after publication through the filing of a supplementary report. To ensure reproducible analyses, we archive versioned snapshots of the PHMSA data on Zenodo. These archives are periodically refreshed with new data from the PHMSA website.
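Because different form sections have different primary keys, it can help to check which combination of columns uniquely identifies a row before joining or aggregating a table. The following is a minimal sketch using only the Python standard library. The column names (report_year, operator_id, state, miles), the sample records, and the find_duplicate_keys helper are hypothetical illustrations, not actual PHMSA or PUDL names.

```python
from collections import Counter


def find_duplicate_keys(rows, key_columns):
    """Return the sorted key tuples that occur more than once in rows."""
    counts = Counter(tuple(row[col] for col in key_columns) for row in rows)
    return sorted(key for key, n in counts.items() if n > 1)


# Hypothetical records: one row per operator, year, and state of operation.
rows = [
    {"report_year": 2020, "operator_id": 101, "state": "CO", "miles": 12.5},
    {"report_year": 2020, "operator_id": 101, "state": "WY", "miles": 3.0},
    {"report_year": 2020, "operator_id": 202, "state": "CO", "miles": 7.25},
]

# Year and operator alone do not identify a row in this made-up section...
operator_year_dupes = find_duplicate_keys(rows, ["report_year", "operator_id"])
# ...but adding the state of operation does.
operator_year_state_dupes = find_duplicate_keys(
    rows, ["report_year", "operator_id", "state"]
)

print(operator_year_dupes)        # [(2020, 101)]
print(operator_year_state_dupes)  # []
```

An empty result means the candidate columns form a valid primary key for the rows checked. A non-empty result lists the key values that still need another column, such as commodity, to be disambiguated.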
To understand the details of how the form and data have evolved over time, we recommend reading the Form Instructions from different years, linked above.
We are still in the early stages of cleaning and integrating PHMSA data into PUDL. This section will be updated as we learn more about the particularities of this dataset.
PUDL Data Transformations#
The PUDL transformation process cleans the input data so that it is adjusted for uniformity, corrected for errors, and ready for bulk programmatic use.
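As a rough illustration of what "adjusted for uniformity" can mean in practice, the sketch below normalizes column headers and blanks out empty string values in a single record. It is not the code used in PUDL. The clean_column_name and clean_record helpers and the sample record are hypothetical, and the real transformations may differ substantially.

```python
import re


def clean_column_name(name):
    """Lowercase a header and collapse runs of other characters to '_'."""
    return re.sub(r"[^0-9a-z]+", "_", name.strip().lower()).strip("_")


def clean_record(record):
    """Normalize the keys of one record and turn empty strings into None."""
    cleaned = {}
    for key, value in record.items():
        if isinstance(value, str):
            # Strip surrounding whitespace; an empty result becomes None.
            value = value.strip() or None
        cleaned[clean_column_name(key)] = value
    return cleaned


# Hypothetical raw record with inconsistent headers and padding.
raw = {
    " Operator Name ": "  Example Gas Co ",
    "MILES (Steel)": "",
    "Report Year": 2021,
}
cleaned = clean_record(raw)
print(cleaned)
```

Applying the same normalization to every reporting year gives the yearly spreadsheets a consistent set of column names, which makes it easier to concatenate them into one table.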
To see the transformations applied to the data in each table, you can read the docstrings in pudl.transform.phmsagas written for each table’s respective transform function.