pudl.extract.epaipm module
Retrieve data from EPA’s Integrated Planning Model (IPM) v6.
Unlike most of the PUDL data sources, IPM is not an annual timeseries. This module assumes that only v6 will be used as an input, so there are only a limited number of files to read.
This module was written by @gschivley.
pudl.extract.epaipm.create_dfs_epaipm(files, data_dir)
Makes a dictionary of pages (keys) to dataframes (values) for epaipm tabs.
pudl.extract.epaipm.extract(epaipm_tables, data_dir)
Extracts data from IPM files.
- Parameters
epaipm_tables (iterable) – A tuple or list of table names to extract
data_dir (path-like) – Path to the top directory of the PUDL datastore.
- Returns
dictionary of DataFrames with extracted (but not yet transformed) data from each file.
- Return type
pudl.extract.epaipm.get_epaipm_file(filename, read_file_args, data_dir)
Reads in files to create dataframes.
No need to use ExcelFile objects with the IPM files because each file is only a single sheet.
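The functions above follow a common pattern: read each single-table file with its own reader arguments, then collect the results in a dictionary keyed by page name. The sketch below illustrates that pattern only and is not the PUDL implementation. The names load_tables and read_csv_rows, the shape of the files mapping, and the sample data are all hypothetical. To stay dependency-free it uses the standard-library csv module as a stand-in reader, so each table is a list of dicts rather than a DataFrame.

```python
import csv
import tempfile
from pathlib import Path


def read_csv_rows(path, **read_args):
    """Hypothetical stand-in reader: return a CSV file's rows as a list of dicts."""
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle, **read_args))


def load_tables(files, data_dir, reader=read_csv_rows):
    """Map each page name to the table read from its file.

    `files` is assumed to map page name -> (file name, reader keyword args).
    This layout is an illustration, not the structure PUDL uses.
    """
    data_dir = Path(data_dir)
    return {
        page: reader(data_dir / filename, **read_args)
        for page, (filename, read_args) in files.items()
    }


# Demo with made-up data written to a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "regions.csv").write_text("region,load_mw\nA,10\nB,20\n")
    Path(tmp, "links.csv").write_text("src;dst\nA;B\n")
    files = {
        "regions": ("regions.csv", {}),
        "links": ("links.csv", {"delimiter": ";"}),
    }
    tables = load_tables(files, tmp)
```

Because each file holds exactly one table, a single reader call per file is enough and no multi-sheet container is needed. With pandas installed, passing reader=pandas.read_csv (or pandas.read_excel for spreadsheet files) would make each dictionary value a DataFrame.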