pudl.extract.epacems module¶
Retrieve data from EPA CEMS hourly zipped CSVs.
This modules pulls data from EPA’s published CSV files.
-
pudl.extract.epacems.
extract
(epacems_years, states, data_dir)[source]¶ Coordinate the extraction of EPA CEMS hourly DataFrames.
- Parameters
- Yields
dict – a dictionary of States (keys) and DataFrames of CEMS data (values)
Todo
This is really slow. Can we do some parallel processing?