Find the energy data you need:
PUDL has over 300 tables from over a dozen different datasets. You can start by searching the PUDL Data Viewer for terms or data points you care about directly. We recommend working with tables with the out prefix, as these tables contain the most-complete data that is the easiest to work with.
If you're not sure what to search for, or just browsing, here are some ways to get started with different collections of PUDL tables:
Know the original source of the data you'd like to access? Select an organization below to see all sources we support for that organization, and find all tables from each source.
United States Census Bureau (1 source)
Census DP1 -- Profile of General Demographic Characteristics
A 2010 copy of the US Census Demographic Profile 1 (DP1) County and Tract GeoDatabase, including select demographic information at the state, county, and census tract level, along with the geometries of those geographic areas. It is a convenient source of state and county boundaries for use in mapping and visualizations, and we use county-level population to provide context for the FERC-714 hourly electricity demand. However, it should not be considered canonical information since it's from 2010 and more recent analogous data is available from the Census directly.
EIA - U.S. Energy Information Administration (7 sources)
EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition
The EIA Form 176, also known as the Annual Report of Natural and Supplemental Gas Supply and Disposition, describes the origins, suppliers, and disposition of natural gas on a yearly and state by state basis.
EIA Form 860 -- Annual Electric Generator Report
US Energy Information Administration (EIA) Form 860 data for electric power plants with 1 megawatt or greater combined nameplate capacity.
EIA Form 860M -- Monthly Update to the Annual Electric Generator Report
US Energy Information Administration (EIA) Form 860 M data for electric power plants with 1 megawatt or greater combined nameplate capacity. Preliminary Monthly Electric Generator Inventory (based on Form EIA-860M as a supplement to Form EIA-860)
EIA Form 861 -- Annual Electric Power Industry Report
EIA Form 861 Annual Electric Power Industry Report, detailed data files.
EIA Form 923 -- Power Plant Operations Report
The EIA Form 923 collects detailed monthly and annual electric power data on electricity generation, fuel consumption, fossil fuel stocks, and receipts at the power plant and prime mover level.
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
The EIA Form 930 provides hourly demand and generation statistics by balancing area (or sub-balancing area in the case of some larger areas). These statistics include a breakdown by energy source (coal, gas, hydro, wind, solar, etc.) as well as interchange between the balancing areas, including international exchanges with Canada and Mexico.
EIA Annual Energy Outlook (AEO)
The EIA Annual Energy Outlook provides projections of future fuel prices, energy supply and consumption, and carbon dioxide emissions by sector and region.
EIA Bulk API Data
All data made available in bulk through the EIA Open Data API. At present, PUDL integrates only a few specific data series related to fuel receipts and costs figures from the Bulk Electricity API.
EPA - U.S. Environmental Protection Agency (2 sources)
EPA Hourly Continuous Emission Monitoring System (CEMS)
US EPA hourly Continuous Emissions Monitoring System (CEMS) data.Hourly CO2, SO2, NOx emissions and gross load.
EPA CAMD to EIA Power Sector Data Crosswalk
A file created collaboratively by EPA and EIA that connects EPA CAMD smokestacks (units) with corresponding EIA plant part ids reported in EIA Forms 860 and 923 (plant_id_eia, boiler_id, generator_id). This one-to-many connection is necessary because pollutants from various plant parts are collecitvely emitted and measured from one point-source.
FERC - Federal Energy Regulatory Commission (3 sources)
FERC Form 1 -- Annual Report of Major Electric Utilities
The Federal Energy Regulatory Commission (FERC) Form 1 is a comprehensive financial and operating report submitted annually for electric rate regulation, market oversight analysis, and financial audits by Major electric utilities, licensees and others.
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report
Electric transmitting utilities operating balancing authority areas and planning areas with annual peak demand over 200MW are required to file Form 714 with the Federal Energy Regulatory Commission (FERC), reporting balancing authority area generation, actual and scheduled inter-balancing authority area power transfers, and net energy for load, summer-winter generation peaks and system lambda.
FERC Form 920 -- Electric Quarterly Report (EQR)
The Electric Quarterly Report (EQR) is submitted by sellers participating in bilateral electricity market transactions. The reports summarize the contractual terms and conditions in agreements for all jurisdictional services, including cost-based sales, market-based rate sales, and transmission service, as well as transaction information for short-term and long-term market-based power sales and cost-based power sales.
GridLab (1 source)
GridPath Resource Adequacy Toolkit Data
Hourly renewable generation profiles compiled for the Western United States as part of the GridPath Resource Adequacy Toolkit. This data also contains some daily weather data from several sites across the western US and tables describing the way in which individual wind and solar projects were aggregated up to the level of balancing authority or transmission zone.
NREL/NLR - National Laboratory of the Rockies (1 source)
NREL Annual Technology Baseline (ATB) for Electricity
The NREL Annual Technology Baseline (ATB) for Electricity publishes annual projections of operational and capital expenditures (by technology and vintage), as well as operating characteristics (by technology).
PHMSA - Pipelines and Hazardous Materials Safety Administration (1 source)
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report
Annual reports submitted to PHMSA from gas distribution, gas gathering, gas transmission, liquefied natural gas, and underground gas storage system operators. Annual reports include information such as total pipeline mileage, facilities, commodities transported, miles by material, and installation dates.
SEC - U.S. Securities and Exchange Commission (1 source)
U.S. Securities and Exchange Commission (SEC) Form 10-K
The SEC Form 10-K is an annual report required by the U.S. Securities and Exchange Commission (SEC), that gives a comprehensive summary of a company's financial performance.
USDA RUS - Rural Utilities Service (2 sources)
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution
The United States Department of Agriculture (USDA) Financial and Operating Report for Electric Distribution (formerly known as Form 7) is an annual report submitted by rural electric utilities that receive funding from the USDA Rural Utilities Service (RUS) for distribution services. This data was obtained through multiple Freedom of Information Access (FOIA) requests.
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply
The United States Department of Agriculture (USDA) Financial and Operating Report for Electric Power Supply (formerly known as Form 12) is an annual report submitted by rural electric utilities that receive funding from the USDA Rural Utilities Service (RUS) for power supply (generation). This data was obtained through multiple Freedom of Information Access (FOIA) requests.
VCE - Vibrant Clean Energy (1 source)
Vibrant Clean Energy Resource Adequacy Renewable Energy (RARE) Power Dataset
This dataset was produced by Vibrant Clean Energy and is licensed to the public under the Creative Commons Attribution 4.0 International license (CC-BY-4.0). The data consists of hourly, county-level renewable generation profiles in the continental United States and was compiled based on outputs from the NOAA HRRR weather model. Profiles are stated as a capacity factor (a percentage of nameplate capacity) and exist for onshore wind, offshore wind, and fixed-tilt solar generation types.
We have curated a few clusters of tables for topics we know PUDL users care about.
-
Generation plant operations
- out_eia__monthly_generators
- Monthly time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- out_eia__yearly_generators
- Annual time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- out_ferc1__yearly_all_plants
- Annual time series of plants reported in the steam, small generators, hydro, and pumped storage tables.
- out_pudl__yearly_assn_eia_ferc1_plant_parts
- Annual Association table providing connections between power plant data reported in FERC Form 1 and any available EIA data related to the plant parts covered by each FERC Form 1 record.
-
Emissions
- core_epacems__hourly_emissions
- Hourly time series of emissions and plant operational data as required by 40 CFR Part 75.
- out_eia860__yearly_emissions_control_equipment
- Annual time series of the cost, type, operating status, retirement date, and install year of emissions control equipment reported to EIA.
- core_eia860__assn_yearly_boiler_emissions_control_equipment
- Association table providing connections between EIA boiler IDs and emissions control IDs for NOx, SO2, mercury, and particulate monitoring.
-
Rate base & utility finance
- out_ferc1__yearly_rate_base
- Annual time series of granular accounting data consisting of what utilities can typically include in their rate bases.
- core_eia861__yearly_sales
- Annual time series of electricity sales to ultimate customers by utility, balancing authority, state, and customer class.
- core_eia861__yearly_operational_data_revenue
- Annual time series of utility revenue by type of electric operating revenue.
- out_ferc1__yearly_detailed_balance_sheet_assets
- Annual time series of granular, de-duplicated accounting data of utilities' balance sheet assets.
- out_ferc1__yearly_detailed_balance_sheet_liabilities
- Annual time series of granular, de-duplicated accounting data of utilities' balance sheet liabilities.
- out_ferc1__yearly_operating_revenues_sched300
- Annual time series of utilities' electric operating revenues.
- out_ferc1__yearly_sales_by_rate_schedules_sched304
- Annual time series of utilities' electric sales from all rate schedules in effect throughout the year.
-
Demand
- out_eia930__hourly_operations
- Hourly time series of balancing authority net generation, interchange, and demand with imputed demand.
- out_eia930__hourly_subregion_demand
- Hourly time series of balancing authority subregion demand with imputed demand.
- out_eia930__hourly_aggregated_demand
- Hourly time series of aggregated balancing authority demand by EIA region, interconnect, and continental US.
- out_ferc714__hourly_planning_area_demand
- Hourly time series of electricity demand by planning area.
- out_ferc714__hourly_estimated_state_demand
- Hourly time series of estimated electricity demand for each state, scaled such that it matches the total electricity sales by state reported in EIA 861.
- out_ferc714__summarized_demand
- Summarized demand statistics and FERC-714 respondent attributes by respondent-year.
-
Renewables
- out_vcerare__hourly_available_capacity_factor
- Hourly time series of estimated county-averaged capacity factors for wind and solar generating facilities across the contiguous United States (US), to be used as a tool and input for resource adequacy modeling and planning.
- out_gridpathratoolkit__hourly_available_capacity_factor
- Hourly time series of capacity factors defining the capacity available from an aggregated group of generators.
- core_eia860__scd_generators_energy_storage
- Slowly changing dimension (SCD) table describing attributes of energy storage which may vary from year to year.
- core_eia860__scd_generators_solar
- Slowly changing dimension (SCD) table describing attributes of solar generators that may vary from year to year.
- core_eia860__scd_generators_wind
- Slowly changing dimension (SCD) table describing attributes of wind generators that may vary from year to year.
- core_eia923__monthly_energy_storage
- Monthly time series of energy consumed and discharged ("generated") by energy storage units.
-
Cost and deployment predictions
- core_eiaaeo__yearly_projected_electric_sales
- Annual time series of projected electricity sales by region and customer class.
- core_eiaaeo__yearly_projected_energy_use_by_sector_and_type
- Annual time series of projected energy use for commercial, electric power, industrial, residential, and transportation sectors, drawn from AEO Table 2.
- core_eiaaeo__yearly_projected_fuel_cost_in_electric_sector_by_type
- Annual time series of projected fuel prices for the electric power sector, including electricity-only and combined-heat-and-power plants that have a regulatory status.
- core_eiaaeo__yearly_projected_generation_in_electric_sector_by_technology
- Annual time series of projected generation capacity & total generation in the electric sector, broken out by technology.
- core_eiaaeo__yearly_projected_generation_in_end_use_sectors_by_fuel_type
- Annual time series of projected generation capacity and total generation in the end-use sector, broken out by fuel type.
- core_nrelatb__yearly_projected_cost_performance
- Annual time series of projections of costs and performance for various technologies.
- core_nrelatb__yearly_projected_financial_cases
- Annual time series of financial assumptions for each model case (model_case_nrelatb), and sub-type of technology (technology_description).
- core_nrelatb__yearly_projected_financial_cases_by_scenario
- Annual time series of additional financial assumptions for NREL ATB projections that also vary by technology innovation scenario (scenario_atb), tax credit case (model_tax_credit_case_nrelatb), and cost recovery period (cost_recovery_period_years).
- core_nrelatb__yearly_technology_status
- Annual time series of technology statuses, indicating whether technologies in the ATB scenarios are mature, and whether they are the default technologies.
-
Distributed energy and demand management programs
- core_eia861__yearly_advanced_metering_infrastructure
- Annual time series of advanced metering infrastructure (AMI) and automated meter reading (AMR) by state, sector, and balancing authority.
- core_eia861__yearly_demand_response
- Annual time series of demand response programs by state, sector, and balancing authority.
- core_eia861__yearly_demand_response_water_heater
- Annual time series of grid-connected water heaters enrolled in demand response programs.
- core_eia861__yearly_demand_side_management_ee_dr
- Annual time series of The impact of energy efficiency and load management programs on total energy sold (MWh) and peak demand (MW) by customer class.
- core_eia861__yearly_demand_side_management_misc
- Annual time series of demand-side management (DSM) program information.
- core_eia861__yearly_demand_side_management_sales
- Annual time series of electricity sales related to demand-side management (DSM).
- core_eia861__yearly_distributed_generation_fuel
- Annual time series of the energy sources used for utility or customer-owned distributed generation capacity.
- core_eia861__yearly_distributed_generation_misc
- Annual time series of the capacity and quantity of utility or customer-owned distributed generation.
- core_eia861__yearly_distributed_generation_tech
- Annual time series of the technology used for utility or customer-owned distributed generation.
- core_eia861__yearly_distribution_systems
- Annual time series of distribution circuits and circuits with voltage optimization by state.
- core_eia861__yearly_dynamic_pricing
- Annual time series of enrollment in dynamic pricing programs by state, sector, and balancing authority.
- core_eia861__yearly_energy_efficiency
- Annual time series of incremental energy savings, peak demand savings, weighted average life
- core_eia861__yearly_green_pricing
- Annual time series of green pricing program revenue, sales, and customer count by sector and state.
- core_eia861__yearly_net_metering_customer_fuel_class
- Annual time series of net metering by customer and fuel class.
- core_eia861__yearly_net_metering_misc
- Annual time series of PV current flow type for net metered capacity.
-
Utility ownership
- out_eia__yearly_generators_by_ownership
- Annual time series of all EIA generators with ownership integrated.
- out_eia923__yearly_generation_fuel_by_generator_energy_source_owner
- Annual time series of of estimated net generation and fuel consumption for each generator, associated with each combination of generator, energy source, prime mover, and owner.
- out_sec10k__parents_and_subsidiaries
- A denormalized table containing information about parent companies that file SEC Form 10-K and their subsidiaries, which may or may not file Form 10-K.
- core_sec10k__assn_sec10k_filers_and_eia_utilities
- Association table providing connections between SEC 10-K filers and EIA utilities.
- core_sec10k__assn_exhibit_21_subsidiaries_and_eia_utilities
- Association table providing connections between subsidiaries listed in Exhibit 21 and EIA utilities.
-
Transmission
- out_ferc1__yearly_transmission_lines_sched422
- Annual time series of statistics about transmission lines.
-
Gas distribution
- core_eia176__yearly_gas_disposition
- Annual time series of a company's natural and supplemental gas disposition for the report state.
- core_eia176__yearly_gas_disposition_by_consumer
- Annual time series of a company's natural gas deliveries to end-use consumers within the report state.
- core_phmsagas__yearly_distribution_operators
- Annual time series of distribution operator information.
- _core_phmsagas__yearly_distribution_by_install_decade
- Annual time series of miles of mains and the number of services in operation at the end of the year by install decade.
- _core_phmsagas__yearly_distribution_by_material
- Annual time series of miles of mains and the number of services in operation at the end of the year by material for each gas distribution operator.
- _core_phmsagas__yearly_distribution_by_material_and_size
- Annual time series of miles of mains and the number of services in operation at the end of the year by material and size of pipe.
- _core_phmsagas__yearly_distribution_excavation_damages
- Annual time series of excavation damages from various sources.
- _core_phmsagas__yearly_distribution_filings
- Annual time series of filings (aka submissions) from gas distribution system operators.
- _core_phmsagas__yearly_distribution_leaks
- Annual time series of total and hazardous leaks eliminated or repaired during the report year.
- _core_phmsagas__yearly_distribution_misc
- Annual time series of miscellaneous distribution information.
-
Electricity markets / transactions / interchange
- core_ferceqr__contracts
- Contains information about contracts between companies selling and buying electricity market products.
- core_ferceqr__transactions
- Contains information about individual electricity market transactions that took place during a given reporting quarter. Reported by the seller.
- core_eia930__hourly_interchange
- Hourly time series of balancing authority interchange.
- out_ferc1__yearly_purchased_power_and_exchanges_sched326
- Annual time series of purchased power (Account 555) including power exchanges (transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges.
-
Generation/plant operational costs
- out_eia923__monthly_fuel_receipts_costs
- Monthly time series of fuel deliveries to power plants.
- out_ferc1__yearly_all_plants
- Annual time series of plants reported in the steam, small generators, hydro, and pumped storage tables.
-
Geospatial information
- out_ferc714__georeferenced_respondents
- Processing:
- out_ferc714__respondents_with_fips
- Annual respondents with the county FIPS IDs for their service territories.
- out_censusdp1tract__states
- Processing:
- out_censusdp1tract__counties
- Processing:
- out_censusdp1tract__tracts
- Processing:
- out_eia861__yearly_balancing_authority_service_territory
- Annual time series of counties in balancing authority service territories.
- out_eia861__yearly_utility_service_territory
- Annual time series of counties in utility service territories.
- out_eia923__monthly_fuel_receipts_costs
- Monthly time series of fuel deliveries to power plants.
- out_vcerare__hourly_available_capacity_factor
- Hourly time series of estimated county-averaged capacity factors for wind and solar generating facilities across the contiguous United States (US), to be used as a tool and input for resource adequacy modeling and planning.
- core_eia861__yearly_service_territory
- Annual time series of counties in utility service territories.
- core_eia923__entity_coalmine
- Entity table containing static information about coal mines reporting deliveries in the Fuel Receipts and Costs table.
- _core_eia860__fgd_equipment
- Information about flue gas desulfurization equipment at generation facilities.
- out_eia__monthly_generators
- Monthly time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- out_eia__yearly_generators
- Annual time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- out_eia__yearly_boilers
- Annual time series of all EIA boiler attributes.
- out_eia__yearly_plants
- Annual time series of all EIA plant attributes.
- core_eia__entity_plants
- Entity table containing static information about plants, compiled from across all EIA-860 and EIA-923 data.
Want to explore the most widely used PUDL tables?
- out_eia__yearly_generators and out_eia__monthly_generators - Time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- core_eia860__scd_plants - Slowly changing dimension (SCD) table describing attributes of plants which may vary from year to year. Compiled from across all EIA-860 and EIA-923 data.
- core_epacems__hourly_emissions - Hourly time series of emissions and plant operational data as required by 40 CFR Part 75.
- core_nrelatb__yearly_projected_cost_performance - Annual time series of projections of costs and performance for various technologies.
- out_vcerare__hourly_available_capacity_factor - Hourly time series of estimated county-averaged capacity factors for wind and solar generating facilities across the contiguous United States (US), to be used as a tool and input for resource adequacy modeling and planning.
- core_eiaaeo__yearly_projected_fuel_cost_in_electric_sector_by_type - Annual time series of projected fuel prices for the electric power sector, including electricity-only and combined-heat-and-power plants that have a regulatory status.
- out_eia__yearly_utilities - Annual time series of all EIA utility attributes.
- core_eia860__scd_generators_energy_storage - Slowly changing dimension (SCD) table describing attributes of energy storage which may vary from year to year.
- core_eia930__hourly_interchange - Hourly time series of balancing authority interchange.
- out_eia923__monthly_fuel_receipts_costs - Monthly time series of fuel deliveries to power plants.
- out_ferc714__hourly_planning_area_demand - Hourly time series of electricity demand by planning area.
- core_nrelatb__yearly_projected_financial_cases_by_scenario - Annual time series of additional financial assumptions for NREL ATB projections that also vary by technology innovation scenario (scenario_atb), tax credit case (model_tax_credit_case_nrelatb), and cost recovery period (cost_recovery_period_years).
- out_eia__yearly_plants - Annual time series of all EIA plant attributes.
Can't find what you are looking for?
Contact us at [email protected]!
All the ways to access PUDL data
Found what you are looking for but want a different way to access PUDL data?
| Platform | Format | Version | User Types | Use Cases |
|---|---|---|---|---|
| PUDL Data Viewer | Parquet, CSV | nightly | Data Explorer, Spreadsheet Analyst, Jupyter Notebook User | You are here! Explore PUDL data interactively in a web browser, including hourly timeseries data. Select data to download as CSVs for local analysis in spreadsheets. Download full tables as Parquet files to play with programmatically. |
| Cloud Storage | SQLite, Parquet | nightly, stable | Data Scientist, Analytics Engineer, Data Engineer, Cloud Developer | Performant remote queries of clearly versioned PUDL Parquet outputs from cloud computing platforms or GitHub Actions. Fast bulk download of SQLite or Parquet outputs for local use. Parquet based data warehouse for large-scale data analysis in the cloud. Integrates well with Pandas, DuckDB, and other dataframe libraries. |
| Zenodo Archives | SQLite, Parquet | stable | Researcher, Publisher, Archivist | Access a specific, immutable version of the PUDL data by DOI for citation in academic publications or other applications where long-term reproducibility is needed. Web-based bulk download of data for local analysis. |
| Kaggle | SQLite, Parquet | nightly | Data Scientist, Data Analyst, Jupyter Notebook User | Work with PUDL data products in Jupyter Notebooks via the web with minimal setup. Explore curated and contributed analyses and visualizations using PUDL data. Create and share your own interactive notebooks using PUDL data. |
Decoding table names
Intrepid PUDL spelunkers might notice we use a naming convention for our tables. Knowing a little about it might help you better find what you are looking for.
{layer}_{source}__{asset_type}_{asset_name}
-
layer
is the level of processing of the table. The options are:
- core indicates the table is tidy or well-normalized. Data in this layer is cleaned, well-structured and minimally modified.
- out tables are in a wide and complete format suitable for direct analysis. This means we've added back in a lot of the helpful human-readable labels for numeric identifiers and abbreviated codes. We recommend you start here!
- _core or _out means the table is intermediate or experimental and has not been fully cleaned and polished. We are probably publishing it because we want it out in the world so you can use it, but we're still vetting it and don't love its level of cleanliness yet.
- source is an abbreviation of the original source of the data. For example, eia860, ferc1 and epacems. You can browse our complete list of sources above.
-
asset_type
describes how the asset is modeled and how it is designed to be used. The
options are:
- assn provides connections between entities; for example, core_epa__assn_eia_epacamd connects EPA units with their associated EIA plants, boilers, and generators.
- changelog tracks changes in an entity's values over time, preserving the first reported instance when any of the tracked variables changed, and ignoring rows where data stays the same.
- codes expands the descriptions of categorical codes.
- entity contains static information about entities that don't change over time, like the state a plant is located in or the plant a boiler is a part of.
- scd describes attributes of entities that change only rarely, like the ownership or the capacity of a plant. It stands for "slowly-changing dimension".
- yearly|monthly|hourly indicates a timeseries, containing attributes that are expected to change for each reported timestamp.
- asset_name describes the entity, categorical code type, or measurement of the asset. Sometimes this also includes information about the original source, like how FERC Form 1 tables include the table's schedule number.
For more information on how we name tables and columns, see PUDL Naming Conventions.
PUDL data standards
There is a vast amount of U.S. energy data. It is reported in a variety of formats. Data is often reported in annual files. Data sources reporting about the same set of infrastructure - say power plants or generators - don't always include ways to link to other data sources. Sometimes the data originally reported is pretty clean while sometimes the data is pretty messy 🗑️🔥. We do our best to raise the floor on data quality while making decisions that a wide range of users will find acceptable.
We aim to publish data in consistent and familiar data formats, with long timeseries, using standardized names and codes, with reliable connections both within and between datasets, and at as granular a level as possible. We pull in and publish big data updates annually and quarterly, smaller ones monthly, and make our nightly builds available so that users have access to the freshest data that meet our standards.