Welcome! The PUDL data viewer is in beta right now. We filmed a short tutorial video, and would love any feedback you have for the site!
Welcome to PUDL!
The Public Utility Data Liberation (PUDL) project takes publicly available U.S. energy data and makes it publicly accessible. PUDL is a free and open source project managed and maintained by Catalyst Cooperative.

Find the energy data you need:

PUDL has over 300 tables from over a dozen different datasets. You can start by searching the PUDL Data Viewer for terms or data points you care about directly. We recommend working with tables with the out prefix, as these tables contain the most-complete data that is the easiest to work with.

If you're not sure what to search for, or just browsing, here are some ways to get started with different collections of PUDL tables:

Know the original source of the data you'd like to access? Select an organization below to see all sources we support for that organization, and find all tables from each source.

United States Census Bureau (1 source)

Census DP1 -- Profile of General Demographic Characteristics

A 2010 copy of the US Census Demographic Profile 1 (DP1) County and Tract GeoDatabase, including select demographic information at the state, county, and census tract level, along with the geometries of those geographic areas. It is a convenient source of state and county boundaries for use in mapping and visualizations, and we use county-level population to provide context for the FERC-714 hourly electricity demand. However, it should not be considered canonical information since it's from 2010 and more recent analogous data is available from the Census directly.

EIA - U.S. Energy Information Administration (7 sources)

EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition

The EIA Form 176, also known as the Annual Report of Natural and Supplemental Gas Supply and Disposition, describes the origins, suppliers, and disposition of natural gas on a yearly and state by state basis.

EIA Form 860 -- Annual Electric Generator Report

US Energy Information Administration (EIA) Form 860 data for electric power plants with 1 megawatt or greater combined nameplate capacity.

EIA Form 860M -- Monthly Update to the Annual Electric Generator Report

US Energy Information Administration (EIA) Form 860 M data for electric power plants with 1 megawatt or greater combined nameplate capacity. Preliminary Monthly Electric Generator Inventory (based on Form EIA-860M as a supplement to Form EIA-860)

EIA Form 861 -- Annual Electric Power Industry Report

EIA Form 861 Annual Electric Power Industry Report, detailed data files.

EIA Form 923 -- Power Plant Operations Report

The EIA Form 923 collects detailed monthly and annual electric power data on electricity generation, fuel consumption, fossil fuel stocks, and receipts at the power plant and prime mover level.

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

The EIA Form 930 provides hourly demand and generation statistics by balancing area (or sub-balancing area in the case of some larger areas). These statistics include a breakdown by energy source (coal, gas, hydro, wind, solar, etc.) as well as interchange between the balancing areas, including international exchanges with Canada and Mexico.

EIA Annual Energy Outlook (AEO)

The EIA Annual Energy Outlook provides projections of future fuel prices, energy supply and consumption, and carbon dioxide emissions by sector and region.

EIA Bulk API Data

All data made available in bulk through the EIA Open Data API. At present, PUDL integrates only a few specific data series related to fuel receipts and costs figures from the Bulk Electricity API.

EPA - U.S. Environmental Protection Agency (2 sources)

EPA Hourly Continuous Emission Monitoring System (CEMS)

US EPA hourly Continuous Emissions Monitoring System (CEMS) data.Hourly CO2, SO2, NOx emissions and gross load.

EPA CAMD to EIA Power Sector Data Crosswalk

A file created collaboratively by EPA and EIA that connects EPA CAMD smokestacks (units) with corresponding EIA plant part ids reported in EIA Forms 860 and 923 (plant_id_eia, boiler_id, generator_id). This one-to-many connection is necessary because pollutants from various plant parts are collecitvely emitted and measured from one point-source.

FERC - Federal Energy Regulatory Commission (3 sources)

FERC Form 1 -- Annual Report of Major Electric Utilities

The Federal Energy Regulatory Commission (FERC) Form 1 is a comprehensive financial and operating report submitted annually for electric rate regulation, market oversight analysis, and financial audits by Major electric utilities, licensees and others.

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report

Electric transmitting utilities operating balancing authority areas and planning areas with annual peak demand over 200MW are required to file Form 714 with the Federal Energy Regulatory Commission (FERC), reporting balancing authority area generation, actual and scheduled inter-balancing authority area power transfers, and net energy for load, summer-winter generation peaks and system lambda.

FERC Form 920 -- Electric Quarterly Report (EQR)

The Electric Quarterly Report (EQR) is submitted by sellers participating in bilateral electricity market transactions. The reports summarize the contractual terms and conditions in agreements for all jurisdictional services, including cost-based sales, market-based rate sales, and transmission service, as well as transaction information for short-term and long-term market-based power sales and cost-based power sales.

GridLab (1 source)

GridPath Resource Adequacy Toolkit Data

Hourly renewable generation profiles compiled for the Western United States as part of the GridPath Resource Adequacy Toolkit. This data also contains some daily weather data from several sites across the western US and tables describing the way in which individual wind and solar projects were aggregated up to the level of balancing authority or transmission zone.

NREL/NLR - National Laboratory of the Rockies (1 source)

NREL Annual Technology Baseline (ATB) for Electricity

The NREL Annual Technology Baseline (ATB) for Electricity publishes annual projections of operational and capital expenditures (by technology and vintage), as well as operating characteristics (by technology).

PHMSA - Pipelines and Hazardous Materials Safety Administration (1 source)

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report

Annual reports submitted to PHMSA from gas distribution, gas gathering, gas transmission, liquefied natural gas, and underground gas storage system operators. Annual reports include information such as total pipeline mileage, facilities, commodities transported, miles by material, and installation dates.

SEC - U.S. Securities and Exchange Commission (1 source)

U.S. Securities and Exchange Commission (SEC) Form 10-K

The SEC Form 10-K is an annual report required by the U.S. Securities and Exchange Commission (SEC), that gives a comprehensive summary of a company's financial performance.

USDA RUS - Rural Utilities Service (2 sources)

USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution

The United States Department of Agriculture (USDA) Financial and Operating Report for Electric Distribution (formerly known as Form 7) is an annual report submitted by rural electric utilities that receive funding from the USDA Rural Utilities Service (RUS) for distribution services. This data was obtained through multiple Freedom of Information Access (FOIA) requests.

USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply

The United States Department of Agriculture (USDA) Financial and Operating Report for Electric Power Supply (formerly known as Form 12) is an annual report submitted by rural electric utilities that receive funding from the USDA Rural Utilities Service (RUS) for power supply (generation). This data was obtained through multiple Freedom of Information Access (FOIA) requests.

VCE - Vibrant Clean Energy (1 source)

Vibrant Clean Energy Resource Adequacy Renewable Energy (RARE) Power Dataset

This dataset was produced by Vibrant Clean Energy and is licensed to the public under the Creative Commons Attribution 4.0 International license (CC-BY-4.0). The data consists of hourly, county-level renewable generation profiles in the continental United States and was compiled based on outputs from the NOAA HRRR weather model. Profiles are stated as a capacity factor (a percentage of nameplate capacity) and exist for onshore wind, offshore wind, and fixed-tilt solar generation types.

We have curated a few clusters of tables for topics we know PUDL users care about.

Want to explore the most widely used PUDL tables?

Can't find what you are looking for?

Contact us at [email protected]!


All the ways to access PUDL data

Found what you are looking for but want a different way to access PUDL data?

Platform Format Version User Types Use Cases
PUDL Data Viewer Parquet, CSV nightly Data Explorer, Spreadsheet Analyst, Jupyter Notebook User You are here! Explore PUDL data interactively in a web browser, including hourly timeseries data. Select data to download as CSVs for local analysis in spreadsheets. Download full tables as Parquet files to play with programmatically.
Cloud Storage SQLite, Parquet nightly, stable Data Scientist, Analytics Engineer, Data Engineer, Cloud Developer Performant remote queries of clearly versioned PUDL Parquet outputs from cloud computing platforms or GitHub Actions. Fast bulk download of SQLite or Parquet outputs for local use. Parquet based data warehouse for large-scale data analysis in the cloud. Integrates well with Pandas, DuckDB, and other dataframe libraries.
Zenodo Archives SQLite, Parquet stable Researcher, Publisher, Archivist Access a specific, immutable version of the PUDL data by DOI for citation in academic publications or other applications where long-term reproducibility is needed. Web-based bulk download of data for local analysis.
Kaggle SQLite, Parquet nightly Data Scientist, Data Analyst, Jupyter Notebook User Work with PUDL data products in Jupyter Notebooks via the web with minimal setup. Explore curated and contributed analyses and visualizations using PUDL data. Create and share your own interactive notebooks using PUDL data.

Decoding table names

Intrepid PUDL spelunkers might notice we use a naming convention for our tables. Knowing a little about it might help you better find what you are looking for.

{layer}_{source}__{asset_type}_{asset_name}

  • layer is the level of processing of the table. The options are:
    • core indicates the table is tidy or well-normalized. Data in this layer is cleaned, well-structured and minimally modified.
    • out tables are in a wide and complete format suitable for direct analysis. This means we've added back in a lot of the helpful human-readable labels for numeric identifiers and abbreviated codes. We recommend you start here!
    • _core or _out means the table is intermediate or experimental and has not been fully cleaned and polished. We are probably publishing it because we want it out in the world so you can use it, but we're still vetting it and don't love its level of cleanliness yet.
  • source is an abbreviation of the original source of the data. For example, eia860, ferc1 and epacems. You can browse our complete list of sources above.
  • asset_type describes how the asset is modeled and how it is designed to be used. The options are:
    • assn provides connections between entities; for example, core_epa__assn_eia_epacamd connects EPA units with their associated EIA plants, boilers, and generators.
    • changelog tracks changes in an entity's values over time, preserving the first reported instance when any of the tracked variables changed, and ignoring rows where data stays the same.
    • codes expands the descriptions of categorical codes.
    • entity contains static information about entities that don't change over time, like the state a plant is located in or the plant a boiler is a part of.
    • scd describes attributes of entities that change only rarely, like the ownership or the capacity of a plant. It stands for "slowly-changing dimension".
    • yearly|monthly|hourly indicates a timeseries, containing attributes that are expected to change for each reported timestamp.
  • asset_name describes the entity, categorical code type, or measurement of the asset. Sometimes this also includes information about the original source, like how FERC Form 1 tables include the table's schedule number.

For more information on how we name tables and columns, see PUDL Naming Conventions.


PUDL data standards

There is a vast amount of U.S. energy data. It is reported in a variety of formats. Data is often reported in annual files. Data sources reporting about the same set of infrastructure - say power plants or generators - don't always include ways to link to other data sources. Sometimes the data originally reported is pretty clean while sometimes the data is pretty messy 🗑️🔥. We do our best to raise the floor on data quality while making decisions that a wide range of users will find acceptable.

We aim to publish data in consistent and familiar data formats, with long timeseries, using standardized names and codes, with reliable connections both within and between datasets, and at as granular a level as possible. We pull in and publish big data updates annually and quarterly, smaller ones monthly, and make our nightly builds available so that users have access to the freshest data that meet our standards.