out_eia__monthly_generators
Monthly time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
report_date, plant_id_eia, generator_id
Usage Warnings
Contains estimated values.
Due to coverage problems in other tables, some generators have heat rate estimates but not fuel cost estimates.
Not all columns are originally reported in or calculable from the input tables. Expect nulls.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This table includes all monthly attributes for all generators reported to EIA-860 and EIA-923.
To provide a complete picture of generator data, this table compiles data from many different EIA tables, including directly reported, imputed and calculated columns. We suggest using this table if you want to explore monthly attributes about generators and would rather use a more complete and denormalized table. If you are more interested in the originally reported values, we recommend searching the core_eia tables for the column you are most interested in.
The calculations of capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh are based on the allocation of net generation, which is reported by plant, prime mover and energy source, to individual generators. That allocation comes from the out_eia923__monthly_generation_fuel_by_generator table.
Monthly heat rates for each generator are estimated using the allocated per-generator estimates of net generation and fuel consumption, together with the core_eia923__monthly_boiler_fuel table, which reports fuel consumed by boiler. Heat rates are needed to estimate the amount of fuel consumed by a generation unit, and thus the fuel cost per MWh generated.
Plant-specific fuel prices are taken from the core_eia923__monthly_fuel_receipts_costs table, which only has ~70% coverage, so some generators with heat rate estimates still lack fuel cost estimates.
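The relationships among these derived quantities can be sketched as follows. This is a minimal illustration using the standard definitions of capacity factor and heat rate, not PUDL's actual implementation; the function names and the sample numbers are hypothetical.

```python
import calendar


def hours_in_month(year: int, month: int) -> int:
    """Number of hours in a calendar month."""
    return calendar.monthrange(year, month)[1] * 24


def capacity_factor(net_generation_mwh: float, capacity_mw: float, hours: int) -> float:
    """Fraction of potential generation that was actually produced."""
    return net_generation_mwh / (capacity_mw * hours)


def heat_rate_mmbtu_per_mwh(fuel_consumed_mmbtu: float, net_generation_mwh: float) -> float:
    """Fuel heat content consumed per MWh generated."""
    return fuel_consumed_mmbtu / net_generation_mwh


def fuel_cost_per_mwh(heat_rate: float, fuel_cost_per_mmbtu: float) -> float:
    """Fuel cost per MWh: (MMBTU per MWh) times (USD per MMBTU)."""
    return heat_rate * fuel_cost_per_mmbtu


# Hypothetical generator-month: a 100 MW unit in January 2020.
hours = hours_in_month(2020, 1)
cf = capacity_factor(37_200.0, 100.0, hours)
hr = heat_rate_mmbtu_per_mwh(372_000.0, 37_200.0)
cost = fuel_cost_per_mwh(hr, 3.0)
```

Because the fuel cost per MWh is the product of a heat rate and a fuel price, a generator with a heat rate estimate but no matching fuel price ends up with a null fuel cost.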
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
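The 70% rule described above can be sketched roughly as follows. This is an illustration of the idea only, not PUDL's harvesting code: the function name is hypothetical, and ignoring null reports when computing the share is an assumption.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def canonical_value(
    reported: Sequence[Optional[Hashable]], threshold: float = 0.7
) -> Optional[Hashable]:
    """Return the most common non-null value if it makes up at least
    `threshold` of the non-null reports; otherwise return None."""
    # Assumption: null reports are excluded from the denominator.
    non_null = [v for v in reported if v is not None]
    if not non_null:
        return None
    value, count = Counter(non_null).most_common(1)[0]
    return value if count / len(non_null) >= threshold else None


# Hypothetical reports of a plant name: 8 of 10 agree, so a canonical value exists.
consistent = canonical_value(["Plant A"] * 8 + ["Plant A Unit", "Plant A Stn"])

# Hypothetical reports of a state: only 2 of 4 agree, so the result is null.
inconsistent = canonical_value(["CO", "CO", "TX", "NM"])
```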
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
High level description of the technology used by the generator to produce electricity.
The code representing the most predominant type of energy that fuels the generator.
Code for the type of prime mover (e.g. CT, CG)
Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.
Date of the scheduled or effected retirement of the generator.
The operating status of the asset. For generators this is based on which tab the generator was listed in in the EIA 860 spreadsheets.
Total installed (nameplate) capacity, in megawatts.
Simplified fuel type code used in PUDL
Planned effective date of the scheduled retirement of the generator.
Fraction of potential generation that was actually reported for a plant part.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or combines multiple sources because records were aggregated together (mixed).
Average fuel cost per mmBTU of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Indicates whether the generator is associated with a combined heat and power system
The source from which the unit_id_pudl is compiled. The unit_id_pudl comes directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is used to find additional boiler-generator connections).
Can this generator operate while bypassing the heat recovery steam generator?
Indicates whether the generator uses carbon capture technology.
Name of the city.
Whether the generator can co-fire fuels.
County name.
The most recently updated effective date on which the generator is scheduled to start operation
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Indicates whether the generator can deliver power to the transmission grid.
Whether the generator is considered distributed generation
Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas
Primary mode of transport for energy source 1.
Secondary mode of transport for energy source 1.
Tertiary mode of transport for energy source 1.
Primary mode of transport for energy source 2.
Secondary mode of transport for energy source 2.
Tertiary mode of transport for energy source 2.
The code representing the second most predominant type of energy that fuels the generator
The code representing the third most predominant type of energy that fuels the generator
The code representing the fourth most predominant type of energy that fuels the generator
The code representing the fifth most predominant type of energy that fuels the generator
The code representing the sixth most predominant type of energy that fuels the generator
Energy storage capacity in MWh (e.g. for batteries).
Indicates whether or not a generator is a qualifying FERC cogeneration facility.
Indicates whether the generator uses fluidized bed technology
A count of how many different simple energy sources there are associated with a generator.
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
The minimum load at which the generator can operate continuously.
Whether the generator can burn multiple fuels.
The nameplate power factor of the generator.
Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.
Indicates whether a fuel switching generator can switch fuels while operating.
The operating status of the asset.
The date the generator was originally scheduled to be operational
Indicates whether the generator uses other combustion technologies
Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.
Indicates whether there are other modifications planned for the generator.
Whether any part of the generator is owned by a nonutility.
Identifies the ownership for each generator.
Planned effective month that the generator is scheduled to enter operation after the derate modification.
New energy source code for the planned repowered generator.
Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.
Decrease in summer capacity expected to be realized from the derate modification to the equipment.
Increase in summer capacity expected to be realized from the modification to the equipment.
Decrease in winter capacity expected to be realized from the derate modification to the equipment.
Increase in winter capacity expected to be realized from the uprate modification to the equipment.
The expected new nameplate capacity for the generator.
New prime mover for the planned repowered generator.
Planned effective date that the generator is scheduled to enter operation after the repowering is complete.
Planned effective date that the generator is scheduled to enter operation after the uprate modification.
Indicates whether the generator was previously reported as indefinitely postponed or canceled
Indicates whether the generator uses pulverized coal technology
Reactive Power Output (MVAr)
The designation used to identify the price node in RTO/ISO Locational Marginal Price reports
The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report
Indicates whether the generator is part of a solid fuel gasification system
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
Two letter US state abbreviation.
Indicates whether the generator uses stoker technology
Physical street address.
Indicates whether the generator uses subcritical technology
Whether the summer capacity value was an estimate
The net summer capacity.
EIA estimated summer capacity (in MW).
Indicates whether the generator uses supercritical technology
Whether the generator can switch between oil and natural gas.
Indicates whether standby generators (SB status) can be synchronized to the grid.
The minimum amount of time required to bring the unit to full load from shutdown.
IANA timezone name
If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle
Number of wind turbines, or hydrokinetic buoys.
Number of wind turbines, or hydrokinetic buoys.
Indicates whether the generator uses ultra-supercritical technology
The date when the uprate or derate was completed.
Was an uprate or derate completed on this generator during the reporting year?
Whether the winter capacity value was an estimate
The net winter capacity.
EIA estimated winter capacity (in MW).
Five digit US Zip Code.
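The warning about treating the generator ID as a string can be illustrated with a small sketch. The CSV extract and the ID values below are made up for illustration; only the primary key column names come from this page.

```python
import csv
import io

# Hypothetical extract containing the three primary key columns.
raw = io.StringIO(
    "report_date,plant_id_eia,generator_id\n"
    "2020-01-01,1,1\n"
    "2020-01-01,1,01\n"
    "2020-01-01,1,GT1\n"
)
rows = list(csv.DictReader(raw))  # the csv module yields every field as str

# Kept as strings, the three generator IDs remain distinct.
string_ids = {row["generator_id"] for row in rows}


def to_int_or_none(value: str):
    """Naive numeric coercion, shown only to demonstrate what goes wrong."""
    try:
        return int(value)
    except ValueError:
        return None


# Coerced to integers, "1" and "01" collapse and "GT1" is lost entirely.
numeric_ids = {to_int_or_none(row["generator_id"]) for row in rows}
```

With numeric coercion the primary key (report_date, plant_id_eia, generator_id) would no longer be unique for these made-up rows, which is why an explicit string type is the safer choice when loading the table.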
out_eia923__fuel_receipts_costs
Fuel deliveries to power plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)
- Primary key:
This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.
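Because several deliveries can share the same month, plant and fuel, records usually need to be aggregated before they are joined to other monthly tables. The sketch below shows one way to do that with a heat-content-weighted average price. The tuple layout, field names and numbers are hypothetical, and deliveries with redacted prices would need to be excluded first.

```python
from collections import defaultdict

# Hypothetical deliveries:
# (report_date, plant_id_eia, fuel, units_received, mmbtu_per_unit, cost_per_mmbtu)
deliveries = [
    ("2020-01-01", 1, "coal", 1000.0, 20.0, 2.0),
    ("2020-01-01", 1, "coal", 3000.0, 20.0, 3.0),
    ("2020-01-01", 1, "gas", 500.0, 1.0, 4.0),
]

# Accumulate total heat content and total cost per (date, plant, fuel).
totals = defaultdict(lambda: {"mmbtu": 0.0, "cost": 0.0})
for date, plant, fuel, units, mmbtu_per_unit, cost_per_mmbtu in deliveries:
    mmbtu = units * mmbtu_per_unit
    key = (date, plant, fuel)
    totals[key]["mmbtu"] += mmbtu
    totals[key]["cost"] += mmbtu * cost_per_mmbtu

# Weighted average price: total cost divided by total heat content.
monthly_cost_per_mmbtu = {
    key: t["cost"] / t["mmbtu"] for key, t in totals.items()
}
```

Weighting by heat content rather than averaging the prices directly keeps a small, expensive delivery from dominating the monthly figure.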
Usage Warnings
Date column arbitrarily uses the first of the month.
Some values have been redacted.
Time of fuel deliveries is not necessarily connected with time of fuel consumption.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.
Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
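Since the missing prices are not randomly distributed, it can help to check the share of missing prices per fuel before relying on averages. The following is a minimal sketch with made-up records. It assumes redacted prices appear as nulls, and the "fuel" key is a hypothetical stand-in for whichever fuel code column is used.

```python
from collections import defaultdict

# Hypothetical delivery records; None stands in for a redacted price.
records = [
    {"fuel": "gas", "fuel_cost_per_mmbtu": None},
    {"fuel": "gas", "fuel_cost_per_mmbtu": 3.1},
    {"fuel": "gas", "fuel_cost_per_mmbtu": None},
    {"fuel": "coal", "fuel_cost_per_mmbtu": 2.0},
    {"fuel": "coal", "fuel_cost_per_mmbtu": 2.2},
]

# fuel -> [number of missing prices, number of records]
counts = defaultdict(lambda: [0, 0])
for rec in records:
    counts[rec["fuel"]][1] += 1
    if rec["fuel_cost_per_mmbtu"] is None:
        counts[rec["fuel"]][0] += 1

missing_share = {fuel: missing / total for fuel, (missing, total) in counts.items()}
```

The same grouping could be repeated by state or by sector to see where reported prices are too sparse to support a reliable average.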
Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Two letter US state abbreviation.
Purchase type under which receipts occurred in the reporting month. C: Contract, NC: New Contract, S: Spot Purchase, T: Tolling Agreement.
Date contract expires. Format: MMYY.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
Fuel groups used in the Electric Power Monthly
Company that sold the fuel to the plant or, in the case of Natural Gas, pipeline owner.
Quantity of fuel received in tons, barrels, or Mcf.
Heat content of the fuel in millions of Btus per physical unit.
Average fuel cost per mmBTU of heat content in nominal USD.
Fuel cost per mmbtu reported in the EIA bulk electricity data. This is an aggregate average fuel price for a whole state, region, month, sector, etc. Used to fill in missing fuel prices.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or combines multiple sources because records were aggregated together (mixed).
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Mercury content in parts per million (ppm) to the nearest 0.001 ppm.
Transportation mode for the longest distance transported.
Transportation mode for the second longest distance transported.
Contract type for natural gas transportation service.
Contract type for natural gas delivery service.
For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.
For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
MSHA issued mine identifier.
Coal mine name.
State where the coal mine is located. Two letter abbreviation.
County ID from the Federal Information Processing Standard Publication 6-4. This is the county where the coal mine is located.
Type of coal mine.
out_ferc1__yearly_all_plants
Annual time series of plants reported in the steam, small generators, hydro, and pumped storage tables.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 402, 404, 406 and 408)
- Primary key:
record_id
Usage Warnings
Not all columns are originally reported in all of the input plant tables. Expect nulls.
FERC does not require respondents to report unique, non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (e.g. unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
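One way to guard against double counting when a respondent reports both units and a plant total is to flag suspicious rows before summing. The sketch below uses a simple name-based heuristic. The plant names, field names and pattern are hypothetical, and a real check would need manual review because FERC plant names are freeform.

```python
import re

# Heuristic: treat any plant name containing the word "total" as a total row.
TOTAL_PATTERN = re.compile(r"\btotal\b", re.IGNORECASE)

# Hypothetical records from a single respondent and year.
plants = [
    {"plant_name": "Example Station Unit 1", "capacity_mw": 100.0},
    {"plant_name": "Example Station Unit 2", "capacity_mw": 150.0},
    {"plant_name": "Example Station Total", "capacity_mw": 250.0},
]


def looks_like_total(name: str) -> bool:
    """Return True if the freeform plant name appears to describe a total row."""
    return bool(TOTAL_PATTERN.search(name))


# Summing every row counts the plant twice.
naive_sum = sum(p["capacity_mw"] for p in plants)

# Dropping the flagged rows avoids the double count in this example.
filtered_sum = sum(
    p["capacity_mw"] for p in plants if not looks_like_total(p["plant_name"])
)
```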
Additional Details
This table is a concatenation of the following plant tables:
core_ferc1__yearly_steam_plants_sched402
core_ferc1__yearly_hydroelectric_plants_sched406
core_ferc1__yearly_small_plants_sched410
core_ferc1__yearly_pumped_storage_plants_sched408
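The nulls mentioned in the usage warnings follow directly from this concatenation: a column that exists in only one input table is empty for rows that come from the others. A minimal sketch of that effect, using made-up rows and hypothetical column names (only record_id is taken from this page):

```python
# Hypothetical rows from two of the input tables.
steam = [{"record_id": "s1", "plant_type": "steam", "opex_fuel": 10.0}]
hydro = [{"record_id": "h1", "plant_type": "hydro", "opex_water_for_power": 2.0}]


def concatenate(*tables):
    """Stack tables row-wise, filling columns a table lacks with None."""
    columns = []
    for table in tables:
        for row in table:
            for col in row:
                if col not in columns:
                    columns.append(col)
    return [
        {col: row.get(col) for col in columns}
        for table in tables
        for row in table
    ]


combined = concatenate(steam, hydro)
# The hydro row has no fuel expense column, so it is None after concatenation.
hydro_fuel = combined[1]["opex_fuel"]
```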
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL plant ID. May not be constant over time.
Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Annual capital addition into capex_total.
Year-to-date capital addition into capex_total.
Annual capital addition into capex_total per kW.
Annual capital addition into capex_total per MW.
Year-to-date capital addition into capex_total per MW.
Annual capital addition into capex_total per MWh.
Year-to-date capital addition into capex_total per MWh.
Cost of plant: equipment (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Total cost of plant (USD) without retirements.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Plant capacity in MW when not limited by condenser water.
Allowances.
Maintenance of boiler (or reactor) plant.
Cost of coolants and water (nuclear plants only)
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: fuel (USD).
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Miscellaneous steam (or nuclear) expenses.
Maintenance of miscellaneous steam (or nuclear) plant.
Investments in non-fuel production expenses per MWh.
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Production expenses: maintenance of electric plant (USD).
Total operating expenses.
Production expenses: rents (USD).
Steam expenses.
Steam from other sources.
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Steam transferred (Credit).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net plant capability in megawatts.
Hours the plant was connected to load while generating in the report year.
Type of plant.
Identifier indicating original FERC Form 1 source record. Format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Plant capacity in MW when limited by condenser water.
Average fuel cost per mmBTU of heat content in nominal USD.
Type of fuel.
FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.
Production expenses: Maintenance (USD).
Total production expenses, excluding fuel (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: roads, railroads, and bridges (USD).
Net plant capability under the least favorable operating conditions, in megawatts.
Net plant capability under the most favorable operating conditions, in megawatts.
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: hydraulic expenses (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Production expenses: water for power (USD).
The FERC license ID of a project.
Cost of plant: accessory electric equipment (USD).
Cost of plant: miscellaneous power plant equipment (USD).
Cost of plant: water wheels, turbines, and generators (USD).
Energy used for pumping, in megawatt-hours.
Net output for load (net generation - energy used for pumping) in megawatt-hours.
Total production expenses before pumping (USD).
Production expenses: pumped storage (USD).
Production expenses: pumping (USD).
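The record identifier format given in the column descriptions above ({table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}) can be split into its parts. Because the table name may itself contain underscores, the sketch below splits from the right. The example ID is made up, and the code assumes the last five fields never contain underscores.

```python
from typing import NamedTuple


class RecordId(NamedTuple):
    table_name: str
    report_year: int
    report_prd: int
    respondent_id: int
    spplmnt_num: int
    row_number: int


def parse_record_id(record_id: str) -> RecordId:
    """Split a record ID from the right so underscores in table_name survive."""
    table_name, *fields = record_id.rsplit("_", 5)
    if len(fields) != 5:
        raise ValueError(f"Unexpected record_id format: {record_id!r}")
    return RecordId(table_name, *(int(f) for f in fields))


# Hypothetical record ID, for illustration only.
parsed = parse_record_id("f1_steam_2010_12_1_0_1")
```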
out_eia__yearly_generators
Annual time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
report_date, plant_id_eia, generator_id
Usage Warnings
Contains estimated values.
Due to coverage problems in other tables, some generators have heat rate estimates but not fuel cost estimates.
Not all columns are originally reported in or calculable from the input tables. Expect nulls.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This table includes all yearly attributes for all generators reported to EIA-860 and EIA-923.
To provide a complete picture of generator data, this table compiles data from many different EIA tables, including directly reported, imputed and calculated columns. We suggest using this table if you want to explore yearly attributes about generators and would rather use a more complete and denormalized table. If you are more interested in the originally reported values, we recommend searching the core_eia tables for the column you are most interested in.
The calculations of capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh are based on the allocation of net generation, which is reported by plant, prime mover and energy source, to individual generators. That allocation comes from the out_eia923__yearly_generation_fuel_by_generator table.
Yearly heat rates for each generator are estimated using the allocated per-generator estimates of net generation and fuel consumption, together with the core_eia923__monthly_boiler_fuel table, which reports fuel consumed by boiler. Heat rates are needed to estimate the amount of fuel consumed by a generation unit, and thus the fuel cost per MWh generated.
Plant-specific fuel prices are taken from the core_eia923__monthly_fuel_receipts_costs table, which only has ~70% coverage, so some generators with heat rate estimates still lack fuel cost estimates.
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
High level description of the technology used by the generator to produce electricity.
The code representing the most predominant type of energy that fuels the generator.
Code for the type of prime mover (e.g. CT, CG)
Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.
Date of the scheduled or effected retirement of the generator.
The operating status of the asset. For generators this is based on which tab the generator was listed in in the EIA 860 spreadsheets.
Total installed (nameplate) capacity, in megawatts.
Simplified fuel type code used in PUDL
Planned effective date of the scheduled retirement of the generator.
Fraction of potential generation that was actually reported for a plant part.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or combines multiple sources because records were aggregated together (mixed).
Average fuel cost per mmBTU of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Indicates whether the generator is associated with a combined heat and power system
The source from which the unit_id_pudl is compiled. The unit_id_pudl comes directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is used to find additional boiler-generator connections).
Can this generator operate while bypassing the heat recovery steam generator?
Indicates whether the generator uses carbon capture technology.
Name of the city.
Whether the generator can co-fire fuels.
County name.
The most recently updated effective date on which the generator is scheduled to start operation
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Indicates whether the generator can deliver power to the transmission grid.
Whether the generator is considered distributed generation
Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas
Primary mode of transport for energy source 1.
Secondary mode of transport for energy source 1.
Tertiary mode of transport for energy source 1.
Primary mode of transport for energy source 2.
Secondary mode of transport for energy source 2.
Tertiary mode of transport for energy source 2.
The code representing the second most predominant type of energy that fuels the generator
The code representing the third most predominant type of energy that fuels the generator
The code representing the fourth most predominant type of energy that fuels the generator
The code representing the fifth most predominant type of energy that fuels the generator
The code representing the sixth most predominant type of energy that fuels the generator
Energy storage capacity in MWh (e.g. for batteries).
Indicates whether or not a generator is a qualifying FERC cogeneration facility.
Indicates whether the generator uses fluidized bed technology
A count of how many different simple energy sources there are associated with a generator.
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
The minimum load at which the generator can operate continuously.
Whether the generator can burn multiple fuels.
The nameplate power factor of the generator.
Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.
Indicates whether a fuel switching generator can switch fuels while operating.
The operating status of the asset.
The date the generator was originally scheduled to be operational
Indicates whether the generator uses other combustion technologies
Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.
Indicates whether there are other modifications planned for the generator.
Whether any part of the generator is owned by a non-utility.
Identifies the ownership for each generator.
Planned effective month that the generator is scheduled to enter operation after the derate modification.
New energy source code for the planned repowered generator.
Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.
Decrease in summer capacity expected to be realized from the derate modification to the equipment.
Increase in summer capacity expected to be realized from the uprate modification to the equipment.
Decrease in winter capacity expected to be realized from the derate modification to the equipment.
Increase in winter capacity expected to be realized from the uprate modification to the equipment.
The expected new nameplate capacity for the generator.
New prime mover for the planned repowered generator.
Planned effective date that the generator is scheduled to enter operation after the repowering is complete.
Planned effective date that the generator is scheduled to enter operation after the uprate modification.
Indicates whether the generator was previously reported as indefinitely postponed or canceled
Indicates whether the generator uses pulverized coal technology
Reactive Power Output (MVAr)
The designation used to identify the price node in RTO/ISO Locational Marginal Price reports
The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report
Indicates whether the generator is part of a solid fuel gasification system
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
Two letter US state abbreviation.
Indicates whether the generator uses stoker technology
Physical street address.
Indicates whether the generator uses subcritical technology
Whether the summer capacity value was an estimate
The net summer capacity.
EIA estimated summer capacity (in MW).
Indicates whether the generator uses supercritical technology
Whether the generator can switch between oil and natural gas.
Indicates whether standby generators (SB status) can be synchronized to the grid.
The minimum amount of time required to bring the unit to full load from shutdown.
IANA timezone name
If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle
Number of wind turbines, or hydrokinetic buoys.
Number of wind turbines, or hydrokinetic buoys.
Indicates whether the generator uses ultra-supercritical technology
The date when the uprate or derate was completed.
Was an uprate or derate completed on this generator during the reporting year?
Whether the winter capacity value was an estimate
The net winter capacity.
EIA estimated winter capacity (in MW).
Five digit US Zip Code.
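The planned uprate and derate columns above are described as changes to capacity rather than as new capacity values. As a rough illustration of how they might be combined, the sketch below computes an expected post-modification summer capacity. The parameter names are placeholders chosen for this example, not confirmed column names, and two things are assumptions: that derates are recorded as positive magnitudes, and that a missing (None) value means no planned change.

```python
from typing import Optional


def expected_summer_capacity_mw(
    summer_capacity_mw: float,
    planned_uprate_mw: Optional[float] = None,
    planned_derate_mw: Optional[float] = None,
) -> float:
    """Return current summer capacity adjusted for planned changes.

    Illustrative only: assumes derates are positive magnitudes and that
    None means no modification is planned.
    """
    uprate = planned_uprate_mw if planned_uprate_mw is not None else 0.0
    derate = planned_derate_mw if planned_derate_mw is not None else 0.0
    return summer_capacity_mw + uprate - derate


# A 100 MW generator with a planned 5 MW uprate and a 2.5 MW derate.
after_changes = expected_summer_capacity_mw(100.0, 5.0, 2.5)
```

The same arithmetic would apply to the winter capacity columns under the same assumptions.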
out_censusdp1tract__counties
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
Census DP1 -- Profile of General Demographic Characteristics
- Primary key:
county_id_fips
Additional Details
County-level summarized US Census demographic information, along with a geometry describing county boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.
Columns
County ID from the Federal Information Processing Standard Publication 6-4.
Geospatial representation of the feature.
County name.
The functional status (FUNCSTAT) code defines the current functional status of a geographic entity. These codes can be found in the TIGER/Line products, gazetteer files, and other products.
Land area in square meters.
Water area in square meters.
Internal point latitude in decimal degrees.
Internal point longitude in decimal degrees.
Length of the feature's perimeter in degrees.
Area of the feature in square degrees.
DPSF1. Sex and age - Total population
DPSF1. Sex and age - Total population under 5 years
DPSF1. Sex and age - Total population 5 to 9 years
DPSF1. Sex and age - Total population 10 to 14 years
DPSF1. Sex and age - Total population 15 to 19 years
DPSF1. Sex and age - Total population 20 to 24 years
DPSF1. Sex and age - Total population 25 to 29 years
DPSF1. Sex and age - Total population 30 to 34 years
DPSF1. Sex and age - Total population 35 to 39 years
DPSF1. Sex and age - Total population 40 to 44 years
DPSF1. Sex and age - Total population 45 to 49 years
DPSF1. Sex and age - Total population 50 to 54 years
DPSF1. Sex and age - Total population 55 to 59 years
DPSF1. Sex and age - Total population 60 to 64 years
DPSF1. Sex and age - Total population 65 to 69 years
DPSF1. Sex and age - Total population 70 to 74 years
DPSF1. Sex and age - Total population 75 to 79 years
DPSF1. Sex and age - Total population 80 to 84 years
DPSF1. Sex and age - Total population 85 years and over
DPSF1. Sex and age - Male population total
DPSF1. Sex and age - Male population under 5 years
DPSF1. Sex and age - Male population 5 to 9 years
DPSF1. Sex and age - Male population 10 to 14 years
DPSF1. Sex and age - Male population 15 to 19 years
DPSF1. Sex and age - Male population 20 to 24 years
DPSF1. Sex and age - Male population 25 to 29 years
DPSF1. Sex and age - Male population 30 to 34 years
DPSF1. Sex and age - Male population 35 to 39 years
DPSF1. Sex and age - Male population 40 to 44 years
DPSF1. Sex and age - Male population 45 to 49 years
DPSF1. Sex and age - Male population 50 to 54 years
DPSF1. Sex and age - Male population 55 to 59 years
DPSF1. Sex and age - Male population 60 to 64 years
DPSF1. Sex and age - Male population 65 to 69 years
DPSF1. Sex and age - Male population 70 to 74 years
DPSF1. Sex and age - Male population 75 to 79 years
DPSF1. Sex and age - Male population 80 to 84 years
DPSF1. Sex and age - Male population 85 years and over
DPSF1. Sex and age - Female population total
DPSF1. Sex and age - Female population under 5 years
DPSF1. Sex and age - Female population 5 to 9 years
DPSF1. Sex and age - Female population 10 to 14 years
DPSF1. Sex and age - Female population 15 to 19 years
DPSF1. Sex and age - Female population 20 to 24 years
DPSF1. Sex and age - Female population 25 to 29 years
DPSF1. Sex and age - Female population 30 to 34 years
DPSF1. Sex and age - Female population 35 to 39 years
DPSF1. Sex and age - Female population 40 to 44 years
DPSF1. Sex and age - Female population 45 to 49 years
DPSF1. Sex and age - Female population 50 to 54 years
DPSF1. Sex and age - Female population 55 to 59 years
DPSF1. Sex and age - Female population 60 to 64 years
DPSF1. Sex and age - Female population 65 to 69 years
DPSF1. Sex and age - Female population 70 to 74 years
DPSF1. Sex and age - Female population 75 to 79 years
DPSF1. Sex and age - Female population 80 to 84 years
DPSF1. Sex and age - Female population 85 years and over
DPSF2. Median age by sex - Both sexes
DPSF2. Median age by sex - Male
DPSF2. Median age by sex - Female
DPSF3. Sex for population 16 years and over - Total
DPSF3. Sex for population 16 years and over - Male
DPSF3. Sex for population 16 years and over - Female
DPSF4. Sex for population 18 years and over - Total
DPSF4. Sex for population 18 years and over - Male
DPSF4. Sex for population 18 years and over - Female
DPSF5. Sex for population 21 years and over - Total
DPSF5. Sex for population 21 years and over - Male
DPSF5. Sex for population 21 years and over - Female
DPSF6. Sex for population 62 years and over - Total
DPSF6. Sex for population 62 years and over - Male
DPSF6. Sex for population 62 years and over - Female
DPSF7. Sex for population 65 years and over - Total
DPSF7. Sex for population 65 years and over - Male
DPSF7. Sex for population 65 years and over - Female
DPSF8. Race - Total population
DPSF8. Race - Population of one race
DPSF8. Race - White
DPSF8. Race - Black or African American
DPSF8. Race - American Indian and Alaska Native
DPSF8. Race - Asian total
DPSF8. Race - Asian Indian
DPSF8. Race - Chinese
DPSF8. Race - Filipino
DPSF8. Race - Japanese
DPSF8. Race - Korean
DPSF8. Race - Vietnamese
DPSF8. Race - Other Asian
DPSF8. Race - Native Hawaiian and Other Pacific Islander total
DPSF8. Race - Native Hawaiian
DPSF8. Race - Guamanian or Chamorro
DPSF8. Race - Samoan
DPSF8. Race - Other Pacific Islander
DPSF8. Race - Some Other Race
DPSF8. Race - Population of Two or More Races
DPSF8. Race - White; American Indian and Alaska Native
DPSF8. Race - White; Asian
DPSF8. Race - White; Black or African American
DPSF8. Race - White; Some Other Race
DPSF9. Race (total races tallied) - White alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races
DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races
DPSF10. Hispanic or Latino by specific origin - Total population
DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)
DPSF10. Hispanic or Latino by specific origin - Mexican
DPSF10. Hispanic or Latino by specific origin - Puerto Rican
DPSF10. Hispanic or Latino by specific origin - Cuban
DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino
DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino
DPSF11. Hispanic or Latino and race - Total population
DPSF11. Hispanic or Latino and race - Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races
DPSF12. Relationship - Total population
DPSF12. Relationship - In households
DPSF12. Relationship - Householder
DPSF12. Relationship - Spouse
DPSF12. Relationship - Child
DPSF12. Relationship - Own child under 18 years
DPSF12. Relationship - Other relatives
DPSF12. Relationship - Other relatives under 18 years
DPSF12. Relationship - Other relatives 65 years and over
DPSF12. Relationship - Nonrelatives
DPSF12. Relationship - Nonrelatives under 18 years
DPSF12. Relationship - Nonrelatives 65 years and over
DPSF12. Relationship - Unmarried partner
DPSF12. Relationship - In group quarters
DPSF12. Relationship - Institutionalized population
DPSF12. Relationship - Institutionalized population: Male
DPSF12. Relationship - Institutionalized population: Female
DPSF12. Relationship - Noninstitutionalized population
DPSF12. Relationship - Noninstitutionalized population: Male
DPSF12. Relationship - Noninstitutionalized population: Female
DPSF13. Households by type - Total households
DPSF13. Households by type - Family households (families)
DPSF13. Households by type - Family households with own children under 18 years
DPSF13. Households by type - Husband-wife family
DPSF13. Households by type - Husband-wife family with own children under 18 years
DPSF13. Households by type - Male householder, no wife present
DPSF13. Households by type - Male householder, no wife present, with own children under 18 years
DPSF13. Households by type - Female householder, no husband present
DPSF13. Households by type - Female householder, no husband present, with own children under 18 years
DPSF13. Households by type - Nonfamily households
DPSF13. Households by type - Householder living alone
DPSF13. Households by type - Householder living alone: Male
DPSF13. Households by type - Householder living alone: Male 65 years and over
DPSF13. Households by type - Householder living alone: Female
DPSF13. Households by type - Householder living alone: Female 65 years and over
DPSF14. Households with individuals under 18 years - Total
DPSF15. Households with individuals 65 years and over - Total
DPSF16. Average household size - Average household size
DPSF17. Average family size - Average family size
DPSF18. Housing occupancy - Total housing units
DPSF18. Housing occupancy - Occupied housing units
DPSF18. Housing occupancy - Vacant housing units
DPSF18. Housing occupancy - Vacant housing units for rent
DPSF18. Housing occupancy - Vacant housing units rented, not occupied
DPSF18. Housing occupancy - Vacant housing units for sale only
DPSF18. Housing occupancy - Vacant housing units sold, not occupied
DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use
DPSF18. Housing occupancy - All other vacant housing units
DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)
DPSF20. Rental vacancy rate - Rental vacancy rate (percent)
DPSF21. Housing tenure - Total occupied housing units
DPSF21. Housing tenure - Owner-occupied housing units
DPSF21. Housing tenure - Renter-occupied housing units
DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units
DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units
DPSF23. Average household size by tenure - Owner occupied
DPSF23. Average household size by tenure - Renter occupied
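Because the perimeter and area columns are expressed in degrees and square degrees, the land area in square meters is the more natural input for physical quantities. The sketch below derives a population density from the total population and land area described above. The function and parameter names are hypothetical and only for illustration.

```python
SQ_M_PER_SQ_KM = 1_000_000


def population_density_per_sq_km(total_population: int, land_area_sq_m: float) -> float:
    """People per square kilometer of land area (illustrative helper)."""
    if land_area_sq_m <= 0:
        # Guard against features with no land area, which would
        # otherwise divide by zero.
        raise ValueError("land area must be positive")
    return total_population / (land_area_sq_m / SQ_M_PER_SQ_KM)


# 50,000 people on 2,000 square kilometers of land.
density = population_density_per_sq_km(50_000, 2_000_000_000)
```

Water area is deliberately left out of the denominator here; whether to include it depends on the analysis.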
out_censusdp1tract__states
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
Census DP1 -- Profile of General Demographic Characteristics
- Primary key:
state_id_fips
Additional Details
State-level summarized US Census demographic information, along with a geometry describing state boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.
Columns
Two-digit state FIPS code.
Geospatial representation of the feature.
Full name of the state.
Two letter US state abbreviation.
Land area in square meters.
Water area in square meters.
Internal point latitude in decimal degrees.
Internal point longitude in decimal degrees.
Length of the feature's perimeter in degrees.
Area of the feature in square degrees.
DPSF1. Sex and age - Total population
DPSF1. Sex and age - Total population under 5 years
DPSF1. Sex and age - Total population 5 to 9 years
DPSF1. Sex and age - Total population 10 to 14 years
DPSF1. Sex and age - Total population 15 to 19 years
DPSF1. Sex and age - Total population 20 to 24 years
DPSF1. Sex and age - Total population 25 to 29 years
DPSF1. Sex and age - Total population 30 to 34 years
DPSF1. Sex and age - Total population 35 to 39 years
DPSF1. Sex and age - Total population 40 to 44 years
DPSF1. Sex and age - Total population 45 to 49 years
DPSF1. Sex and age - Total population 50 to 54 years
DPSF1. Sex and age - Total population 55 to 59 years
DPSF1. Sex and age - Total population 60 to 64 years
DPSF1. Sex and age - Total population 65 to 69 years
DPSF1. Sex and age - Total population 70 to 74 years
DPSF1. Sex and age - Total population 75 to 79 years
DPSF1. Sex and age - Total population 80 to 84 years
DPSF1. Sex and age - Total population 85 years and over
DPSF1. Sex and age - Male population total
DPSF1. Sex and age - Male population under 5 years
DPSF1. Sex and age - Male population 5 to 9 years
DPSF1. Sex and age - Male population 10 to 14 years
DPSF1. Sex and age - Male population 15 to 19 years
DPSF1. Sex and age - Male population 20 to 24 years
DPSF1. Sex and age - Male population 25 to 29 years
DPSF1. Sex and age - Male population 30 to 34 years
DPSF1. Sex and age - Male population 35 to 39 years
DPSF1. Sex and age - Male population 40 to 44 years
DPSF1. Sex and age - Male population 45 to 49 years
DPSF1. Sex and age - Male population 50 to 54 years
DPSF1. Sex and age - Male population 55 to 59 years
DPSF1. Sex and age - Male population 60 to 64 years
DPSF1. Sex and age - Male population 65 to 69 years
DPSF1. Sex and age - Male population 70 to 74 years
DPSF1. Sex and age - Male population 75 to 79 years
DPSF1. Sex and age - Male population 80 to 84 years
DPSF1. Sex and age - Male population 85 years and over
DPSF1. Sex and age - Female population total
DPSF1. Sex and age - Female population under 5 years
DPSF1. Sex and age - Female population 5 to 9 years
DPSF1. Sex and age - Female population 10 to 14 years
DPSF1. Sex and age - Female population 15 to 19 years
DPSF1. Sex and age - Female population 20 to 24 years
DPSF1. Sex and age - Female population 25 to 29 years
DPSF1. Sex and age - Female population 30 to 34 years
DPSF1. Sex and age - Female population 35 to 39 years
DPSF1. Sex and age - Female population 40 to 44 years
DPSF1. Sex and age - Female population 45 to 49 years
DPSF1. Sex and age - Female population 50 to 54 years
DPSF1. Sex and age - Female population 55 to 59 years
DPSF1. Sex and age - Female population 60 to 64 years
DPSF1. Sex and age - Female population 65 to 69 years
DPSF1. Sex and age - Female population 70 to 74 years
DPSF1. Sex and age - Female population 75 to 79 years
DPSF1. Sex and age - Female population 80 to 84 years
DPSF1. Sex and age - Female population 85 years and over
DPSF2. Median age by sex - Both sexes
DPSF2. Median age by sex - Male
DPSF2. Median age by sex - Female
DPSF3. Sex for population 16 years and over - Total
DPSF3. Sex for population 16 years and over - Male
DPSF3. Sex for population 16 years and over - Female
DPSF4. Sex for population 18 years and over - Total
DPSF4. Sex for population 18 years and over - Male
DPSF4. Sex for population 18 years and over - Female
DPSF5. Sex for population 21 years and over - Total
DPSF5. Sex for population 21 years and over - Male
DPSF5. Sex for population 21 years and over - Female
DPSF6. Sex for population 62 years and over - Total
DPSF6. Sex for population 62 years and over - Male
DPSF6. Sex for population 62 years and over - Female
DPSF7. Sex for population 65 years and over - Total
DPSF7. Sex for population 65 years and over - Male
DPSF7. Sex for population 65 years and over - Female
DPSF8. Race - Total population
DPSF8. Race - Population of one race
DPSF8. Race - White
DPSF8. Race - Black or African American
DPSF8. Race - American Indian and Alaska Native
DPSF8. Race - Asian total
DPSF8. Race - Asian Indian
DPSF8. Race - Chinese
DPSF8. Race - Filipino
DPSF8. Race - Japanese
DPSF8. Race - Korean
DPSF8. Race - Vietnamese
DPSF8. Race - Other Asian
DPSF8. Race - Native Hawaiian and Other Pacific Islander total
DPSF8. Race - Native Hawaiian
DPSF8. Race - Guamanian or Chamorro
DPSF8. Race - Samoan
DPSF8. Race - Other Pacific Islander
DPSF8. Race - Some Other Race
DPSF8. Race - Population of Two or More Races
DPSF8. Race - White; American Indian and Alaska Native
DPSF8. Race - White; Asian
DPSF8. Race - White; Black or African American
DPSF8. Race - White; Some Other Race
DPSF9. Race (total races tallied) - White alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races
DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races
DPSF10. Hispanic or Latino by specific origin - Total population
DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)
DPSF10. Hispanic or Latino by specific origin - Mexican
DPSF10. Hispanic or Latino by specific origin - Puerto Rican
DPSF10. Hispanic or Latino by specific origin - Cuban
DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino
DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino
DPSF11. Hispanic or Latino and race - Total population
DPSF11. Hispanic or Latino and race - Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races
DPSF12. Relationship - Total population
DPSF12. Relationship - In households
DPSF12. Relationship - Householder
DPSF12. Relationship - Spouse
DPSF12. Relationship - Child
DPSF12. Relationship - Own child under 18 years
DPSF12. Relationship - Other relatives
DPSF12. Relationship - Other relatives under 18 years
DPSF12. Relationship - Other relatives 65 years and over
DPSF12. Relationship - Nonrelatives
DPSF12. Relationship - Nonrelatives under 18 years
DPSF12. Relationship - Nonrelatives 65 years and over
DPSF12. Relationship - Unmarried partner
DPSF12. Relationship - In group quarters
DPSF12. Relationship - Institutionalized population
DPSF12. Relationship - Institutionalized population: Male
DPSF12. Relationship - Institutionalized population: Female
DPSF12. Relationship - Noninstitutionalized population
DPSF12. Relationship - Noninstitutionalized population: Male
DPSF12. Relationship - Noninstitutionalized population: Female
DPSF13. Households by type - Total households
DPSF13. Households by type - Family households (families)
DPSF13. Households by type - Family households with own children under 18 years
DPSF13. Households by type - Husband-wife family
DPSF13. Households by type - Husband-wife family with own children under 18 years
DPSF13. Households by type - Male householder, no wife present
DPSF13. Households by type - Male householder, no wife present, with own children under 18 years
DPSF13. Households by type - Female householder, no husband present
DPSF13. Households by type - Female householder, no husband present, with own children under 18 years
DPSF13. Households by type - Nonfamily households
DPSF13. Households by type - Householder living alone
DPSF13. Households by type - Householder living alone: Male
DPSF13. Households by type - Householder living alone: Male 65 years and over
DPSF13. Households by type - Householder living alone: Female
DPSF13. Households by type - Householder living alone: Female 65 years and over
DPSF14. Households with individuals under 18 years - Total
DPSF15. Households with individuals 65 years and over - Total
DPSF16. Average household size - Average household size
DPSF17. Average family size - Average family size
DPSF18. Housing occupancy - Total housing units
DPSF18. Housing occupancy - Occupied housing units
DPSF18. Housing occupancy - Vacant housing units
DPSF18. Housing occupancy - Vacant housing units for rent
DPSF18. Housing occupancy - Vacant housing units rented, not occupied
DPSF18. Housing occupancy - Vacant housing units for sale only
DPSF18. Housing occupancy - Vacant housing units sold, not occupied
DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use
DPSF18. Housing occupancy - All other vacant housing units
DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)
DPSF20. Rental vacancy rate - Rental vacancy rate (percent)
DPSF21. Housing tenure - Total occupied housing units
DPSF21. Housing tenure - Owner-occupied housing units
DPSF21. Housing tenure - Renter-occupied housing units
DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units
DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units
DPSF23. Average household size by tenure - Owner occupied
DPSF23. Average household size by tenure - Renter occupied
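The DPSF1 columns above split total population by sex and into 18 age bands, so in principle the male and female totals, and the age bands, should each sum to the total population. A minimal sketch of that sanity check follows. The function name and argument layout are assumptions for this example, and it presumes the values have already been read from a row of the table.

```python
from typing import List, Sequence


def check_dpsf1_totals(
    total: int, male: int, female: int, age_bands: Sequence[int]
) -> List[str]:
    """Return a list of human-readable problems; empty if consistent."""
    problems = []
    if male + female != total:
        problems.append(
            f"male + female = {male + female}, expected total {total}"
        )
    if sum(age_bands) != total:
        problems.append(
            f"age bands sum to {sum(age_bands)}, expected total {total}"
        )
    return problems


# Toy numbers, not real Census values: 18 age bands of 10 people each.
issues = check_dpsf1_totals(total=180, male=90, female=90, age_bands=[10] * 18)
```

An empty list means both identities hold for that row; any mismatch is reported as text rather than raised, so a caller can collect problems across many rows.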
out_censusdp1tract__tracts
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
Census DP1 -- Profile of General Demographic Characteristics
- Primary key:
tract_id_fips
Additional Details
Census tract-level summarized US Census demographic information, along with a geometry describing tract boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.
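Tract identifiers nest inside county and state identifiers, which makes it possible to relate tracts to the county and state tables. The sketch below assumes the standard Census GEOID layout of 2 state digits, 3 county digits, and 6 tract digits. The `TractFips` type, `split_tract_fips` function, and `tract_code` field are hypothetical names for this example, and the sample identifier is only a string of illustrative digits.

```python
from typing import NamedTuple


class TractFips(NamedTuple):
    state_id_fips: str   # first 2 digits
    county_id_fips: str  # first 5 digits (state + county)
    tract_code: str      # last 6 digits


def split_tract_fips(tract_id_fips: str) -> TractFips:
    """Split a tract FIPS string into its state, county, and tract parts."""
    if len(tract_id_fips) != 11 or not tract_id_fips.isdigit():
        raise ValueError(f"expected 11 digits, got {tract_id_fips!r}")
    return TractFips(
        state_id_fips=tract_id_fips[:2],
        county_id_fips=tract_id_fips[:5],
        tract_code=tract_id_fips[5:],
    )


parts = split_tract_fips("08013012100")
```

The identifiers are kept as strings throughout so that leading zeros are not lost, which would happen if they were parsed as integers.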
Columns
Census tract 11-digit FIPS code.
Geospatial representation of the feature.
Census tract legal/statistical area description
Land area in square meters.
Water area in square meters.
Internal point latitude in decimal degrees.
Internal point longitude in decimal degrees.
Length of the feature's perimeter in degrees.
Area of the feature in square degrees.
DPSF1. Sex and age - Total population
DPSF1. Sex and age - Total population under 5 years
DPSF1. Sex and age - Total population 5 to 9 years
DPSF1. Sex and age - Total population 10 to 14 years
DPSF1. Sex and age - Total population 15 to 19 years
DPSF1. Sex and age - Total population 20 to 24 years
DPSF1. Sex and age - Total population 25 to 29 years
DPSF1. Sex and age - Total population 30 to 34 years
DPSF1. Sex and age - Total population 35 to 39 years
DPSF1. Sex and age - Total population 40 to 44 years
DPSF1. Sex and age - Total population 45 to 49 years
DPSF1. Sex and age - Total population 50 to 54 years
DPSF1. Sex and age - Total population 55 to 59 years
DPSF1. Sex and age - Total population 60 to 64 years
DPSF1. Sex and age - Total population 65 to 69 years
DPSF1. Sex and age - Total population 70 to 74 years
DPSF1. Sex and age - Total population 75 to 79 years
DPSF1. Sex and age - Total population 80 to 84 years
DPSF1. Sex and age - Total population 85 years and over
DPSF1. Sex and age - Male population total
DPSF1. Sex and age - Male population under 5 years
DPSF1. Sex and age - Male population 5 to 9 years
DPSF1. Sex and age - Male population 10 to 14 years
DPSF1. Sex and age - Male population 15 to 19 years
DPSF1. Sex and age - Male population 20 to 24 years
DPSF1. Sex and age - Male population 25 to 29 years
DPSF1. Sex and age - Male population 30 to 34 years
DPSF1. Sex and age - Male population 35 to 39 years
DPSF1. Sex and age - Male population 40 to 44 years
DPSF1. Sex and age - Male population 45 to 49 years
DPSF1. Sex and age - Male population 50 to 54 years
DPSF1. Sex and age - Male population 55 to 59 years
DPSF1. Sex and age - Male population 60 to 64 years
DPSF1. Sex and age - Male population 65 to 69 years
DPSF1. Sex and age - Male population 70 to 74 years
DPSF1. Sex and age - Male population 75 to 79 years
DPSF1. Sex and age - Male population 80 to 84 years
DPSF1. Sex and age - Male population 85 years and over
DPSF1. Sex and age - Female population total
DPSF1. Sex and age - Female population under 5 years
DPSF1. Sex and age - Female population 5 to 9 years
DPSF1. Sex and age - Female population 10 to 14 years
DPSF1. Sex and age - Female population 15 to 19 years
DPSF1. Sex and age - Female population 20 to 24 years
DPSF1. Sex and age - Female population 25 to 29 years
DPSF1. Sex and age - Female population 30 to 34 years
DPSF1. Sex and age - Female population 35 to 39 years
DPSF1. Sex and age - Female population 40 to 44 years
DPSF1. Sex and age - Female population 45 to 49 years
DPSF1. Sex and age - Female population 50 to 54 years
DPSF1. Sex and age - Female population 55 to 59 years
DPSF1. Sex and age - Female population 60 to 64 years
DPSF1. Sex and age - Female population 65 to 69 years
DPSF1. Sex and age - Female population 70 to 74 years
DPSF1. Sex and age - Female population 75 to 79 years
DPSF1. Sex and age - Female population 80 to 84 years
DPSF1. Sex and age - Female population 85 years and over
DPSF2. Median age by sex - Both sexes
DPSF2. Median age by sex - Male
DPSF2. Median age by sex - Female
DPSF3. Sex for population 16 years and over - Total
DPSF3. Sex for population 16 years and over - Male
DPSF3. Sex for population 16 years and over - Female
DPSF4. Sex for population 18 years and over - Total
DPSF4. Sex for population 18 years and over - Male
DPSF4. Sex for population 18 years and over - Female
DPSF5. Sex for population 21 years and over - Total
DPSF5. Sex for population 21 years and over - Male
DPSF5. Sex for population 21 years and over - Female
DPSF6. Sex for population 62 years and over - Total
DPSF6. Sex for population 62 years and over - Male
DPSF6. Sex for population 62 years and over - Female
DPSF7. Sex for population 65 years and over - Total
DPSF7. Sex for population 65 years and over - Male
DPSF7. Sex for population 65 years and over - Female
DPSF8. Race - Total population
DPSF8. Race - Population of one race
DPSF8. Race - White
DPSF8. Race - Black or African American
DPSF8. Race - American Indian and Alaska Native
DPSF8. Race - Asian total
DPSF8. Race - Asian Indian
DPSF8. Race - Chinese
DPSF8. Race - Filipino
DPSF8. Race - Japanese
DPSF8. Race - Korean
DPSF8. Race - Vietnamese
DPSF8. Race - Other Asian
DPSF8. Race - Native Hawaiian and Other Pacific Islander total
DPSF8. Race - Native Hawaiian
DPSF8. Race - Guamanian or Chamorro
DPSF8. Race - Samoan
DPSF8. Race - Other Pacific Islander
DPSF8. Race - Some Other Race
DPSF8. Race - Population of Two or More Races
DPSF8. Race - White; American Indian and Alaska Native
DPSF8. Race - White; Asian
DPSF8. Race - White; Black or African American
DPSF8. Race - White; Some Other Race
DPSF9. Race (total races tallied) - White alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races
DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races
DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races
DPSF10. Hispanic or Latino by specific origin - Total population
DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)
DPSF10. Hispanic or Latino by specific origin - Mexican
DPSF10. Hispanic or Latino by specific origin - Puerto Rican
DPSF10. Hispanic or Latino by specific origin - Cuban
DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino
DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino
DPSF11. Hispanic or Latino and race - Total population
DPSF11. Hispanic or Latino and race - Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone
DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races
DPSF12. Relationship - Total population
DPSF12. Relationship - In households
DPSF12. Relationship - Householder
DPSF12. Relationship - Spouse
DPSF12. Relationship - Child
DPSF12. Relationship - Own child under 18 years
DPSF12. Relationship - Other relatives
DPSF12. Relationship - Other relatives under 18 years
DPSF12. Relationship - Other relatives 65 years and over
DPSF12. Relationship - Nonrelatives
DPSF12. Relationship - Nonrelatives under 18 years
DPSF12. Relationship - Nonrelatives 65 years and over
DPSF12. Relationship - Unmarried partner
DPSF12. Relationship - In group quarters
DPSF12. Relationship - Institutionalized population
DPSF12. Relationship - Institutionalized population: Male
DPSF12. Relationship - Institutionalized population: Female
DPSF12. Relationship - Noninstitutionalized population
DPSF12. Relationship - Noninstitutionalized population: Male
DPSF12. Relationship - Noninstitutionalized population: Female
DPSF13. Households by type - Total households
DPSF13. Households by type - Family households (families)
DPSF13. Households by type - Family households with own children under 18 years
DPSF13. Households by type - Husband-wife family
DPSF13. Households by type - Husband-wife family with own children under 18 years
DPSF13. Households by type - Male householder, no wife present
DPSF13. Households by type - Male householder, no wife present, with own children under 18 years
DPSF13. Households by type - Female householder, no husband present
DPSF13. Households by type - Female householder, no husband present, with own children under 18 years
DPSF13. Households by type - Nonfamily households
DPSF13. Households by type - Householder living alone
DPSF13. Households by type - Householder living alone: Male
DPSF13. Households by type - Householder living alone: Male 65 years and over
DPSF13. Households by type - Householder living alone: Female
DPSF13. Households by type - Householder living alone: Female 65 years and over
DPSF14. Households with individuals under 18 years - Total
DPSF15. Households with individuals 65 years and over - Total
DPSF16. Average household size - Average household size
DPSF17. Average family size - Average family size
DPSF18. Housing occupancy - Total housing units
DPSF18. Housing occupancy - Occupied housing units
DPSF18. Housing occupancy - Vacant housing units
DPSF18. Housing occupancy - Vacant housing units for rent
DPSF18. Housing occupancy - Vacant housing units rented, not occupied
DPSF18. Housing occupancy - Vacant housing units for sale only
DPSF18. Housing occupancy - Vacant housing units sold, not occupied
DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use
DPSF18. Housing occupancy - All other vacant housing units
DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)
DPSF20. Rental vacancy rate - Rental vacancy rate (percent)
DPSF21. Housing tenure - Total occupied housing units
DPSF21. Housing tenure - Owner-occupied housing units
DPSF21. Housing tenure - Renter-occupied housing units
DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units
DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units
DPSF23. Average household size by tenure - Owner occupied
DPSF23. Average household size by tenure - Renter occupied
out_eia860__yearly_emissions_control_equipment
Annual time series of the cost, type, operating status, retirement date, and install year of emissions control equipment reported to EIA.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
report_year, plant_id_eia, emission_control_id_pudl
Additional Details
Includes control ids for sulfur dioxide (SO2), particulate matter, mercury, nitrogen oxide (NOX), and acid (HCl) gas monitoring.
This denormalized version includes plant name, utility id, pudl id, and utility name columns.
Columns
Four-digit year in which the data was reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
A PUDL-generated ID used to distinguish emission control units in the same report year and plant id. This ID should not be used to track units over time or between plants.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Short code indicating the type of emission control equipment installed.
The operating status of the asset.
The operating status of the asset. For generators this is based on which tab the generator was listed in in the EIA 860 workbook.
Mercury control identification number. This ID is not a unique identifier.
Nitrogen oxide control identification number. This ID is not a unique identifier.
Particulate matter control identification number. This ID is not a unique identifier.
Sulfur dioxide control identification number. This ID is not a unique identifier.
Indicates whether the emissions control equipment controls acid (HCl) gas.
The total cost to install a piece of emission control equipment.
The date a piece of emissions control equipment began operating. Derived from month and year columns in the raw data.
The expected or actual retirement date for a piece of emissions control equipment. Derived from month and year columns in the raw data.
out_eia860__yearly_ownership
Annual time series of generator ownership. Includes only jointly or third-party owned generators.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 4)
- Primary key:
report_date, plant_id_eia, generator_id, owner_utility_id_eia
Additional Details
Denormalized to include plant and utility names and other associated IDs.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA utility Identification number for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility.
The EIA utility Identification number for the operator utility.
A manually assigned PUDL utility ID for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility. May not be stable over time.
The name of the EIA owner utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Two letter ISO-3166 political subdivision code.
City of owner.
Three letter ISO-3166 country code.
Street address of owner.
Zip code of owner.
Proportion of generator ownership attributable to this utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
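The note above about treating generator IDs as strings can be illustrated with a minimal sketch. It assumes a hypothetical CSV extract of this table; the column name fraction_owned is an assumption (only the primary key columns are named in the table description), and the sample rows are invented for illustration. The sketch keeps generator_id as text and checks whether the ownership fractions reported for each generator sum to one.

```python
import csv
import io
from collections import defaultdict

# Invented sample rows; "fraction_owned" is an assumed column name.
SAMPLE = """report_date,plant_id_eia,generator_id,owner_utility_id_eia,fraction_owned
2022-01-01,3,0001,195,0.6
2022-01-01,3,0001,7140,0.4
2022-01-01,3,1,195,1.0
2022-01-01,3,GT1A,195,0.5
"""


def ownership_totals(csv_text):
    """Sum ownership fractions per (report_date, plant_id_eia, generator_id)."""
    totals = defaultdict(float)
    for row in csv.DictReader(io.StringIO(csv_text)):
        # csv yields strings, so generator "0001" stays distinct from "1"
        # and "GT1A" does not raise a conversion error.
        key = (row["report_date"], int(row["plant_id_eia"]), row["generator_id"])
        totals[key] += float(row["fraction_owned"])
    return dict(totals)


totals = ownership_totals(SAMPLE)
# Generators whose reported owners do not account for the whole unit.
incomplete = {k: v for k, v in totals.items() if abs(v - 1.0) > 1e-6}
```

Casting generator_id to an integer would merge "0001" with "1" and fail outright on "GT1A", which is why the key keeps it as text.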
out_eia861__yearly_balancing_authority_service_territory
Annual time series of counties in balancing authority service territories.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
balancing_authority_id_eia, report_date, county_id_fips, county
Columns
County ID from the Federal Information Processing Standard Publication 6-4.
County name as specified in Census DP1 Data.
County population, sourced from Census DP1 data.
County area in km2.
Date reported.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
Two letter US state abbreviation.
County name.
Two digit state FIPS code.
out_eia861__yearly_utility_service_territory
Annual time series of counties in utility service territories.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
utility_id_eia, report_date, county_id_fips
Additional Details
Contains additional information about counties.
Columns
County ID from the Federal Information Processing Standard Publication 6-4.
County name as specified in Census DP1 Data.
County population, sourced from Census DP1 data.
County area in km2.
Date reported.
The EIA Utility Identification number.
Two letter US state abbreviation.
County name.
Two digit state FIPS code.
out_eia923__boiler_fuel
Boiler fuel consumption and emissions.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.
This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
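As a rough illustration of the heat rate calculation mentioned above, the sketch below multiplies fuel quantity by heat content per unit to get total heat input, then divides by net generation. It assumes the boiler-to-generator association is already known, which the text notes is often not the case. The field names fuel_consumed_units and fuel_mmbtu_per_unit are assumptions, and the numbers are invented.

```python
def total_heat_mmbtu(record):
    """Heat input of one boiler fuel record: physical units x MMBtu per unit."""
    return record["fuel_consumed_units"] * record["fuel_mmbtu_per_unit"]


def heat_rate_mmbtu_per_mwh(boiler_records, net_generation_mwh):
    """Heat rate for a unit, or None when generation is missing or not positive."""
    if net_generation_mwh is None or net_generation_mwh <= 0:
        return None
    heat = sum(total_heat_mmbtu(r) for r in boiler_records)
    return heat / net_generation_mwh


# Invented example: one month of coal (tons) and gas (Mcf) burned by the
# boilers assumed to feed a single generator.
records = [
    {"fuel_consumed_units": 10000.0, "fuel_mmbtu_per_unit": 20.0},
    {"fuel_consumed_units": 5000.0, "fuel_mmbtu_per_unit": 1.0},
]
heat_rate = heat_rate_mmbtu_per_mwh(records, net_generation_mwh=20000.0)
```

Returning None rather than dividing by zero keeps months with no reported generation from producing infinite or negative heat rates.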
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Alphanumeric boiler ID.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Code for the type of prime mover (e.g. CT, CG)
Simplified fuel type code used in PUDL
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__generation
Net electricity generated by power plant generators.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, generator_id, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
Not all IDs are present.
Table has known low coverage - either geographic or temporal or otherwise.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel table appears in this table, due to the different reporting requirements imposed on different types and sizes of generators.
Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
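The canonicalization rule described above can be sketched in a few lines. This is not PUDL's implementation, only an illustration of the stated rule: take the most frequently reported value and keep it only if it reaches the 70% share. Whether nulls count toward the denominator is an assumption here (they are excluded), and the function name canonical_value is hypothetical.

```python
from collections import Counter


def canonical_value(reported, min_share=0.7):
    """Return the most common non-null value if it meets min_share, else None."""
    values = [v for v in reported if v is not None]  # assumption: ignore nulls
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count / len(values) >= min_share else None


# Invented example: a plant name reported ten times across EIA tables.
consistent = canonical_value(["Plant A"] * 8 + ["Plant A Inc"] * 2)
inconsistent = canonical_value(["Plant A"] * 6 + ["Plant A Inc"] * 4)
```

In the second call no value reaches the threshold, so the attribute would appear as null, matching the behavior described in the paragraph above.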
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__generation_fuel_combined
Generation and fuel consumption of all generation units.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, prime_mover_code, energy_source_code
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear tables, with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.
This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator, along with their capacities, to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables, in order to calculate capacity factors, heat rates, and the marginal cost of electricity.
The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
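The attribution of plant-level generation to individual generators mentioned above can be pictured with a deliberately simplified sketch. It splits one plant, prime mover, and energy source total across generators in proportion to capacity. The actual allocation also takes reported fuels and directly reported generation into account, so this is an assumption-laden illustration only, and the function name allocate_by_capacity is hypothetical.

```python
def allocate_by_capacity(total_mwh, capacities_mw):
    """Split a reported total across generators in proportion to capacity.

    capacities_mw maps generator_id (a string) to capacity in MW.
    """
    total_capacity = sum(capacities_mw.values())
    if total_capacity <= 0:
        # No usable capacity data: leave the allocation undefined.
        return {gen: None for gen in capacities_mw}
    return {
        gen: total_mwh * cap / total_capacity
        for gen, cap in capacities_mw.items()
    }


# Invented example: 3000 MWh reported for one plant and prime mover,
# shared by two generators of 100 MW and 50 MW.
allocated = allocate_by_capacity(3000.0, {"1": 100.0, "2": 50.0})
```

The allocated values always sum back to the reported total, which is the property that matters when comparing against the plant-level records.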
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.
Code for the type of prime mover (e.g. CT, CG)
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Consumption for electric generation of the fuel type in physical unit.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__monthly_boiler_fuel
Monthly time series of boiler fuel consumption and emissions.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.
This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
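The usage warning about respondents who report a full year of fuel consumption in December suggests a simple screening step before treating values as monthly. The heuristic below is an assumption, not a documented PUDL rule: it flags a boiler-year when December is the only month with a nonzero, non-null value. The function name looks_annual_only is hypothetical.

```python
def looks_annual_only(monthly_units):
    """Flag a year whose only nonzero fuel consumption falls in December.

    monthly_units maps month number (1-12) to fuel consumed, or None if missing.
    """
    december = monthly_units.get(12)
    earlier = [monthly_units.get(month) for month in range(1, 12)]
    # Zero and None both count as "nothing reported" for this heuristic.
    return bool(december) and not any(earlier)


# Invented examples.
annual_style = looks_annual_only({12: 1200.0})
monthly_style = looks_annual_only({m: 100.0 for m in range(1, 13)})
```

A flagged record is only a candidate: a boiler that genuinely ran in December alone would look the same, so the flag is best used to prompt a closer look rather than to redistribute values automatically.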
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Alphanumeric boiler ID.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Code for the type of prime mover (e.g. CT, CG)
Simplified fuel type code used in PUDL
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__monthly_fuel_receipts_costs
Monthly time series of fuel deliveries to power plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)
- Primary key:
This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.
Usage Warnings
Date column arbitrarily uses the first of the month.
Some values have been redacted.
Time of fuel deliveries is not necessarily connected with time of fuel consumption.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.
Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
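Because each record is a single delivery and prices may be redacted, a common first step is to roll deliveries up to one price per plant, month, and fuel. The sketch below computes a heat-weighted average cost per MMBtu and skips redacted prices. The field names (fuel_received_units, fuel_mmbtu_per_unit, fuel_cost_per_mmbtu, fuel_type_code_pudl) are assumptions, as are the sample values.

```python
from collections import defaultdict


def monthly_fuel_cost(deliveries):
    """Heat-weighted average cost per MMBtu by (plant, month, fuel type)."""
    heat = defaultdict(float)
    cost = defaultdict(float)
    for d in deliveries:
        if d["fuel_cost_per_mmbtu"] is None:
            continue  # redacted price: leave it out rather than guess
        key = (d["plant_id_eia"], d["report_date"], d["fuel_type_code_pudl"])
        mmbtu = d["fuel_received_units"] * d["fuel_mmbtu_per_unit"]
        heat[key] += mmbtu
        cost[key] += mmbtu * d["fuel_cost_per_mmbtu"]
    return {key: cost[key] / heat[key] for key in heat if heat[key] > 0}


# Invented example: two coal deliveries and one gas delivery with a
# redacted price, all to the same plant in the same month.
deliveries = [
    {"plant_id_eia": 3, "report_date": "2022-01-01", "fuel_type_code_pudl": "coal",
     "fuel_received_units": 1000.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 2.0},
    {"plant_id_eia": 3, "report_date": "2022-01-01", "fuel_type_code_pudl": "coal",
     "fuel_received_units": 3000.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 3.0},
    {"plant_id_eia": 3, "report_date": "2022-01-01", "fuel_type_code_pudl": "gas",
     "fuel_received_units": 5000.0, "fuel_mmbtu_per_unit": 1.0, "fuel_cost_per_mmbtu": None},
]
prices = monthly_fuel_cost(deliveries)
```

Weighting by heat content rather than by delivery count keeps a small, expensive shipment from dominating the monthly price. Since redactions are not randomly distributed, groups that drop out entirely (gas in this example) indicate missing data, not zero cost.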
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Two letter US state abbreviation.
Simplified fuel type code used in PUDL
Quantity of fuel received in tons, barrels, or Mcf.
Heat content of the fuel in millions of Btus per physical unit.
Average fuel cost per mmBTU of heat content in nominal USD.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or, when records are aggregated together and contain multiple sources, is a combination of these (mixed).
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Mercury content in parts per million (ppm) to the nearest 0.001 ppm.
For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.
For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__monthly_generation
Monthly time series of net electricity generated by power plant generators.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, generator_id, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
Not all IDs are present.
Table has known low coverage - either geographic or temporal or otherwise.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel appears in this table due to the different reporting requirements imposed on different types and sizes of generators.
Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__monthly_generation_fuel_by_generator
Monthly time series of estimated net generation and fuel consumption by generator.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
report_date, plant_id_eia, generator_id
Usage Warnings
Contains estimated values.
The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.
The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:
The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.
The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.
The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.
The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.
In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in the out_eia923__monthly_generation_fuel_by_generator_energy_source table to the generator level. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
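The aggregation step described above amounts to summing the allocated values over prime mover and energy source for each generator and month. The following is a rough sketch rather than PUDL's implementation. The value column names (`net_generation_mwh`, `fuel_consumed_mmbtu`), the sample records, and the helper `aggregate_to_generator` are assumptions for illustration; the grouping key is the primary key listed for this table.

```python
from collections import defaultdict

# Hypothetical allocated records at the generator / prime mover / energy source level.
allocated = [
    {"report_date": "2020-01-01", "plant_id_eia": 1, "generator_id": "1A",
     "prime_mover_code": "ST", "energy_source_code": "BIT",
     "net_generation_mwh": 900.0, "fuel_consumed_mmbtu": 9000.0},
    {"report_date": "2020-01-01", "plant_id_eia": 1, "generator_id": "1A",
     "prime_mover_code": "ST", "energy_source_code": "DFO",
     "net_generation_mwh": 100.0, "fuel_consumed_mmbtu": 1000.0},
    {"report_date": "2020-01-01", "plant_id_eia": 1, "generator_id": "2",
     "prime_mover_code": "GT", "energy_source_code": "NG",
     "net_generation_mwh": 500.0, "fuel_consumed_mmbtu": 5500.0},
]

# Assumed names for the columns being summed.
VALUE_COLUMNS = ("net_generation_mwh", "fuel_consumed_mmbtu")


def aggregate_to_generator(rows):
    """Sum value columns by (report_date, plant_id_eia, generator_id)."""
    totals = defaultdict(lambda: dict.fromkeys(VALUE_COLUMNS, 0.0))
    for row in rows:
        key = (row["report_date"], row["plant_id_eia"], row["generator_id"])
        for column in VALUE_COLUMNS:
            totals[key][column] += row[column]
    return dict(totals)


by_generator = aggregate_to_generator(allocated)
```

In this sketch generator `1A` burns two energy sources, and both records collapse into a single generator-level row, so the distinction between primary and secondary energy sources is lost.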
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Net electricity generation for the specified period in megawatt-hours (MWh).
out_eia923__monthly_generation_fuel_by_generator_energy_source
Monthly time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code
Usage Warnings
Contains estimated values.
The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Contains information from multiple raw inputs.
Additional Details
The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:
The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.
The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.
The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.
The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.
In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.
The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
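To make the idea of an allocation fraction concrete, here is a heavily simplified sketch. It is not the PUDL allocation algorithm, which handles many more cases. It assumes a single plant, prime mover, and energy source group, and it assumes that reported generator-level net generation is used as the weight when every generator in the group reports it, with capacity as the fallback. The function name and the dictionary keys `net_generation_mwh` and `capacity_mw` are hypothetical.

```python
def allocation_fractions(generators):
    """Return each generator's share of a plant-level total.

    Weights are reported net generation when all generators have it,
    otherwise capacity. Both rules are illustrative assumptions.
    """
    if all(g["net_generation_mwh"] is not None for g in generators):
        weights = {g["generator_id"]: g["net_generation_mwh"] for g in generators}
    else:
        weights = {g["generator_id"]: g["capacity_mw"] for g in generators}
    total = sum(weights.values())
    if total == 0:
        # Assumed fallback: split equally when there is nothing to weight by.
        return {gen_id: 1 / len(weights) for gen_id in weights}
    return {gen_id: weight / total for gen_id, weight in weights.items()}


# Two generators with no generator-level reporting, so capacity is the weight.
generators = [
    {"generator_id": "1", "capacity_mw": 300.0, "net_generation_mwh": None},
    {"generator_id": "2", "capacity_mw": 100.0, "net_generation_mwh": None},
]
fractions = allocation_fractions(generators)

# Apply the fractions to a hypothetical plant-level total of 1000 MWh.
allocated_mwh = {gen_id: 1000.0 * share for gen_id, share in fractions.items()}
```

With these inputs the 300 MW generator receives three quarters of the plant total and the 100 MW generator receives the rest. The same fractions could be applied to fuel consumption.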
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Code for the type of prime mover (e.g. CT, CG)
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
out_eia923__monthly_generation_fuel_combined
Monthly time series of generation and fuel consumption of all generation units.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, prime_mover_code, energy_source_code
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.
This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.
The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
Code for the type of prime mover (e.g. CT, CG)
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Consumption for electric generation of the fuel type in physical unit.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__yearly_boiler_fuel
Annual time series of boiler fuel consumption and emissions.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.
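As a worked illustration of the heat rate calculation mentioned above: heat rate is the heat input divided by net generation, and the heat input is fuel quantity times heat content per unit. The sketch below assumes the fuel has already been matched to the generation it produced, which this table alone does not provide. The function and parameter names are hypothetical.

```python
def heat_rate_mmbtu_per_mwh(fuel_consumed_units, fuel_mmbtu_per_unit, net_generation_mwh):
    """Heat rate in MMBtu per MWh, or None when it cannot be computed.

    Inputs are assumed to cover the same unit and the same period.
    """
    if None in (fuel_consumed_units, fuel_mmbtu_per_unit, net_generation_mwh):
        return None
    if net_generation_mwh <= 0:
        # Zero or negative net generation gives no meaningful heat rate.
        return None
    heat_input_mmbtu = fuel_consumed_units * fuel_mmbtu_per_unit
    return heat_input_mmbtu / net_generation_mwh


# 500 tons of coal at 20 MMBtu per ton, producing 1000 MWh.
example_heat_rate = heat_rate_mmbtu_per_mwh(500.0, 20.0, 1000.0)
```

Here the heat input is 10,000 MMBtu, so `example_heat_rate` is 10.0 MMBtu/MWh.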
This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Alphanumeric boiler ID.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Code for the type of prime mover (e.g. CT, CG)
Simplified fuel type code used in PUDL
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__yearly_fuel_receipts_costs
Annual time series of fuel deliveries to power plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)
- Primary key:
This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.
Usage Warnings
Date column arbitrarily uses the first of the month.
Some values have been redacted.
Time of fuel deliveries is not necessarily connected with time of fuel consumption.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.
Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
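Because each record is an individual delivery and some prices are redacted, any average fuel cost has to be weighted by heat content and has to decide what to do with missing prices. The sketch below simply skips redacted deliveries, which is an assumption and not necessarily how PUDL fills or aggregates prices. The key names `fuel_received_units`, `fuel_mmbtu_per_unit`, and `fuel_cost_per_mmbtu` and the function name are used here for illustration.

```python
def average_fuel_cost_per_mmbtu(deliveries):
    """Heat-content-weighted average price over deliveries with a known price."""
    total_cost = 0.0
    total_mmbtu = 0.0
    for delivery in deliveries:
        price = delivery["fuel_cost_per_mmbtu"]
        if price is None:
            # Redacted price: excluded from both numerator and denominator.
            continue
        mmbtu = delivery["fuel_received_units"] * delivery["fuel_mmbtu_per_unit"]
        total_cost += mmbtu * price
        total_mmbtu += mmbtu
    return total_cost / total_mmbtu if total_mmbtu > 0 else None


# Hypothetical deliveries to one plant in one month; the third price is redacted.
deliveries = [
    {"fuel_received_units": 100.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 2.0},
    {"fuel_received_units": 50.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 3.5},
    {"fuel_received_units": 80.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": None},
]
average_cost = average_fuel_cost_per_mmbtu(deliveries)
```

In this example the two priced deliveries total 3,000 MMBtu and 7,500 USD, giving 2.5 USD per MMBtu. Since redactions are not randomly distributed, an average computed this way can be biased for the fuels and regions where redaction is common.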
Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Two letter US state abbreviation.
Simplified fuel type code used in PUDL
Quantity of fuel received in tons, barrels, or Mcf.
Heat content of the fuel in millions of Btus per physical unit.
Average fuel cost per mmBTU of heat content in nominal USD.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or, when records are aggregated together and contain multiple sources, is a combination of these (mixed).
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Mercury content in parts per million (ppm) to the nearest 0.001 ppm.
For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.
For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__yearly_generation
Annual time series of net electricity generated by power plant generators.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, generator_id, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
Not all IDs are present.
Table has known low coverage - either geographic or temporal or otherwise.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel appears in this table due to the different reporting requirements imposed on different types and sizes of generators.
Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia923__yearly_generation_fuel_by_generator
Annual time series of estimated net generation and fuel consumption by generator.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
report_date, plant_id_eia, generator_id
Usage Warnings
Contains estimated values.
The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.
The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:
The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.
The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.
The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.
The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.
In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in the out_eia923__yearly_generation_fuel_by_generator_energy_source table to the generator level. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
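The 70% consistency rule described above can be illustrated with a minimal sketch. This is not PUDL's implementation: the function name choose_canonical is hypothetical, and ignoring nulls when computing the share is an assumption made for the example.

```python
from collections import Counter


def choose_canonical(values, threshold=0.7):
    """Return the most common reported value if its share meets the threshold.

    Hypothetical helper: nulls (None) are ignored when computing the share,
    which is an assumption of this sketch rather than documented behavior.
    """
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    # An inconsistently reported attribute becomes null.
    return value if count / len(reported) >= threshold else None


# 3 of 4 instances agree (75%), so a canonical value is chosen.
consistent = choose_canonical(["Barry", "Barry", "Barry", "Barry Plant"])
# Only 2 of 4 instances agree (50%), so the result is null.
inconsistent = choose_canonical(["Barry", "Barry Plant", "Barry", "Barry Plant"])
```

Under these assumptions, an attribute that falls below the threshold is returned as None, which corresponds to the null values mentioned in the paragraph above.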
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Net electricity generation for the specified period in megawatt-hours (MWh).
out_eia923__yearly_generation_fuel_by_generator_energy_source
Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code
Usage Warnings
Contains estimated values.
The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Contains information from multiple raw inputs.
Additional Details
The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:
The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.
The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.
The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.
The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.
In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.
The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
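As a rough illustration of the fraction-based allocation described above, the sketch below splits one plant-level total among generators. It is a simplification under stated assumptions, not PUDL's algorithm: the function name allocation_fractions and the field names capacity_mw and net_generation_mwh are chosen for the example, and the rule "use reported net generation when every generator in the group has it, otherwise use capacity" is an assumed simplification of the real method.

```python
def allocation_fractions(generators):
    """Map generator_id -> share of the group's plant-level total.

    Each generator is a dict with 'generator_id', 'capacity_mw' and
    'net_generation_mwh' (None when the generator has no directly
    reported generation). Assumed rule for this sketch: weight by
    reported net generation when it is available for every generator,
    otherwise weight by capacity.
    """
    reported = [g["net_generation_mwh"] for g in generators]
    if all(v is not None for v in reported) and sum(reported) > 0:
        weights = reported
    else:
        weights = [g["capacity_mw"] for g in generators]
    total = sum(weights)
    return {g["generator_id"]: w / total for g, w in zip(generators, weights)}


# Two generators with no directly reported generation: fall back to capacity.
group = [
    {"generator_id": "1", "capacity_mw": 100.0, "net_generation_mwh": None},
    {"generator_id": "2A", "capacity_mw": 300.0, "net_generation_mwh": None},
]
fractions = allocation_fractions(group)

# Apply the fractions to a plant-level total (hypothetical value).
plant_total_mwh = 8000.0
allocated = {gen: frac * plant_total_mwh for gen, frac in fractions.items()}
```

Note that the generator IDs are kept as strings, consistent with the guidance in the column descriptions.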
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Code for the type of prime mover (e.g. CT, CG)
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
out_eia923__yearly_generation_fuel_by_generator_energy_source_owner
Annual time series of estimated net generation and fuel consumption for each generator, associated with each combination of generator, energy source, prime mover, and owner.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code, utility_id_eia, ownership_record_type
Usage Warnings
Contains estimated values.
The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).
Date column arbitrarily uses the first of the month.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:
The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.
The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.
The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.
The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.
In this table, PUDL has scaled the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel by ownership from out_eia860__yearly_ownership. Note that the utility_id_eia in this table refers to the OWNER of the generator, not the operator.
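The ownership scaling can be sketched as multiplying each allocated value by the owner's share. This is an illustrative example only: the function name scale_by_ownership, the value column names net_generation_mwh and fuel_consumed_mmbtu, and the utility and plant IDs are assumptions for the example.

```python
def scale_by_ownership(record, owners):
    """Return one scaled copy of a generator record per owner.

    `owners` maps the OWNER's utility_id_eia to its fraction_owned.
    Value column names are assumed for this sketch.
    """
    rows = []
    for utility_id_eia, fraction_owned in owners.items():
        rows.append({
            **record,
            "utility_id_eia": utility_id_eia,
            "fraction_owned": fraction_owned,
            "net_generation_mwh": record["net_generation_mwh"] * fraction_owned,
            "fuel_consumed_mmbtu": record["fuel_consumed_mmbtu"] * fraction_owned,
        })
    return rows


# Hypothetical generator record and ownership shares.
generator_record = {
    "plant_id_eia": 9999,
    "generator_id": "1",
    "net_generation_mwh": 1000.0,
    "fuel_consumed_mmbtu": 10000.0,
}
owner_rows = scale_by_ownership(generator_record, {111: 0.75, 222: 0.25})
```

When the ownership fractions sum to 1, the scaled rows sum back to the generator's allocated totals.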
This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Code for the type of prime mover (e.g. CT, CG)
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
The EIA Utility Identification number.
Whether each generator record is for one owner or represents a total of all ownerships.
Proportion of generator ownership attributable to this utility.
Total installed (nameplate) capacity, in megawatts.
Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
out_eia923__yearly_generation_fuel_combined
Annual time series of generation and fuel consumption of all generation units.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, prime_mover_code, energy_source_code
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.
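A minimal sketch of the aggregation described above: unit-level nuclear records are summed up to the plant, date, prime mover and energy source level, which matches this table's primary key. The field name nuclear_unit_id, the value column net_generation_mwh, and the sample records are assumptions for the example.

```python
from collections import defaultdict


def aggregate_units(unit_records):
    """Sum unit-level net generation up to the plant / prime mover level."""
    totals = defaultdict(float)
    for r in unit_records:
        key = (
            r["plant_id_eia"],
            r["report_date"],
            r["prime_mover_code"],
            r["energy_source_code"],
        )
        totals[key] += r["net_generation_mwh"]
    return dict(totals)


# Two hypothetical nuclear units at the same plant.
nuclear_units = [
    {"plant_id_eia": 9999, "report_date": "2022-01-01", "nuclear_unit_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "NUC",
     "net_generation_mwh": 500000.0},
    {"plant_id_eia": 9999, "report_date": "2022-01-01", "nuclear_unit_id": "2",
     "prime_mover_code": "ST", "energy_source_code": "NUC",
     "net_generation_mwh": 480000.0},
]
plant_level = aggregate_units(nuclear_units)
```

After aggregation the unit identifier is gone, so the nuclear records have the same shape as the fossil fuel records they are combined with.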
This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.
The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
Code for the type of prime mover (e.g. CT, CG)
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Consumption for electric generation of the fuel type in physical unit.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_eia930__hourly_aggregated_demand
Hourly time series of aggregated balancing authority demand by EIA region, interconnect, and continental US.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, aggregation_level, aggregation_group
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
This table is experimental and/or a work in progress and may change in the future.
Additional Details
The spatial granularity of each record is indicated by aggregation_level.
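Because each aggregation_level is a separate roll-up of the same underlying demand, summing across levels double counts. The sketch below shows the safer pattern of filtering to one level first. The level labels ("region", "interconnect"), the group labels, the column name demand_mwh and all values are hypothetical.

```python
# Hypothetical rows: two regions that together make up one interconnect.
rows = [
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "region",
     "aggregation_group": "A", "demand_mwh": 100.0},
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "region",
     "aggregation_group": "B", "demand_mwh": 150.0},
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "interconnect",
     "aggregation_group": "X", "demand_mwh": 250.0},
]


def total_demand(rows, level):
    """Sum demand for a single aggregation level only."""
    return sum(r["demand_mwh"] for r in rows if r["aggregation_level"] == level)


# Summing every row mixes levels and counts the same demand twice.
naive_total = sum(r["demand_mwh"] for r in rows)
region_total = total_demand(rows, "region")
interconnect_total = total_demand(rows, "interconnect")
```

In this constructed example the region and interconnect totals agree, while the naive total is twice as large.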
Columns
Date and time converted to Coordinated Universal Time (UTC).
Indicates the spatial granularity of the aggregated value.
Label identifying a group of balancing authorities to be used in aggregating demand, e.g. a region of the US or a whole interconnect.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.
out_eia930__hourly_operations
Hourly time series of balancing authority net generation, interchange, and demand with imputed demand.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia
Usage Warnings
Contains rows where missing values were imputed.
This table is experimental and/or a work in progress and may change in the future.
Additional Details
This table is based on core_eia930__hourly_operations, but adds imputed demand where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.
Columns
Date and time converted to Coordinated Universal Time (UTC).
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Unaltered originally reported net generation for the specified period.
Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Reported net generation with outlying values removed and missing values imputed by EIA.
Original reported energy interchange between adjacent balancing authorities.
Energy interchange between adjacent balancing authorities, adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Energy interchange between adjacent balancing authorities, with outliers and missing values imputed by EIA.
Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.
Electricity demand adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.
Code describing why a demand value was flagged for imputation.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed by EIA.
Day ahead demand forecast.
out_eia930__hourly_subregion_demand
Hourly time series of balancing authority subregion demand with imputed demand.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia, balancing_authority_subregion_code_eia
Usage Warnings
Contains rows where missing values were imputed.
This table is experimental and/or a work in progress and may change in the future.
Additional Details
This table is based on core_eia930__hourly_subregion_demand, but adds imputed demand where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.
Columns
Date and time converted to Coordinated Universal Time (UTC).
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Code identifying subregions of larger balancing authorities.
Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.
Code describing why a demand value was flagged for imputation.
out_eia__yearly_assn_plant_parts_plant_gen
Annual association table providing connections between plant parts (identified as record_id_eia) and 'plant_gen' records (identified as record_id_eia_plant_gen) from the out_eia__yearly_plant_parts table.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
record_id_eia, record_id_eia_plant_gen
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Contains information from multiple raw inputs.
Additional Details
The purpose of this table is to help determine what generator records are associated with every plant part record. The plant part records have a one to many relationship to the associated 'plant_gen' records. The 'plant_gen' records are also included in the plant part records, so for these records there is a one to one relationship and the record_id_eia and the record_id_eia_plant_gen are the same.
All of the columns in this table that have a suffix of '_plant_gen' are attributes of the record_id_eia_plant_gen record.
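The one-to-many relationship described above can be sketched as a lookup from each plant part record to its generator records. The record IDs below are hypothetical placeholders, not real record_id_eia values, and the function name is chosen for the example.

```python
from collections import defaultdict

# (record_id_eia, record_id_eia_plant_gen) pairs with placeholder IDs.
# A plant-level part maps to both generators; each 'plant_gen' record
# maps to itself.
association = [
    ("plant_A", "gen_A1"),
    ("plant_A", "gen_A2"),
    ("gen_A1", "gen_A1"),
    ("gen_A2", "gen_A2"),
]


def generators_by_plant_part(pairs):
    """Group the generator-level record IDs under each plant part record."""
    lookup = defaultdict(list)
    for record_id_eia, record_id_eia_plant_gen in pairs:
        lookup[record_id_eia].append(record_id_eia_plant_gen)
    return dict(lookup)


lookup = generators_by_plant_part(association)
# Number of generators associated with each plant part record.
generator_counts = {rid: len(gens) for rid, gens in lookup.items()}
```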
Columns
Identifier for EIA plant parts analysis records.
Identifier for EIA plant parts analysis records which is at the plant_part level of plant_gen - meaning each record pertains to one generator.
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
The EIA Utility Identification number.
Whether each generator record is for one owner or represents a total of all ownerships.
Generator ID of the record_id_eia_plant_gen record. This is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Code representing the most predominant type of energy that fuels the record_id_eia_plant_gen's generator.
Code for the type of prime mover (e.g. CT, CG) associated with the record_id_eia_plant_gen.
Dynamically assigned PUDL unit id of the record_id_eia_plant_gen. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
High level description of the technology used by the record_id_eia_plant_gen's generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code. This name is associated with the record_id_eia_plant_gen record.
ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. This ID is associated with the record_id_eia_plant_gen record. It depends on ferc1_generator_agg_id, which has not been updated since 2020, but only affects a couple hundred records total across all years.
The year an associated plant_gen's generator went into service.
The operating status of the asset using PUDL categories of the record_id_eia_plant_gen record.
The number of generators associated with each record_id_eia.
out_eia__yearly_boilers
Annual time series of all EIA boiler attributes.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
plant_id_eia, boiler_id, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This table inherits canonicalized values for boilers, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Alphanumeric boiler ID.
Total air flow including excess air at 100 percent load, reported at standard temperature and pressure (i.e. 68 F and one atmosphere pressure).
The code representing the most predominant type of energy that fuels the boiler.
The code representing the second most predominant type of energy that fuels the boiler.
The code representing the third most predominant type of energy that fuels the boiler.
The code representing the fourth most predominant type of energy that fuels the boiler.
Name of boiler manufacturer.
EIA short code for boiler manufacturer.
Date the boiler began or is planned to begin commercial operation.
Date of the scheduled or effected retirement of the boiler.
EIA short code identifying boiler operational status.
EIA short code indicating the standards under which the boiler is operating as described in the U.S. EPA regulation under 40 CFR.
Name of the city.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for mercury emissions.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for nitrogen oxide emissions.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for particulate matter emissions.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for sulfur dioxide emissions.
County name.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Boiler efficiency percentage when burning at 100 percent load to the nearest 0.1 percent.
Boiler efficiency percentage when burning at 50 percent load to the nearest 0.1 percent.
Design firing rate at maximum continuous steam flow for coal to the nearest 0.1 ton per hour.
Design firing rate at maximum continuous steam flow for gas to the nearest 0.1 cubic feet per hour.
Design firing rate at maximum continuous steam flow for pet coke to the nearest 0.1 barrels per hour.
Design firing rate at maximum continuous steam flow for energy sources other than coal, petroleum, or natural gas.
EIA short code indicating the type of firing used by this boiler.
EIA short code indicating the type of firing used by this boiler.
EIA short code indicating the type of firing used by this boiler.
Indicates whether the boiler is capable of re-injecting fly ash.
Indicates if the boiler is a heat recovery steam generator (HRSG).
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
Maximum continuous steam flow at 100 percent load.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Indicates whether the boiler is subject to New Source Review requirements.
Month of issued New Source Review permit.
New Source Review permit number.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Name of nitrogen oxide control manufacturer.
Code indicating the nitrogen oxide control burner manufacturer.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Nitrogen oxide control status code.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
Most stringent type of statute or regulation code under which the boiler is operating for mercury control standards.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for nitrogen oxide control standards.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for particulate matter control standards.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for sulfur dioxide control standards.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
Numeric value for the unit of measurement specified for nitrogen oxide.
Numeric value for the unit of measurement specified for particulate matter.
The percent of sulfur dioxide to be scrubbed specified by the most stringent sulfur dioxide regulation.
Numeric value for the unit of measurement specified for sulfur dioxide.
Two letter US state abbreviation.
Physical street address.
IANA timezone name
The turndown ratio for the boiler.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Numeric value for the unit of measurement specified for nitrogen oxide.
Numeric value for the unit of measurement specified for particulate matter.
Numeric value for the unit of measurement specified for sulfur dioxide.
Design waste-heat input rate at maximum continuous steam flow where a waste-heat boiler is a boiler that receives all or a substantial portion of its energy input from the noncombustible exhaust gases of a separate fuel-burning process (MMBtu per hour).
Wet or Dry Bottom where Wet Bottom is defined as slag tanks that are installed at furnace throat to contain and remove molten ash from the furnace, and Dry Bottom is defined as having no slag tanks at furnace throat area, throat area is clear, and bottom ash drops through throat to bottom ash water hoppers.
Five digit US Zip Code.
out_eia__yearly_generators_by_ownership
Annual time series of all EIA generators with ownership integrated.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
This table has no primary key. This table would have a primary key of the following columns, except there are ~200 records without utility IDs: ['plant_id_eia', 'generator_id', 'report_date', 'utility_id_eia', 'ownership_record_type'].
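A minimal sketch of why the records without utility IDs block a primary key: a key column cannot be null. The sample rows are invented, and `candidate_key_problems` is a hypothetical helper, not part of PUDL.

```python
# Hypothetical helper and invented rows, for illustration only.
KEY_COLUMNS = [
    "plant_id_eia",
    "generator_id",
    "report_date",
    "utility_id_eia",
    "ownership_record_type",
]


def candidate_key_problems(rows, key_columns=KEY_COLUMNS):
    """Return (rows with a null key column, duplicated fully populated keys)."""
    null_rows = []
    duplicates = []
    seen = set()
    for row in rows:
        key = tuple(row.get(col) for col in key_columns)
        if any(value is None for value in key):
            null_rows.append(row)
        elif key in seen:
            duplicates.append(key)
        else:
            seen.add(key)
    return null_rows, duplicates


rows = [
    {"plant_id_eia": 1, "generator_id": "1", "report_date": "2020-01-01",
     "utility_id_eia": 100, "ownership_record_type": "owned"},
    {"plant_id_eia": 1, "generator_id": "1", "report_date": "2020-01-01",
     "utility_id_eia": 100, "ownership_record_type": "total"},
    # A record with no utility ID cannot take part in a primary key.
    {"plant_id_eia": 2, "generator_id": "GT1", "report_date": "2020-01-01",
     "utility_id_eia": None, "ownership_record_type": "owned"},
]

null_rows, duplicates = candidate_key_problems(rows)
print(len(null_rows), len(duplicates))
```

If you need a unique index, one option is to filter out the rows with a null utility_id_eia first and then verify that no duplicates remain.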
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
Generators that report to EIA can have multiple utility owners. EIA reports the ownership fraction for each of the owners of generators (see core_eia860__scd_ownership). This table includes two records for each generator's owner: one record represents the total ownership stake and the other represents that owner's proportional stake (labeled "total" and "owned", respectively, in the ownership_record_type column). All of the values reported for these generator-owner records are scaled to the ownership stake (represented by fraction_owned).
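The two-records-per-owner layout can be sketched as follows. This is an illustration, not PUDL's implementation: the function name and sample values are hypothetical, and it assumes the "total" record carries a fraction_owned of 1.0.

```python
def ownership_records(generator, owners,
                      value_columns=("capacity_mw", "net_generation_mwh")):
    """Build an 'owned' and a 'total' record for every owner of a generator.

    Assumption: 'total' records use fraction_owned = 1.0, while 'owned'
    records use the owner's reported fraction.
    """
    records = []
    for utility_id, fraction in owners.items():
        for record_type, scale in (("owned", fraction), ("total", 1.0)):
            record = {
                "generator_id": generator["generator_id"],
                "utility_id_eia": utility_id,
                "ownership_record_type": record_type,
                "fraction_owned": scale,
            }
            # Scale every value column to the ownership stake.
            for col in value_columns:
                record[col] = generator[col] * scale
            records.append(record)
    return records


# Invented generator with two owners holding 75% and 25%.
generator = {"generator_id": "1", "capacity_mw": 100.0,
             "net_generation_mwh": 400000.0}
owners = {111: 0.75, 222: 0.25}

records = ownership_records(generator, owners)
owned_capacity = sum(r["capacity_mw"] for r in records
                     if r["ownership_record_type"] == "owned")
all_capacity = sum(r["capacity_mw"] for r in records)
print(owned_capacity, all_capacity)
```

Summing over both record types overstates the generator's capacity, so filter to a single ownership_record_type before aggregating.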
This table was created to represent every possible version of ownership slices of the EIA generators. It is primarily used as an input to out_eia__yearly_plant_parts.
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
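The 70% rule can be sketched as below. This is a simplified illustration rather than PUDL's harvesting code; `canonical_value` is a hypothetical name, and ignoring nulls when computing the share is an assumption.

```python
from collections import Counter


def canonical_value(reported_values, threshold=0.7):
    """Return the most common value if it meets the consistency threshold.

    Assumption: nulls (None) are excluded before computing the share.
    Returns None when no value is reported consistently enough.
    """
    observed = [value for value in reported_values if value is not None]
    if not observed:
        return None
    value, count = Counter(observed).most_common(1)[0]
    return value if count / len(observed) >= threshold else None


# Invented example: a plant's state as reported across ten source records.
consistent = canonical_value(["TX"] * 8 + ["OK"] * 2)    # 80% agreement
inconsistent = canonical_value(["TX"] * 6 + ["OK"] * 4)  # 60% agreement
print(consistent, inconsistent)
```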
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
High level description of the technology used by the generator to produce electricity.
The code representing the most predominant type of energy that fuels the generator.
Code for the type of prime mover (e.g. CT, CG)
Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.
Date of the scheduled or effected retirement of the generator.
The operating status of the asset. For generators this is based on which tab the generator was listed in within EIA 860.
Total installed (nameplate) capacity, in megawatts.
Simplified fuel type code used in PUDL
Planned effective date of the scheduled retirement of the generator.
Fraction of potential generation that was actually reported for a plant part.
Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or comes from multiple sources because records were aggregated together (mixed).
Average fuel cost per mmBTU of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Name of FERC account, derived from technology description and prime mover code.
Year a generator went into service.
The operating status of the asset using PUDL categories.
Total end of year installed (nameplate) capacity for a plant part, in megawatts.
Proportion of generator ownership attributable to this utility.
Whether each generator record is for one owner or represents a total of all ownerships.
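Two of the derived columns described above follow standard definitions, sketched here under stated assumptions. The function and parameter names are hypothetical, the heat rate is assumed to be in MMBtu per MWh, and a non-leap year of 8,760 hours is used for the annual capacity factor.

```python
def capacity_factor(net_generation_mwh, capacity_mw, hours_in_period=8760):
    """Fraction of potential generation that was actually produced."""
    if not capacity_mw or not hours_in_period:
        return None  # undefined without capacity or a reporting period
    return net_generation_mwh / (capacity_mw * hours_in_period)


def fuel_cost_per_mwh(fuel_cost_per_mmbtu, heat_rate_mmbtu_per_mwh):
    """Fuel cost per MWh generated: cost per MMBtu times MMBtu per MWh."""
    if fuel_cost_per_mmbtu is None or heat_rate_mmbtu_per_mwh is None:
        return None  # missing inputs propagate as nulls
    return fuel_cost_per_mmbtu * heat_rate_mmbtu_per_mwh


# Invented values: a 100 MW generator producing 438,000 MWh in a year,
# burning fuel at 2.00 USD/MMBtu with a heat rate of 10 MMBtu/MWh.
cf = capacity_factor(438000.0, 100.0)
cost = fuel_cost_per_mwh(2.0, 10.0)
print(cf, cost)
```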
out_eia__yearly_plant_parts
Annual time series of all EIA plant parts, for use in matching to FERC 1.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
record_id_eia
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
Practically speaking, a plant is a collection of generator(s). There are many attributes of generators (i.e. prime mover, primary fuel source, technology type). We can use these generator attributes to group generator records into larger aggregate records which we call "plant parts". A plant part is a record which corresponds to a particular collection of generators that all share an identical attribute and utility owner. E.g. all of the generators with unit_id=2, or all of the generators with coal as their primary fuel source.
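A rough sketch of the grouping idea: generators that share a plant, a utility owner, and one attribute are rolled up into a single plant part record. The helper name, column names, and sample rows are hypothetical, and this is not how PUDL builds the table internally.

```python
from collections import defaultdict


def build_plant_parts(generators, part_column):
    """Aggregate generators sharing a plant, utility, and one attribute."""
    parts = defaultdict(lambda: {"capacity_mw": 0.0, "generator_ids": []})
    for gen in generators:
        key = (gen["plant_id_eia"], gen["utility_id_eia"], gen[part_column])
        parts[key]["capacity_mw"] += gen["capacity_mw"]
        parts[key]["generator_ids"].append(gen["generator_id"])
    return dict(parts)


# Invented generators at one plant owned by one utility.
generators = [
    {"plant_id_eia": 1, "utility_id_eia": 10, "generator_id": "1",
     "unit_id_pudl": 1, "primary_fuel": "coal", "capacity_mw": 200.0},
    {"plant_id_eia": 1, "utility_id_eia": 10, "generator_id": "2",
     "unit_id_pudl": 2, "primary_fuel": "coal", "capacity_mw": 300.0},
    {"plant_id_eia": 1, "utility_id_eia": 10, "generator_id": "3",
     "unit_id_pudl": 2, "primary_fuel": "gas", "capacity_mw": 50.0},
]

by_fuel = build_plant_parts(generators, "primary_fuel")
by_unit = build_plant_parts(generators, "unit_id_pudl")
print(by_fuel[(1, 10, "coal")]["capacity_mw"])
print(by_unit[(1, 10, 2)]["generator_ids"])
```

The same generator appears in several plant parts (here, once by fuel and once by unit), which is why the table holds so much duplicative information.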
The EIA data about power plants (from EIA 923 and 860) is reported in tables with records that correspond to mostly generators and plants. Other datasets (cough cough FERC1) are less well organized and include plants, generators and other plant parts all in the same table without any clear labels. This plant part table is an attempt to create records corresponding to many different plant parts in order to connect specific slices of EIA plants to other datasets.
Because generators are often owned by multiple utilities, another dimension of this plant part table involves generating two records for each owner: one for the portion of the plant part they own and one for the plant part as a whole. The portion records are labeled in the ownership_record_type column as owned and the total records are labeled as total.
This table includes A LOT of duplicative information about EIA plants. It is primarily meant for use as an input into the record linkage between FERC1 plants and EIA.
This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Identifier for EIA plant parts analysis records.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
The part of the plant a record corresponds to.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Code for the type of prime mover (e.g. CT, CG)
The code representing the most predominant type of energy that fuels the generator.
High level description of the technology used by the generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code.
The EIA Utility Identification number.
Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.
Plant part of the associated true granularity record.
EIA record ID of the associated true granularity record.
ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. The ID is manually assigned and has not been updated since 2020, but only affects a couple hundred records total across all years.
Total end of year installed (nameplate) capacity for a plant part, in megawatts.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Year the plant's oldest still operational unit was built.
Proportion of generator ownership attributable to this utility.
Average fuel cost per mmBTU of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Simplified fuel type code used in PUDL
Date of the scheduled or effected retirement of the generator.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Year a generator went into service.
The operating status of the asset. For generators this is based on which tab the generator was listed in within EIA 860.
The operating status of the asset using PUDL categories.
Whether each generator record is for one owner or represents a total of all ownerships.
Whether a plant part record has a duplicate record with different ownership status.
Planned effective date of the scheduled retirement of the generator.
A manually assigned PUDL plant ID. May not be constant over time.
Plant name.
Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.
Contains EIA plant ID, plant part, ownership, and EIA utility id
Number of distinct generator IDs that participated in the aggregation for a plant part list record.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Four-digit year in which the data was reported.
PUDL plant ID and report year of the record.
out_eia__yearly_plants
Annual time series of all EIA plant attributes.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
plant_id_eia, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This table inherits canonicalized values for plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
Name of the city.
County name.
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
Two letter US state abbreviation.
Physical street address.
Five digit US Zip Code.
IANA timezone name
Date reported.
Is there an ash impoundment (e.g. pond, reservoir) at the plant?
If there is an ash impoundment at the plant, is the impoundment lined?
If there is an ash impoundment at the plant, the ash impoundment status as of December 31 of the reporting year.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
Geodetic coordinate system identifier (e.g. NAD27, NAD83, or WGS84).
Indicates if the facility has energy storage capabilities.
The docket number relating to the FERC cogenerator status. See FERC Form 556.
Indicates whether the plant has FERC qualifying facility cogenerator status. See FERC Form 556.
The docket number relating to the FERC qualifying facility exempt wholesale generator status.
Indicates whether the plant has FERC qualifying facility exempt wholesale generator status
The docket number relating to the FERC qualifying facility small power producer status. See FERC Form 556.
Indicates whether the plant has FERC qualifying facility small power producer status. See FERC Form 556.
The docket number relating to the FERC qualifying facility cogenerator status. See FERC Form 556.
Plant's grid voltage at point of interconnection to transmission or distribution facilities
Plant's grid voltage at point of interconnection to transmission or distribution facilities
Plant's grid voltage at point of interconnection to transmission or distribution facilities
The code of the plant's ISO or RTO. NA if not reported in that year.
Indicates if the facility has the capability to store natural gas in the form of liquefied natural gas.
Names of Local Distribution Company (LDC), connected to natural gas burning power plants.
Indicates if the facility has on-site storage of natural gas.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
NERC region in which the plant is located
Whether the plant has a net metering agreement in effect during the reporting year. (Only displayed for facilities that report the sun or wind as an energy source). This field was only reported up until 2015
Additional owner or operator of natural gas pipeline.
North American Industry Classification System (NAICS) code that best describes the primary purpose of the reporting plant
Indicates whether the plant is regulated or non-regulated.
Code that specifies what time period data has to be reported (i.e. monthly data or annual totals) and how often the power plant reports this data to EIA. See reporting_frequencies_eia for more details.
EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
Service area in which plant is located; for unregulated companies, it's the electric utility with which plant is interconnected
EIA-assigned code for owner of transmission/distribution system to which the plant is interconnected.
Name of the owner of the transmission or distribution system to which the plant is interconnected.
State location for owner of transmission/distribution system to which the plant is interconnected.
The EIA Utility Identification number.
Name of water source associated with the plant.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
A manually assigned PUDL plant ID. May not be constant over time.
The name of the utility.
A manually assigned PUDL utility ID. May not be stable over time.
Percentage consistency of balancing authority code across entity records.
out_eia__yearly_utilities
Annual time series of all EIA utility attributes.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
utility_id_eia, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This table inherits canonicalized values for utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
The name of the utility.
Date reported.
Physical street address.
Name of the city.
Two letter US state abbreviation.
Five digit US Zip Code.
Is the reporting entity an owner of power plants reported on Schedule 2 of the form?
Is the reporting entity an operator of power plants reported on Schedule 2 of the form?
Is the reporting entity an asset manager of power plants reported on Schedule 2 of the form?
Does the reporting entity have any other relationship to the power plants reported on Schedule 2 of the form?
Entity type of principal owner.
Mail attention name of the operator/owner.
Second line of the address.
Four digit US Zip Code suffix.
First name of utility contact 1.
Last name of utility contact 1.
Title of utility contact 1.
Phone number for utility contact 1.
Phone extension for utility contact 1
First name of utility contact 2.
Last name of utility contact 2.
Title of utility contact 2.
Phone number for utility contact 2.
Phone extension for utility contact 2
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
out_ferc1__yearly_balance_sheet_assets_sched110
Annual time series of utility assets and other debits.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)
- Primary key:
utility_id_ferc1, report_year, asset_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Type of asset being reported to the core_ferc1__yearly_balance_sheet_assets_sched110 table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Account balance at end of year.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Account balance at beginning of year.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
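The source record identifier format listed among the columns above can be assembled and taken apart as sketched below. The helper names and the sample table name are hypothetical, and splitting from the right assumes that none of the five trailing fields contains an underscore.

```python
RECORD_ID_TEMPLATE = (
    "{table_name}_{report_year}_{report_prd}_"
    "{respondent_id}_{spplmnt_num}_{row_number}"
)
FIELD_NAMES = ["table_name", "report_year", "report_prd",
               "respondent_id", "spplmnt_num", "row_number"]


def make_record_id(**fields):
    """Fill the documented template with the given field values."""
    return RECORD_ID_TEMPLATE.format(**fields)


def split_record_id(record_id):
    """Split from the right, since table names themselves contain underscores.

    Assumption: the five trailing fields never contain an underscore.
    """
    parts = record_id.rsplit("_", 5)
    if len(parts) != len(FIELD_NAMES):
        raise ValueError(f"Unexpected record ID format: {record_id!r}")
    return dict(zip(FIELD_NAMES, parts))


# Invented values; "f1_example_table" is not a real FERC Form 1 table name.
record_id = make_record_id(
    table_name="f1_example_table", report_year=2020, report_prd=12,
    respondent_id=1, spplmnt_num=0, row_number=5,
)
fields = split_record_id(record_id)
print(record_id)
print(fields["table_name"], fields["report_year"])
```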
out_ferc1__yearly_balance_sheet_liabilities_sched110
Annual time series of utility liabilities and other credits.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)
- Primary key:
utility_id_ferc1, report_year, liability_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Account balance at beginning of year.
Account balance at end of year.
Type of liability being reported to the core_ferc1__yearly_balance_sheet_liabilities_sched110 table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
out_ferc1__yearly_cash_flows_sched120
Annual time series of utility cash flow.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 120)
- Primary key:
utility_id_ferc1, report_year, amount_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Label describing the type of amount being reported. This could be a balance or a change in value.
Reported amount of dollars. This could be a balance or a change in value.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_depreciation_by_function_sched219
Annual time series of ending balances in accumulated provision for depreciation of electric utility plant.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section B)
- Primary key:
utility_id_ferc1, report_year, depreciation_type, plant_function, plant_status, utility_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric utility plant refers to FERC Account 108. Section B includes the Balances at End of Year According to Functional Classification (plant_function).
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Utility plant financial status (in service, future, leased, total).
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Account balance at end of year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_depreciation_changes_sched219
Annual time series of changes in accumulated provision for depreciation of electric utility plant.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section A)
- Primary key:
utility_id_ferc1, report_year, depreciation_type, plant_status, utility_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric utility plant refers to FERC Account 108.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.
Utility plant financial status (in service, future, leased, total).
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Dollar value of reported income, expense, asset, or liability.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_depreciation_summary_sched336
Annual time series of depreciation and amortization of electric plant.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 336 - Section A)
- Primary key:
utility_id_ferc1, report_year, plant_function, ferc_account_label
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Plant refers to FERC Accounts 403, 404, and 405. This table only contains information from Section A: Summary of depreciation and amortization changes.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Dollar value of reported income, expense, asset, or liability.
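The record identifier format given in the column descriptions above can be taken apart from the right-hand side, since the table name may itself contain underscores. The following is a minimal sketch, not part of PUDL: the function name, the NamedTuple, and the sample identifier are hypothetical, and it assumes the five trailing fields never contain underscores.

```python
from typing import NamedTuple


class RecordIdParts(NamedTuple):
    """Fields of the documented record ID format (hypothetical helper type)."""

    table_name: str
    report_year: int
    report_prd: str
    respondent_id: str
    spplmnt_num: str
    row_number: str


def parse_record_id(record_id: str) -> RecordIdParts:
    """Split a record ID into its six documented fields.

    Assumption: only table_name can contain underscores, so splitting
    five times from the right isolates the trailing fields.
    """
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"Unexpected record ID format: {record_id!r}")
    table_name, year, prd, respondent, supplement, row = parts
    return RecordIdParts(table_name, int(year), prd, respondent, supplement, row)


# Made-up identifier, used purely for illustration.
example = parse_record_id("f1_example_table_2019_12_123_0_7")
```

Identifiers that do not follow the documented format (for example, from tables that are row-mapped) would need different handling.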
out_ferc1__yearly_detailed_balance_sheet_assets
Annual time series of granular, de-duplicated accounting data of utilities' balance sheet assets.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 110, 200, 204 and 219)
- Primary key:
This table has no primary key.
Usage Warnings
The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is derived from four FERC Form 1 accounting tables with nested calculations:
core_ferc1__yearly_balance_sheet_assets_sched110
core_ferc1__yearly_utility_plant_summary_sched200
core_ferc1__yearly_plant_in_service_sched204
core_ferc1__yearly_depreciation_by_function_sched219
We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to three columns (utility_type, plant_function & plant_status) as compared to the originally reported values in our core tables. The modifications were applied either to provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
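The category condensing described above can be illustrated with a simple lookup. This is only a sketch of the idea, not the logic in pudl.output.ferc1.Exploder: the mapping and function names are hypothetical, and only the hydraulic_production example from the text is included.

```python
# Hypothetical mapping covering only the example named in the text above.
PLANT_FUNCTION_CONDENSED = {
    "hydraulic_production_conventional": "hydraulic_production",
    "hydraulic_production_pumped_storage": "hydraulic_production",
}


def condense_plant_function(plant_function: str) -> str:
    """Return the condensed category, or the input unchanged if unmapped."""
    return PLANT_FUNCTION_CONDENSED.get(plant_function, plant_function)


condensed = [
    condense_plant_function(value)
    for value in (
        "hydraulic_production_conventional",
        "hydraulic_production_pumped_storage",
        "transmission",
    )
]
```

Values without an entry in the mapping pass through unchanged, so applying the helper to already condensed data has no effect.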
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
The name of the type of value, derived from the XBRL fact name.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Utility plant financial status (in service, future, leased, total).
Account balance at end of year.
Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).
Whether or not a record from the detailed FERC1 accounting tables should be considered allowable in a utility's rate base based on utility accounting standards. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type, plant_status or plant_function.
A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.
The name of the PUDL database table where a given record originated from.
out_ferc1__yearly_detailed_balance_sheet_liabilities
Annual time series of granular, de-duplicated accounting data of utilities' balance sheet liabilities.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110 and 118)
- Primary key:
This table has no primary key.
Usage Warnings
The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is derived from two FERC Form 1 accounting tables with nested calculations:
core_ferc1__yearly_balance_sheet_liabilities_sched110
core_ferc1__yearly_retained_earnings_sched118
We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to three columns (utility_type, plant_function & plant_status) as compared to the originally reported values in our core tables. The modifications were applied either to provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
The name of the type of value, derived from the XBRL fact name.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Account balance at end of year.
Whether or not a record from the detailed FERC1 accounting tables should be considered allowable in a utility's rate base based on utility accounting standards. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type, plant_status or plant_function.
A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.
The name of the PUDL database table where a given record originated from.
out_ferc1__yearly_detailed_income_statements
Annual time series of granular, de-duplicated accounting data of utilities' income statements.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 114, 300, 320 and 336)
- Primary key:
This table has no primary key.
Usage Warnings
The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is derived from four FERC Form 1 accounting tables with nested calculations:
core_ferc1__yearly_income_statements_sched114
core_ferc1__yearly_depreciation_summary_sched336
core_ferc1__yearly_operating_expenses_sched320
core_ferc1__yearly_operating_revenues_sched300
We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to two columns (utility_type & plant_function) as compared to the originally reported values in our core tables. The modifications were applied either to provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
The name of the type of value, derived from the XBRL fact name.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
The technology type associated with components of a utility's revenue requirement. These categories were manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes vary based on the utility_type or plant_function as well. This column is intended to be used to aggregate this table.
Dollar value of reported income, expense, asset, or liability.
Whether or not a record from the detailed income statement data is typically included in a utility's revenue requirement. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type or plant_function.
The category of revenue requirement associated with each component of a utility's income statements. These categories were manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes vary based on the utility_type or plant_function. This column is intended to be used to aggregate this table.
The name of the PUDL database table where a given record originated from.
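Two of the columns described above are intended for aggregating this table. As a rough sketch of that usage, the snippet below totals dollar values per revenue requirement category, keeping only records flagged as part of the revenue requirement. The column names (in_revenue_requirement, revenue_requirement_category, dollar_value), the category labels, and the sample rows are all assumptions made for illustration; check the actual column names before relying on this.

```python
from collections import defaultdict


def total_by_category(records):
    """Sum dollar_value per category for records in the revenue requirement.

    Assumed (hypothetical) keys: in_revenue_requirement,
    revenue_requirement_category, dollar_value.
    """
    totals = defaultdict(float)
    for rec in records:
        # Skip records outside the revenue requirement or with a null flag.
        if not rec.get("in_revenue_requirement"):
            continue
        # Skip null dollar values rather than treating them as zero.
        if rec.get("dollar_value") is None:
            continue
        totals[rec["revenue_requirement_category"]] += rec["dollar_value"]
    return dict(totals)


# Made-up rows for illustration only.
sample = [
    {"in_revenue_requirement": True, "revenue_requirement_category": "category_a", "dollar_value": 100.0},
    {"in_revenue_requirement": True, "revenue_requirement_category": "category_a", "dollar_value": 50.0},
    {"in_revenue_requirement": False, "revenue_requirement_category": "category_b", "dollar_value": 999.0},
    {"in_revenue_requirement": True, "revenue_requirement_category": "category_b", "dollar_value": None},
]
totals = total_by_category(sample)
```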
out_ferc1__yearly_energy_dispositions_sched401
Annual time series of dispositions of electric energy sold, exchanged, or stored.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)
- Primary key:
utility_id_ferc1, report_year, energy_disposition_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Energy Account, dispositions only. Schedule 401a. Covers electricity that utilities delivered to end users, internal losses, etc.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Type of energy disposition reported in the core_ferc1__yearly_energy_dispositions_sched401 table. Dispositions include sales to customers, re-sales of energy, energy used internally, losses, etc.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Sources and uses of energy in MWh.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
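Because this table mixes reported rows with calculated subtotals, a naive sum over all rows can double count. One hedged way to avoid that is to sum only the rows marked as reported. The column names (row_type_xbrl, energy_mwh), the label "reported_value", and the sample rows are assumptions for illustration and should be checked against the actual data.

```python
def sum_reported_mwh(records, reported_label="reported_value"):
    """Sum energy over rows flagged as reported, ignoring calculated rows.

    Assumed (hypothetical) keys: row_type_xbrl, energy_mwh.
    Null energy values are skipped.
    """
    return sum(
        rec["energy_mwh"]
        for rec in records
        if rec.get("row_type_xbrl") == reported_label
        and rec.get("energy_mwh") is not None
    )


# Made-up rows: two reported components plus their calculated subtotal.
rows = [
    {"row_type_xbrl": "reported_value", "energy_mwh": 400.0},
    {"row_type_xbrl": "reported_value", "energy_mwh": 100.0},
    {"row_type_xbrl": "calculated_value", "energy_mwh": 500.0},
]
reported_total = sum_reported_mwh(rows)
```

Summing every row in this sample would give twice the true total, which is the kind of error the usage warning above is meant to prevent.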
out_ferc1__yearly_energy_sources_sched401
Annual time series of sources of electric energy generated or purchased, exchanged and wheeled.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)
- Primary key:
utility_id_ferc1, report_year, energy_source_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Energy Account, sources only. Schedule 401a. Amount of electricity the utility obtained from each of several sources.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Type of energy source reported in the core_ferc1__yearly_energy_sources_sched401 table. There are three groups of energy sources: generation, power exchanges and transmission.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Sources and uses of energy in MWh.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
out_ferc1__yearly_hydroelectric_plants_sched406
Annual time series of plant statistics for large hydroelectric generating plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 406)
- Primary key:
record_id. The best approximation of a natural primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but that is a free-form string field and there are duplicate records.
Usage Warnings
The plant_type and construction_type are standardized into categorical values from free-form strings.
FERC does not require respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (e.g. unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Large plants have an installed nameplate capacity of more than 10 MW.
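Since respondents sometimes report both parts of a plant and the plant total, it can help to flag candidate total rows before aggregating. The heuristic below is only a sketch: the helper name and the sample plant names are hypothetical, and a name match on the word "total" will miss unlabeled totals and may flag legitimate plants, so flagged rows should be reviewed rather than dropped automatically.

```python
import re

# Matches "total" as a whole word, case-insensitively.
_TOTAL_PATTERN = re.compile(r"\btotal\b", re.IGNORECASE)


def looks_like_total(plant_name):
    """Heuristically flag free-form plant names that look like total rows."""
    return bool(_TOTAL_PATTERN.search(plant_name or ""))


# Made-up plant names for illustration.
names = ["Example Dam Unit 1", "Example Dam Unit 2", "Example Dam Total", None]
flags = [looks_like_total(name) for name in names]
```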
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Cost of plant: equipment (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: roads, railroads, and bridges (USD).
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Net plant capability under the least favorable operating conditions, in megawatts.
Net plant capability under the most favorable operating conditions, in megawatts.
Net electricity generation for the specified period in megawatt-hours (MWh).
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: hydraulic expenses (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Production expenses: maintenance of electric plant (USD).
Production expenses: rents (USD).
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Total production expenses, excluding fuel (USD).
Production expenses: water for power (USD).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Hours the plant was connected to load while generating in the report year.
A manually assigned PUDL plant ID. May not be constant over time.
Type of plant.
FERC Licensed Project Number.
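The capacity factor column described above is the ratio of actual to potential generation. As a minimal sketch of that relationship, assuming potential generation is nameplate capacity running every hour of the report year (the exact hours convention PUDL uses is not stated here, and the function and argument names are hypothetical):

```python
import calendar


def capacity_factor(net_generation_mwh, capacity_mw, report_year):
    """Return net generation divided by potential generation, or None.

    Assumption: potential generation = capacity_mw * hours in report_year.
    Returns None when an input is null or capacity is not positive.
    """
    if net_generation_mwh is None or capacity_mw is None or capacity_mw <= 0:
        return None
    hours = 8784 if calendar.isleap(report_year) else 8760
    return net_generation_mwh / (capacity_mw * hours)


# A 20 MW plant generating 87,600 MWh in a non-leap year.
cf = capacity_factor(87_600.0, 20.0, 2019)
```

Values above 1 or below 0 computed this way usually point to reporting problems, such as capacity and generation referring to different sets of units.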
out_ferc1__yearly_income_statements_sched114
Annual time series of utility income statements.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 114)
- Primary key:
utility_id_ferc1, report_year, utility_type, income_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Type of income reported in the core_ferc1__yearly_income_statements_sched114 table.
Dollar value of reported income, expense, asset, or liability.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_operating_expenses_sched320
Annual time series of operating and maintenance costs associated with producing electricity.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 320)
- Primary key:
utility_id_ferc1, report_year, expense_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Dollar value of reported income, expense, asset, or liability.
The type of expense.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_operating_revenues_sched300
Annual time series of utilities' electric operating revenues.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 300)
- Primary key:
utility_id_ferc1, report_year, revenue_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table includes only the structured part of Schedule 300. A number of revenue_type values do not have sales_mwh or avg_customers_per_month provided, in which case these columns will be NULL.
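Any ratio derived from these columns has to tolerate the missing values noted above. The sketch below computes revenue per MWh and returns None when either input is missing or when sales are zero. The helper name is hypothetical, and dollar_value is an assumed name for the revenue column.

```python
def revenue_per_mwh(dollar_value, sales_mwh):
    """Return revenue per MWh sold, or None if it cannot be computed.

    Assumption: dollar_value holds the revenue for the row and sales_mwh
    the corresponding energy sold; either may be None (NULL).
    """
    if dollar_value is None or sales_mwh is None or sales_mwh == 0:
        return None
    return dollar_value / sales_mwh


# Made-up values for illustration.
with_sales = revenue_per_mwh(1000.0, 10.0)
without_sales = revenue_per_mwh(1000.0, None)
```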
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Label describing types of revenues.
Dollar value of reported income, expense, asset, or liability.
Quantity of electricity sold in MWh.
Average number of customers per month.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
out_ferc1__yearly_other_regulatory_liabilities_sched278
Annual time series of utilities' other regulatory liabilities, including rate order docket number.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 278)
- Primary key:
This table has no primary key. Respondents are able to enter any number of liabilities across many rows. There are no IDs or set fields enforced in the original table.
Usage Warnings
The description column is a free-form string.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Long human-readable description of the meaning of a code/label.
Account balance at end of year.
Account balance at beginning of year.
The increase during the reporting period of other regulatory liabilities.
Description of the account number credited from making debit adjustment to other regulatory liabilities.
The decrease during the reporting period of other regulatory liabilities.
out_ferc1__yearly_plant_in_service_sched204
Annual time series of utilities' balances and changes to FERC Electric Plant in Service accounts.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 204)
- Primary key:
utility_id_ferc1, report_year, ferc_account_label
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Account numbers correspond to the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Each FERC respondent reports starting and ending balances for each account annually. Balances are organization-wide and are not broken down on a per-plant basis. The end-of-year balance should equal the beginning-of-year balance plus the sum of additions, retirements, adjustments, and transfers.
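The balance relationship stated above can be checked row by row. The following is a hedged sketch: the column names (starting_balance, additions, retirements, adjustments, transfers, ending_balance) are assumed, and so is the sign convention. It follows the text literally by adding retirements; if retirements are stored as positive magnitudes, pass retirements_sign=-1.

```python
import math

_FIELDS = (
    "starting_balance",
    "additions",
    "retirements",
    "adjustments",
    "transfers",
    "ending_balance",
)


def balance_reconciles(row, retirements_sign=1, abs_tol=0.5):
    """Check the ending balance against the starting balance plus changes.

    Returns None if any needed field is null. retirements_sign is an
    assumption about how retirements are stored (see lead-in).
    """
    if any(row.get(field) is None for field in _FIELDS):
        return None
    expected = (
        row["starting_balance"]
        + row["additions"]
        + retirements_sign * row["retirements"]
        + row["adjustments"]
        + row["transfers"]
    )
    return math.isclose(row["ending_balance"], expected, rel_tol=1e-9, abs_tol=abs_tol)


# Made-up row with retirements stored as a negative change.
signed_row = {
    "starting_balance": 100.0,
    "additions": 20.0,
    "retirements": -5.0,
    "adjustments": 0.0,
    "transfers": 0.0,
    "ending_balance": 115.0,
}
ok = balance_reconciles(signed_row)
```

The absolute tolerance allows for rounding to whole dollars in the reported values; rows that fail the check are candidates for reporting errors rather than proof of one.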
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Utility plant financial status (in service, future, leased, total).
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Cost of acquisition of items classified within the account.
Cost of adjustments to the account.
Account balance at end of year.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.
Cost of disposal of items classified within the account.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Account balance at beginning of year.
Cost of transfers into (out of) the account.
out_ferc1__yearly_pumped_storage_plants_sched408
Annual time series of plant statistics for hydroelectric pumped storage plants with an installed nameplate capacity of 10+ MW.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 408)
- Primary key:
record_id. The best approximation of a natural primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not require respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (e.g. unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
As reported in Schedule 408 of FERC Form 1.
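Since plant_name_ferc1 is free-form and duplicates exist, it can help to group records by the approximate key named above before aggregating. The following is a minimal sketch in plain Python. The rows are invented for illustration and are not real FERC data, and find_key_collisions is a hypothetical helper, not part of PUDL.

```python
from collections import defaultdict

# Invented example rows; not real FERC data.
rows = [
    {"record_id": "a_1", "report_year": 2020, "utility_id_ferc1": 1, "plant_name_ferc1": "plant a"},
    {"record_id": "a_2", "report_year": 2020, "utility_id_ferc1": 1, "plant_name_ferc1": "plant a"},
    {"record_id": "a_3", "report_year": 2020, "utility_id_ferc1": 2, "plant_name_ferc1": "plant b"},
]

# The approximate (not guaranteed unique) key described in the text.
APPROX_KEY = ("report_year", "utility_id_ferc1", "plant_name_ferc1")


def find_key_collisions(rows):
    """Map each approximate key shared by several records to their record_ids."""
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[col] for col in APPROX_KEY)
        groups[key].append(row["record_id"])
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


collisions = find_key_collisions(rows)
```

Any key returned here deserves a manual look: it may be a true duplicate, or a respondent reporting units and a plant total under the same name.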
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Cost of plant: accessory electric equipment (USD).
Cost of plant: miscellaneous power plant equipment (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: roads, railroads, and bridges (USD).
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Cost of plant: water wheels, turbines, and generators (USD).
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Energy used for pumping, in megawatt-hours.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Net output for load (net generation - energy used for pumping) in megawatt-hours.
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Production expenses: maintenance of electric plant (USD).
Total production expenses before pumping (USD).
Production expenses: pumped storage (USD).
Production expenses: pumping (USD).
Production expenses: rents (USD).
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Total production expenses, excluding fuel (USD).
Production expenses: water for power (USD).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net plant capability in megawatts.
Hours the plant was connected to load while generating in the report year.
A manually assigned PUDL plant ID. May not be constant over time.
FERC Licensed Project Number.
out_ferc1__yearly_purchased_power_and_exchanges_sched326
Annual time series of purchased power (Account 555) including power exchanges (transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 326)
- Primary key:
This table has no primary key. The primary key for this table would be record_id; however, a small number of these values are duplicated in 2021, 2022, and 2024.
Usage Warnings
Contains columns which may appear categorical, but are actually free text.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table has data about inter-utility power purchases. This includes how much electricity was purchased, how much it cost, and who it was purchased from. Unfortunately the field describing which other utility the power was being bought from (seller_name) is poorly standardized, making it difficult to correlate with other data.
Purchased Power is considered FERC Account 555 according to FERC's Uniform System of Accounts. Reported on pages 326-327 of FERC Form 1.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Name of the seller, or the other party in an exchange transaction.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Monthly average billing demand (for requirements purchases, and any transactions involving demand charges). In megawatts.
Average monthly coincident peak (CP) demand (for requirements purchases, and any transactions involving demand charges). Monthly CP demand is the metered demand during the hour (60-minute integration) in which the supplier's system reaches its monthly peak. In megawatts.
Gross megawatt-hours delivered in power exchanges and used as the basis for settlement.
Demand charges (USD).
Energy charges (USD).
Average monthly non-coincident peak (NCP) demand (for requirements purchases, and any transactions involving demand charges). Monthly NCP demand is the maximum metered hourly (60-minute integration) demand in a month. In megawatts.
Other charges, including out-of-period adjustments (USD).
Categorization based on the original contractual terms and conditions of the service. Must be one of 'requirements', 'long_firm', 'intermediate_firm', 'short_firm', 'long_unit', 'intermediate_unit', 'electricity_exchange', 'other_service', or 'adjustment'. Requirements service is ongoing high reliability service, with load integrated into system resource planning. 'Long term' means 5+ years. 'Intermediate term' is 1-5 years. 'Short term' is less than 1 year. 'Firm' means not interruptible for economic reasons. 'unit' indicates service from a particular designated generating unit. 'exchange' is an in-kind transaction.
Megawatt-hours shown on bills rendered to the respondent. Includes both electricity purchased for storage and non-storage purposes, which were lumped together prior to 2021.
Number of megawatt hours purchased during the period for energy storage.
Number of megawatt hours purchased during the period for other than energy storage.
Gross megawatt-hours received in power exchanges and used as the basis for settlement.
FERC Rate Schedule Number or Tariff. (Note: may be incomplete if originally reported on multiple lines.)
Sum of demand, energy, and other charges (USD). For power exchanges, the settlement amount for the net receipt of energy. If more energy was delivered than received, this amount is negative.
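The settlement definition above can be used as a consistency check on purchase rows. This is a rough sketch only: the argument names mirror the column descriptions rather than confirmed column names, treating a missing component as zero is an assumption, and the tolerance is arbitrary. Rows for power exchanges follow the net-receipt rule instead, so the check would not apply to them.

```python
def expected_settlement(demand_charges, energy_charges, other_charges):
    """Sum the three charge components, treating missing values (None) as zero.

    Treating None as zero is an assumption made for this sketch.
    """
    parts = (demand_charges, energy_charges, other_charges)
    return sum(part for part in parts if part is not None)


def settlement_matches(reported_total, demand_charges, energy_charges,
                       other_charges, tolerance=0.5):
    """Return True when the reported total is within `tolerance` USD of the sum."""
    expected = expected_settlement(demand_charges, energy_charges, other_charges)
    return abs(reported_total - expected) <= tolerance


total = expected_settlement(100.0, 250.0, None)
```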
out_ferc1__yearly_rate_base
Annual time series of granular accounting data consisting of what utilities can typically include in their rate bases.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 110, 118, 200, 204, 219 and 320)
- Primary key:
This table has no primary key.
Usage Warnings
The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is derived from seven FERC Form 1 accounting tables with nested calculations. We reconciled these nested calculations and then identified the most granular data across the tables. Here are the three direct upstream inputs - the two detailed tables have several core_ferc1 inputs each:
out_ferc1__yearly_detailed_balance_sheet_assets
out_ferc1__yearly_detailed_balance_sheet_liabilities
core_ferc1__yearly_operating_expenses_sched320
We applied slight modifications to three columns (utility_type, plant_function & plant_status) as compared to the originally reported values in our core tables. The modifications were applied either to provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
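To illustrate the category condensing described above, here is a minimal sketch of the hydraulic_production example. It covers only the single case named in the text; the actual logic in pudl.output.ferc1.Exploder may handle more categories and works differently, and condense_plant_function is a hypothetical helper.

```python
# Only the mapping named in the text; the real set of condensed categories
# is not listed here, so anything else passes through unchanged.
CONDENSED_PLANT_FUNCTION = {
    "hydraulic_production_conventional": "hydraulic_production",
    "hydraulic_production_pumped_storage": "hydraulic_production",
}


def condense_plant_function(value):
    """Return the condensed plant_function, or the original value if unmapped."""
    return CONDENSED_PLANT_FUNCTION.get(value, value)


condensed = [
    condense_plant_function(v)
    for v in ("hydraulic_production_pumped_storage", "transmission")
]
```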
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Listing of utility plant types.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Utility plant financial status (in service, future, leased, total).
The name of the type of value, derived from the XBRL fact name.
Account balance at end of year.
Freeform description of type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).
A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Indicates whether or not records with null or total values in the utility_type column were disaggregated. See documentation for process: pudl.output.ferc1.disaggregate_null_or_total_tag
Indicates whether or not records with null values in the in_rate_base column were disaggregated. See documentation for process: pudl.output.ferc1.disaggregate_null_or_total_tag
The name of the PUDL database table where a given record originated from.
out_ferc1__yearly_retained_earnings_sched118
Annual time series of utilities' statements of retained earnings.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 118)
- Primary key:
utility_id_ferc1, report_year, earnings_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Label describing types of earnings.
Account balance at beginning of year.
Account balance at end of year.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
out_ferc1__yearly_sales_by_rate_schedules_sched304
Annual time series of utilities' electric sales from all rate schedules in effect throughout the year.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 304)
- Primary key:
This table has no primary key.
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Values in rate_schedule_description are free-form strings.
Data prior to 2021 does not include information in columns: rate_schedule_type and billing_status.
Units of revenue_per_kwh are suspected to include a mix of dollars and possibly cents.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
The pre-2021 data in this table (extracted from FoxProDB vs. XBRL) is extremely unstructured. Where the post-2020 data (from XBRL) sorts the data into rate schedule types: residential, industrial, commercial, public_lighting, public_authorities, railroads, interdepartmental, provision_for_rate_refund, commercial_and_industrial, total, and billing status: billed, unbilled, total, the pre-2021 data stuffs all of that information (if you're lucky) into the rate_schedule_description column. There's no point trying to parse through the pre 2021 rate_schedule_description column en masse because it's just too messy. The contents of rate_schedule_description often contain numbers and acronyms that have little to no meaning out of context. The table is structured somewhat like the FERC1 small generators table with headings about rate structure type also embedded into the rate_schedule_description column. To all who dare, beware.
This table is a combination of one pre-2021 (DBF) table and nine post-2020 (XBRL) tables--one for each rate schedule type plus totals--hence the increase in data clarity post-2020. The rate_schedule_type and billing_status columns are only relevant for post-2020 data as they can be reliably parsed from each of the tables and incorporated into columns. The rate_schedule_description is supposed to contain sub-rate_schedule_type names for charges (Ex: Residential 1, Residential 2, etc.). However, the pre-2021 data contains a little bit of everything (or nothing) and the post-2020 has some totals or wonky data thrown in. That's to say, even when working with post-2020 data, be wary of aggregating the data. That's what the "total" rows are for.
The values that come from the totals table are marked with the string "total" in the rate_schedule_description column. The totals table is a product of the transition to XBRL, so these distinguishable totals are only available for data post-2020 (otherwise you could try keyword searching for "total" in rate_schedule_description). The total table contains two types of totals, the utility totals across all rate schedules in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "total") and each of the utility's individual rate schedule totals in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "residential" or any other rate schedule type).
The rate schedule based XBRL tables only report billed values whereas the total tables report billed, unbilled, and total values. (See the column description for more info on the difference between billed and unbilled). This is important to consider if you're endeavoring to compare the subtotal values with the total values. We have not attempted to fix or verify any subtotals or totals that don't add up.
Another important note is the possibility of unit discrepancies in certain columns. The revenue_per_kwh column does not specify reporting units, and closer inspection of the data reveals two clear peaks approximately two orders of magnitude apart. This indicates that values may be reported in both dollars and cents. However, because the price of energy per kWh varies so much regionally, we cannot guarantee which is which and have not put any cleaning mechanisms in place to account for this.
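Two of the points above lend themselves to a short sketch: picking out the utility-wide total rows, and looking for the two clusters in revenue_per_kwh. The helpers below are hypothetical and use plain Python with invented values. Bucketing by order of magnitude is just one simple way to see two peaks roughly two decades apart; it does not tell you which cluster is dollars and which is cents.

```python
import math
from collections import Counter


def is_utility_total(row):
    """True for post-2020 rows holding a utility's total across all rate schedules."""
    return (
        row["rate_schedule_description"] == "total"
        and row["rate_schedule_type"] == "total"
    )


def magnitude_histogram(values):
    """Count positive values by floor(log10(value)); None and non-positive are skipped."""
    return Counter(
        math.floor(math.log10(v)) for v in values if v is not None and v > 0
    )


# Invented values: two look like dollars per kWh, two like cents per kWh.
histogram = magnitude_histogram([0.05, 0.07, 5.0, 7.5, None])
```

With these made-up inputs the counts fall into the -2 and 0 buckets, which is the kind of two-decade gap the paragraph above describes.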
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Categorization of rate schedule type.
Whether an amount is billed, unbilled, or both. Billed amounts pertain to the exchange of energy and unbilled amounts pertain to other sources of revenue such as contracts with peaker plants to keep them on standby or charging rent to host cell antennas on transmission towers.
Free-form description of what the rate schedule name is. Not standardized. Often a sub-category of rate_schedule_type.
Quantity of electricity sold in MWh.
Dollar value of reported income, expense, asset, or liability.
Average number of customers per month.
kWh per customer.
The amount of revenue per kWh by rate schedule acquired in the given report year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
out_ferc1__yearly_small_plants_sched410
Annual time series of plant statistics for internal combustion plants, gas-turbine plants, conventional hydro plants, and pumped storage plants with less than 10 MW installed nameplate capacity and steam plants with less than 25 MW installed nameplate capacity.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 410)
- Primary key:
record_id. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
As reported on FERC Form 1 Schedule 410 (pages 410-411) and extracted from the FERC Visual FoxPro and XBRL. See our pudl.extract.ferc1.TABLE_NAME_MAP_FERC1 for links to the raw tables.
The raw version of this table is more like a digitized PDF than an actual data table. The rows contain lots of information in addition to what the columns might suggest. For instance, a single column may contain header rows, note rows, and total rows. This extraneous information is useful, but it prevents proper analysis when mixed in with the rest of the values data in the column. We employ a couple of data transformations to extract these rows from the data and preserve some of the information they contain (fuel type, plant type, FERC license, or general notes about the plant) in separate columns.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL plant ID. May not be constant over time.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Total installed (nameplate) capacity, in megawatts.
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Total cost of plant (USD).
Year the plant's oldest still operational unit was built.
Average fuel cost per mmBTU of heat content in nominal USD.
Type of fuel.
FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.
Net electricity generation for the specified period in megawatt-hours (MWh).
Production expenses: fuel (USD).
Production expenses: Maintenance (USD).
Production expenses: operations, supervision, and engineering (USD).
Total production expenses, excluding fuel (USD).
Total production expenses, excluding fuel (USD).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Type of plant.
out_ferc1__yearly_steam_plants_fuel_by_plant_sched402
Annual time series of FERC fuel data by plant.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)
- Primary key:
report_year, utility_id_ferc1, plant_name_ferc1
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL plant ID. May not be constant over time.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Coal cost as a percentage of overall fuel cost.
Coal heat content as a percentage of overall fuel heat content (mmBTU).
Total fuel cost for plant (USD).
Total heat content for plant (in MMBtu).
Natural gas cost as a percentage of overall fuel cost.
Natural gas heat content as a percentage of overall fuel heat content (MMBtu).
Nuclear cost as a percentage of overall fuel cost.
Nuclear heat content as a percentage of overall fuel heat content (MMBtu).
Oil cost as a percentage of overall fuel cost.
Oil heat content as a percentage of overall fuel heat content (MMBtu).
Primary fuel for plant as a percentage of cost.
Primary fuel for plant as a percentage of heat content.
Waste-heat cost as a percentage of overall fuel cost.
Waste-heat heat content as a percentage of overall fuel heat content (MMBtu).
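The per-fuel share columns above can be thought of as each fuel's cost (or heat content) divided by the plant total. The sketch below shows that arithmetic under stated assumptions: the function and the input values are hypothetical, shares are returned as fractions between 0 and 1 (multiply by 100 for a percentage), and picking the largest share as the primary fuel is an assumption, since the rule PUDL applies is not stated here.

```python
def fuel_shares(amounts_by_fuel):
    """Return each fuel's share of the total, or None for every fuel if the total is zero."""
    total = sum(amounts_by_fuel.values())
    if total == 0:
        return {fuel: None for fuel in amounts_by_fuel}
    return {fuel: amount / total for fuel, amount in amounts_by_fuel.items()}


def largest_share_fuel(shares):
    """Return the fuel with the largest share (assumed stand-in for 'primary fuel')."""
    known = {fuel: share for fuel, share in shares.items() if share is not None}
    return max(known, key=known.get) if known else None


# Invented plant-level fuel costs in USD.
shares = fuel_shares({"coal": 750.0, "gas": 250.0})
primary = largest_share_fuel(shares)
```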
out_ferc1__yearly_steam_plants_fuel_sched402
Annual time series of fuel cost and quantity for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)
- Primary key:
record_id
Usage Warnings
The fuel_type_code_pudl is inferred from a free-form string field.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is a subset of the steam plant table reported on page 402 of FERC Form 1.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL plant ID. May not be constant over time.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total cost of consumed fuel.
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Average fuel cost per mmBTU of heat content in nominal USD.
Average cost of fuel consumed in the report year per reported fuel unit (USD).
Average cost of fuel delivered in the report year per reported fuel unit (USD).
Heat content of the fuel in millions of Btus per physical unit.
Simplified fuel type code used in PUDL.
Reported unit of measure for fuel.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
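The record_id format given above can be split back into its parts. This is a minimal sketch assuming that only the table_name portion contains underscores, so the five trailing fields can be peeled off from the right. The example ID is invented, parse_record_id is a hypothetical helper, and IDs that do not follow this exact layout would need different handling.

```python
RECORD_ID_FIELDS = (
    "table_name",
    "report_year",
    "report_prd",
    "respondent_id",
    "spplmnt_num",
    "row_number",
)


def parse_record_id(record_id):
    """Split a record_id into named parts, splitting from the right.

    Assumes the last five fields contain no underscores; table names may.
    """
    parts = record_id.rsplit("_", len(RECORD_ID_FIELDS) - 1)
    if len(parts) != len(RECORD_ID_FIELDS):
        raise ValueError(f"unexpected record_id layout: {record_id!r}")
    return dict(zip(RECORD_ID_FIELDS, parts))


# Invented ID that follows the documented layout.
parsed = parse_record_id("f1_fuel_2019_12_123_0_4")
```

All parts come back as strings; convert report_year and the other numeric fields explicitly if you need integers.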
out_ferc1__yearly_steam_plants_sched402
Annual time series of plant statistics for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)
- Primary key:
record_id. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL plant ID. May not be constant over time.
Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Annual capital addition into capex_total.
Year-to-date capital addition into capex_total.
Annual capital addition into capex_total per kW.
Annual capital addition into capex_total per MW.
Year-to-date capital addition into capex_total per MW.
Annual capital addition into capex_total per MWh.
Year-to-date capital addition into capex_total per MWh.
Cost of plant: equipment (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Total cost of plant (USD) without retirements.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Plant capacity in MW when not limited by condenser water.
Allowances.
Maintenance of boiler (or reactor) plant.
Cost of coolants and water (nuclear plants only).
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: fuel (USD).
Production expenses: fuel (USD) per megawatt-hour (MWh).
Miscellaneous steam (or nuclear) expenses.
Maintenance of miscellaneous steam (or nuclear) plant.
Investments in non-fuel production expenses per MWh.
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Maintenance of electrical plant.
Total operating expenses.
Production expenses: rents (USD).
Steam expenses.
Steam from other sources.
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Steam transferred (Credit).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net plant capability in megawatts.
Hours the plant was connected to load while generating in the report year.
Type of plant.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Plant capacity in MW when limited by condenser water.
out_ferc1__yearly_transmission_lines_sched422
Annual time series of statistics about transmission lines.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 422)
- Primary key:
This table has no primary key. Each record of this table is supposed to represent one stretch of a transmission line, but there are no IDs and many nulls in the fields which would nominally distinguish unique transmission lines.
Usage Warnings
Contains columns which may appear categorical, but are actually free text.
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Information describing transmission lines, the cost of lines, annual operating and capital expenses, etc. This table includes transmission lines having a nominal voltage of 132 kilovolts or greater. Transmission lines below this voltage are required to be reported in group totals only for each voltage.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
The starting point of a transmission line.
The end point of a transmission line.
The operating voltage, expressed in kilovolts, for three-phase 60 cycle alternating current transmission lines.
Manufactured (designed) voltage, expressed in kilovolts, for three-phase 60 cycle alternating current transmission lines.
Supporting structure of the transmission line.
Length (in pole miles, or circuit miles if transmission lines are underground) for lines that are stand-alone structures (whose costs are reported on a stand-alone basis).
Length (in pole miles, or circuit miles if transmission lines are underground) for lines that are aggregated with other lines or structures (whose costs are aggregated and combined with other structures).
Number of circuits in a transmission line.
Size of transmission conductor and material of the transmission line.
Cost of plant: land and land rights (USD).
Other costs associated with the plant (USD).
Total cost of plant (USD).
Production expenses: operations, supervision, and engineering (USD).
Production expenses: Maintenance (USD).
Production expenses: rents (USD).
Total production expenses, excluding fuel (USD).
out_ferc1__yearly_utility_plant_summary_sched200
Annual time series of utility plant and accumulated provisions for depreciation, amortization and depletion of utility plant assets.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 200)
- Primary key:
utility_id_ferc1, report_year, utility_type, utility_plant_asset_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
A manually assigned PUDL utility ID. May not be stable over time.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).
Type of utility plant asset reported in the core_ferc1__yearly_utility_plant_summary_sched200 table. Assets include those leased to others, held for future use, construction work-in-progress and details of accumulated depreciation.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Account balance at end of year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
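The record ID format described above can be split back into its parts. The following is a minimal sketch, assuming that only table_name may contain underscores and that the five trailing fields never do; the helper name, the NamedTuple, and the sample ID are all hypothetical and are not part of PUDL.

```python
from typing import NamedTuple


class Ferc1RecordId(NamedTuple):
    """Hypothetical container for the six documented record ID fields."""

    table_name: str
    report_year: int
    report_prd: str
    respondent_id: str
    spplmnt_num: str
    row_number: str


def parse_record_id(record_id: str) -> Ferc1RecordId:
    """Split a record ID of the documented form into its six fields."""
    # table_name can itself contain underscores, so split from the right
    # and keep everything left of the last five separators as the name.
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"unexpected record ID layout: {record_id!r}")
    table_name, report_year, report_prd, respondent_id, spplmnt_num, row_number = parts
    return Ferc1RecordId(
        table_name, int(report_year), report_prd, respondent_id, spplmnt_num, row_number
    )


# Made-up ID, used only to illustrate the layout.
example = parse_record_id("f1_utility_plant_2019_12_134_0_3")
print(example.table_name, example.report_year, example.respondent_id)
```

Because the IDs are only unique within tables that are not row-mapped, the parsed fields are best treated as provenance hints and not as a join key.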
out_ferc714__georeferenced_respondents
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report
- Primary key:
respondent_id_ferc714, report_date
Usage Warnings
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
An annual summary of demand and other information about FERC-714 respondents.
This table differs from out_ferc714__summarized_demand in that it also includes a geometry column describing the respondent's service territory in each year. These service territories are based on the counties that the corresponding EIA-861 respondent reported serving in that year. There is sometimes ambiguity as to whether a FERC-714 respondent should be interpreted as an individual utility or a balancing authority. The respondent_type column indicates which type of entity has been assumed in determining the service territory from EIA-861 data.
Columns
Date reported.
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
Geospatial representation of the feature.
EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.
Whether a respondent to the FERC form 714 is a utility or a balancing authority.
Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
The EIA Utility Identification number.
The name of the utility.
County population, sourced from Census DP1 data.
County area in km2.
Annual electricity demand in a given report year.
out_ferc714__hourly_estimated_state_demand
Hourly time series of estimated electricity demand for each state, scaled such that it matches the total electricity sales by state reported in EIA 861.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report
- Primary key:
state_id_fips, datetime_utc
Usage Warnings
Contains estimated values.
Contains information from multiple raw inputs.
Large table; do not attempt to open with Excel.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table uses hourly electricity demand and information about service territories to estimate the total hourly electricity demand for each US state. Hourly demand is drawn from the FERC 714, which reports at the balancing authority and utility level. Service territories for utilities and balancing authorities are inferred from information in the EIA 861, including the counties served by each utility and the utilities that make up each balancing authority.
This table uses the total electricity sales by state reported in the EIA 861 as a scaling factor to ensure that the magnitude of electricity sales is roughly correct, and obtains the shape of the demand curve from the hourly planning area demand reported in the FERC 714.
This table includes the state FIPS codes - see out_censusdp1tract__states for additional information about FIPS state codes.
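The scaling idea described above can be illustrated with a short sketch: the hourly profile supplies the shape of the demand curve and the annual sales total supplies the magnitude. This is not PUDL's actual implementation; the function name, the argument names, and the use of a single factor for the whole period are assumptions made for illustration.

```python
def scale_hourly_demand(hourly_demand_mwh, annual_sales_mwh):
    """Rescale an hourly demand profile so that it sums to a sales total.

    Assumption: one scaling factor is applied to every hour of the period.
    """
    total = sum(hourly_demand_mwh)
    if total == 0:
        raise ValueError("cannot scale a profile whose total demand is zero")
    factor = annual_sales_mwh / total
    # The relative shape of the curve is unchanged; only its level moves.
    return [demand * factor for demand in hourly_demand_mwh]


# Toy four-hour "year" with made-up numbers.
hourly = [80.0, 100.0, 120.0, 100.0]
scaled = scale_hourly_demand(hourly, annual_sales_mwh=500.0)
print(scaled)
```

In this toy case the unscaled profile sums to 400 MWh, so every hour is multiplied by 1.25 to match the 500 MWh sales figure.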
Columns
Two digit state FIPS code.
Date and time converted to Coordinated Universal Time (UTC).
Electricity demand (energy) within a given timeframe.
Estimated electricity demand scaled by the total sales within a state.
out_ferc714__hourly_planning_area_demand
Hourly time series of electricity demand by planning area.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2a)
- Primary key:
respondent_id_ferc714, datetime_utc
Usage Warnings
Contains rows where missing values were imputed.
The datetime_utc timestamps have been cleaned due to inconsistent datetime reporting. See core_ferc714__hourly_planning_area_demand for additional details.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is based on core_ferc714__hourly_planning_area_demand, but adds imputed demand values where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
Date reported.
Date and time converted to Coordinated Universal Time (UTC).
IANA timezone name.
Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.
Code describing why a demand value was flagged for imputation.
out_ferc714__respondents_with_fips
Annual respondents with the county FIPS IDs for their service territories.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report
- Primary key:
This table has no primary key. Note that the state and county FIPS columns can be null. Otherwise the natural primary key would be: ['respondent_id_ferc714', 'report_date', 'county_id_fips']. The state FIPS code is not part of the primary key because the first two digits of the county FIPS code indicate the state.
Usage Warnings
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.
Whether a respondent to the FERC form 714 is a utility or a balancing authority.
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.
Date reported.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
The EIA Utility Identification number.
The name of the utility.
Two letter US state abbreviation.
County name.
Two digit state FIPS code.
County ID from the Federal Information Processing Standard Publication 6-4.
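As noted under the primary key for this table, the first two digits of a county FIPS code identify the state. A minimal sketch of that relationship follows, assuming the codes are stored as five-character strings so that leading zeros survive; the function name is hypothetical, and nulls are passed through because both FIPS columns can be null.

```python
from typing import Optional


def state_fips_from_county_fips(county_id_fips: Optional[str]) -> Optional[str]:
    """Return the two-digit state FIPS prefix of a five-digit county FIPS code."""
    if county_id_fips is None:
        # The FIPS columns in this table are nullable, so nulls pass through.
        return None
    if len(county_id_fips) != 5 or not county_id_fips.isdigit():
        raise ValueError(f"not a five-digit county FIPS code: {county_id_fips!r}")
    return county_id_fips[:2]


# 08031 is Denver County, Colorado; Colorado's state FIPS code is 08.
print(state_fips_from_county_fips("08031"))
```

Treating the codes as integers would turn 08031 into 8031 and break the prefix rule, which is why the sketch insists on strings.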
out_ferc714__summarized_demand
Summarized demand statistics and FERC-714 respondent attributes by respondent-year.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report
- Primary key:
respondent_id_ferc714, report_date
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Date reported.
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
Annual electricity demand in a given report year.
County population, sourced from Census DP1 data.
County area in km2.
Average population per sq. km area of a service territory.
Per-capita annual demand, averaged using Census county-level population estimates.
Annual demand per km2 of a given service territory.
EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.
Whether a respondent to the FERC form 714 is a utility or a balancing authority.
Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
The EIA Utility Identification number.
The name of the utility.
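The three derived statistics in this table (population density, per-capita demand, and demand per unit area) are simple ratios of annual demand, population, and area. The sketch below shows that arithmetic under stated assumptions: the function name, the dictionary keys, and the choice of MWh as the demand unit are illustrative and are not confirmed by the descriptions above.

```python
def summarize_service_territory(demand_annual_mwh, population, area_km2):
    """Compute density and intensity ratios for one respondent-year.

    Returns None for a ratio whose denominator is missing or zero.
    """

    def ratio(numerator, denominator):
        if numerator is None or not denominator:
            return None
        return numerator / denominator

    return {
        "population_density_km2": ratio(population, area_km2),
        "demand_annual_per_capita_mwh": ratio(demand_annual_mwh, population),
        "demand_density_mwh_km2": ratio(demand_annual_mwh, area_km2),
    }


# Made-up respondent: 1,000,000 MWh, 200,000 people, 5,000 km2.
stats = summarize_service_territory(1_000_000.0, 200_000.0, 5_000.0)
print(stats)
```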
out_gridpathratoolkit__hourly_available_capacity_factor
Hourly time series of capacity factors defining the capacity available from an aggregated group of generators.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
GridPath Resource Adequacy Toolkit Data
- Primary key:
datetime_utc, aggregation_group
Usage Warnings
Contains estimated values.
The data is no longer being collected or reported in this way.
Additional Details
This table contains a mix of profiles representing whole regions and individual plants, where the individual plants are hybrid wind or solar plus electricity storage facilities. Capacity factors are stated as a fraction of the aggregate nameplate capacity of the group.
For the hybrid facilities, the capacity factor represents the available output of only the renewable generators. Estimates of the generation profiles have been extended across a longer range of dates than the original data. For wind, the years 2015-2020 are synthesized data and for solar 2020 is synthesized. See the Appendix of the GridPath Resource Adequacy Toolkit report for details.
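Because the capacity factors are fractions of the group's aggregate nameplate capacity, an hourly available-capacity series follows from a single multiplication. The sketch below assumes the aggregate nameplate capacity is obtained elsewhere, since it is not a column of this table; the function and variable names are hypothetical.

```python
def available_capacity_mw(capacity_factors, aggregate_nameplate_mw):
    """Convert hourly capacity factors into available capacity in MW."""
    # Each factor is a fraction of the group's aggregate nameplate capacity.
    return [factor * aggregate_nameplate_mw for factor in capacity_factors]


# Made-up three-hour profile for a hypothetical 200 MW aggregation group.
profile = [0.0, 0.25, 0.5]
print(available_capacity_mw(profile, aggregate_nameplate_mw=200.0))
```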
Columns
Date and time converted to Coordinated Universal Time (UTC).
A label identifying a group of aggregated generator capacity factors.
Fraction of potential generation that was actually reported for a plant part.
out_pudl__yearly_assn_eia_ferc1_plant_parts
Annual association table providing connections between power plant data reported in FERC Form 1 and any available EIA data related to the plant parts covered by each FERC Form 1 record.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
record_id_ferc1
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table answers the question "What EIA data reported about plants or generators should be associated with a given plant record found in the FERC Form 1?"
Each record in this table corresponds to a single FERC Form 1 record reported in one of several tables describing power plants (large steam, hydro, small, etc.). These FERC records can correspond to an entire plant, individual generators within a plant, all generators in a plant with the same prime mover type, or just the respondent's ownership share of any of those categories (or other categories). Furthermore, the same utility may report the same plant in different ways in different years.
The EIA data associated with each FERC plant record comes from our Plant Parts EIA table, out_eia__yearly_plant_parts. The EIA data in each record represents an aggregation of several slices of an EIA plant, across both physical characteristics and utility ownership.
This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
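The 70% rule can be sketched as follows. This is an illustration of the rule as worded here, not PUDL's harvesting code: the function name is hypothetical, and excluding nulls from the denominator is an assumption.

```python
from collections import Counter


def canonical_value(reported_values, min_share=0.7):
    """Pick the most common reported value if it is consistent enough.

    Assumption: null (None) reports are ignored when computing the share.
    """
    values = [value for value in reported_values if value is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    # Below the threshold the attribute is treated as inconsistent.
    return value if count / len(values) >= min_share else None


# Three of four reports agree (75%), so the common value is kept.
print(canonical_value(["NG", "NG", "NG", "DFO"]))
# Two of three agree (about 67%), so the result is null.
print(canonical_value(["NG", "NG", "DFO"]))
```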
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Identifier for EIA plant parts analysis records.
Indicates the source and validation of the match between EIA and FERC. Match types include matches generated by the model, verified by the training data, overridden by the training data, etc.
Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.
The part of the plant a record corresponds to.
Four-digit year in which the data was reported.
Date reported.
Whether each generator record is for one owner or represents a total of all ownerships.
Plant name.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Code for the type of prime mover (e.g. CT, CG).
The code representing the most predominant type of energy that fuels the generator.
High level description of the technology used by the generator to produce electricity.
Name of FERC account, derived from technology description and prime mover code.
Year a generator went into service.
The EIA Utility Identification number.
A manually assigned PUDL utility ID. May not be stable over time.
Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.
Plant part of the associated true granularity record.
EIA record ID of the associated true granularity record.
Number of distinct generator IDs that participated in the aggregation for a plant part list record.
Proportion of generator ownership attributable to this utility.
Whether a plant part record has a duplicate record with different ownership status.
The operating status of the asset. For generators this is based on which tab the generator was listed in in EIA 860.
The operating status of the asset using PUDL categories.
A manually assigned PUDL plant ID. May not be constant over time.
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Average fuel cost per mmBTU of heat content in nominal USD.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total installed (nameplate) capacity, in megawatts.
Fraction of potential generation that was actually reported for a plant part.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Fuel content per unit of electricity generated. Coming from MCOE calculation.
Simplified fuel type code used in PUDL
Year the plant's most recently built unit was installed.
Contains EIA plant ID, plant part, ownership, and EIA utility id
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Asset retirement cost (USD).
The average number of employees assigned to each plant.
Fraction of potential generation that was actually reported for a plant part.
Total installed (nameplate) capacity, in megawatts.
Annual capital addition into capex_total.
Year-to-date capital addition into capex_total.
Annual capital addition into capex_total per kw.
Annual capital addition into capex_total per MW.
Year-to-date capital addition into capex_total per MW.
Annual capital addition into capex_total per MWh.
Year-to-date capital addition into capex_total per MWh.
Cost of plant: equipment (USD).
Cost of plant: land and land rights (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Cost of plant: structures and improvements (USD).
Total cost of plant (USD).
Total cost of plant (USD) without retirements.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Net electricity generation for the specified period in megawatt-hours (MWh).
Plant capacity in MW when not limited by condenser water.
Allowances.
Maintenance of boiler (or reactor) plant.
Cost of coolants and water (nuclear plants only).
Production expenses: electric expenses (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: fuel (USD).
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.
Miscellaneous steam (or nuclear) expenses.
Maintenance of miscellaneous steam (or nuclear) plant.
Investments in non-fuel production expenses per MWh.
Production expenses: operations, supervision, and engineering (USD).
Total production expenses (USD per MWh generated).
Production expenses: maintenance of electric plant (USD).
Total operating expenses.
Production expenses: rents (USD).
Steam expenses.
Steam from other sources.
Production expenses: maintenance of structures (USD).
Total production expenses, excluding fuel (USD).
Steam transferred (Credit).
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net plant capability in megawatts.
Hours the plant was connected to load while generating in the report year.
Type of plant.
Plant capacity in MW when limited by condenser water.
Average fuel cost per mmBTU of heat content in nominal USD.
Type of fuel.
FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.
Production expenses: Maintenance (USD).
Total production expenses, excluding fuel (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: roads, railroads, and bridges (USD).
Net plant capability under the least favorable operating conditions, in megawatts.
Net plant capability under the most favorable operating conditions, in megawatts.
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: hydraulic expenses (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Production expenses: water for power (USD).
The FERC license ID of a project.
Cost of plant: accessory electric equipment (USD).
Cost of plant: miscellaneous power plant equipment (USD).
Cost of plant: water wheels, turbines, and generators (USD).
Energy used for pumping, in megawatt-hours.
Net output for load (net generation - energy used for pumping) in megawatt-hours.
Total production expenses before pumping (USD).
Production expenses: pumped storage (USD).
Production expenses: pumping (USD).
Total annual reported fuel costs for the plant part. Includes costs from all fuels.
Total annual heat content of fuel consumed by a plant part record in the plant parts list.
Simplified fuel type code used in PUDL
Fuel content per unit of electricity generated. Calculated from FERC reported fuel consumption and net generation.
out_sec10k__changelog_company_name
Changelog table tracking changes in the names each SEC 10-K filer has used.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
This table has no primary key.
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
We use the company name reported in association with each name change block in the company information table to fill in the most recent value of company_name_new. Roughly 1,000 reported "name changes" in which the old and new names were identical have been dropped.
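The clean-up step of dropping "name changes" whose old and new names match can be sketched as a simple filter. The column name company_name_new comes from the text above; company_name_old, the function name, and the use of an exact string comparison are assumptions for illustration.

```python
def drop_noop_name_changes(changes):
    """Keep only records whose old and new company names actually differ."""
    return [
        change
        for change in changes
        if change["company_name_old"] != change["company_name_new"]
    ]


# Made-up changelog records.
changelog = [
    {"company_name_old": "Acme Power Co", "company_name_new": "Acme Energy Inc"},
    {"company_name_old": "Beta Electric", "company_name_new": "Beta Electric"},
]
print(drop_noop_name_changes(changelog))
```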
Columns
Identifier of the company in SEC database.
Date of last name change of the company.
Name of company prior to name change.
Name of company after name change.
out_sec10k__parents_and_subsidiaries
A denormalized table containing information about parent companies that file SEC Form 10-K and their subsidiaries, which may or may not file Form 10-K.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k, subsidiary_company_id_sec10k
Usage Warnings
This data has been extracted probabilistically using a machine learning model and contains some incompletions and errors. It should not be treated as ground truth data.
PUDL does not currently update its copy of this data.
Additional Details
Company ownership fractions are extracted from SEC 10-K Exhibit 21. Information about the companies is extracted primarily from the headers of the SEC 10-K filing. Subsidiaries that file Form 10-K will have much more information available than those that only appear as subsidiaries in Exhibit 21.
SEC 10-K filers and EIA utilities are matched using probabilistic record linkage. Exhibit 21 subsidiaries that don't file a Form 10-K are matched to EIA utilities using the company name.
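The filename_sec10k key column stores only the unique portion of each filing's EDGAR filename. Per its column description, the full source URL is rebuilt by prepending the EDGAR archive prefix and appending the .txt extension. A minimal sketch follows; the function name and the sample filename are hypothetical.

```python
EDGAR_BASE_URL = "https://www.sec.gov/Archives/edgar/data/"


def sec10k_source_url(filename_sec10k: str) -> str:
    """Rebuild the full EDGAR URL from the unique filename portion."""
    return f"{EDGAR_BASE_URL}{filename_sec10k}.txt"


# Made-up filename portion, used only to show the string layout.
print(sec10k_source_url("1234567/0001234567-20-000001"))
```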
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Name of subsidiary company.
Location of subsidiary company. This is the full US state name or country name and occasionally a two digit code that was not mapped to a full name during cleaning.
PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.
Proportion of generator ownership attributable to this utility.
Central index key (CIK) of the parent company.
Name of the parent company.
Date on which the filing was submitted.
Date reported.
Phone number of the parent company.
City where the parent company's place of business is located.
State where the parent company's place of business is located.
Street address of the parent company's place of business.
Second line of the street address of the parent company's place of business.
Zip code of the parent company's place of business.
Zip code suffix of the parent company's place of business.
City of the parent company's mailing address.
State of the parent company's mailing address.
Street portion of the parent company's mailing address.
Second line of the street portion of the parent company's mailing address.
Zip code of the parent company's mailing address.
Zip code suffix of the parent company's mailing address.
Two letter state code where parent company is incorporated.
The EIA utility ID of the parent company.
The EIA reported utility name of the parent company.
Text description of the parent company's Standard Industrial Classification (SIC).
Four-digit Standard Industrial Classification (SIC) code identifying the parent company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.
Taxpayer ID of the parent company with the IRS.
Central index key (CIK) of the subsidiary company.
Phone number of the subsidiary company.
City where the subsidiary company's place of business is located.
State where the subsidiary company's place of business is located.
Street address of the subsidiary company's place of business.
Second line of the street address of the subsidiary company's place of business.
Zip code of the subsidiary company's place of business.
Zip code suffix of the subsidiary company's place of business.
City of the subsidiary company's mailing address.
State of the subsidiary company's mailing address.
Street portion of the subsidiary company's mailing address.
Second line of the street portion of the subsidiary company's mailing address.
Zip code of the subsidiary company's mailing address.
Zip code suffix of the subsidiary company's mailing address.
Two letter state code where subsidiary company is incorporated.
The EIA utility ID of the subsidiary company.
The EIA reported utility name of the subsidiary company.
Text description of the subsidiary company's Standard Industrial Classification (SIC).
Four-digit Standard Industrial Classification (SIC) code identifying the subsidiary company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.
Taxpayer ID of the subsidiary company with the IRS.
out_sec10k__quarterly_company_information
Quarterly time series of company and filing data extracted from SEC 10-K filings.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k, central_index_key
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
In addition to the information provided by the core_sec10k__quarterly_company_information table, this output table merges in the associated utility_id_eia (and utility name) if available, the report and filing dates of the filing each record was extracted from, and a link to the source URL for the filing.
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Identifier of the company in SEC database.
A counter indicating which observation of company data within an SEC 10-K filing header the record pertains to.
The EIA Utility Identification number.
The name of the utility.
Date reported.
Date on which the filing was submitted.
Name of company submitting SEC 10-K filing.
The end date of an SEC filing company's fiscal year, in MMDD format.
Taxpayer ID of the company with the IRS.
Two letter state code where company is incorporated.
Text description of Standard Industrial Classification (SIC).
A higher level industry category defined within the Standard Industrial Classification (SIC) system.
Four-digit Standard Industrial Classification (SIC) code identifying the company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.
Document control number used in the SEC EDGAR database. The first four digits can be used to access scans of the document in the SEC's Virtual Private Reference Room.
Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.
SEC Act through which the form was enacted, e.g. 1934 act.
Filing number used internally by the SEC to track the filing.
Phone number for utility contact 1.
Street address of the company's place of business.
Second line of the street address of the company's place of business.
City where the company's place of business is located.
State where the company's place of business is located.
Zip code of the company's place of business.
Zip code suffix of the company's place of business.
Non-US postal code of the company's place of business.
Street portion of the company's mailing address.
Second line of the street portion of the company's mailing address.
City of the company's mailing address.
State of the company's mailing address.
Zip code of the company's mailing address.
Zip code suffix of the company's mailing address.
Non-US postal code of the company's mailing address.
URL pointing to the original source of the data in the record.
out_sec10k__quarterly_filings
Quarterly time series of metadata describing SEC 10-K filings.
- Processing:
Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
Each SEC 10-K filing is submitted by a single company, but may contain information about numerous other companies. This table indicates the company submitting the filing, as well as some information about the overall filing. Each filing is guaranteed to have a unique filename, but ~1% of all filings are one company submitting the same form multiple times on the same day, so the filename is the only available natural primary key. This output table adds a link to the source URL for the filing, which is constructed from the filename.
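The URL construction described above can be sketched in a few lines of Python. This is a minimal illustration, not PUDL's own implementation: the function name source_url and the example filename are hypothetical, and only the prefix and the .txt extension come from the column description.

```python
# Prefix given in the filename_sec10k column description.
EDGAR_PREFIX = "https://www.sec.gov/Archives/edgar/data/"


def source_url(filename_sec10k: str) -> str:
    """Rebuild the EDGAR URL from the unique portion of the filename."""
    return f"{EDGAR_PREFIX}{filename_sec10k}.txt"


# Hypothetical filename, used only to show the shape of the result.
example = source_url("0000000/0000000000-00-000000")
print(example)
```

Because the filename is the primary key, a URL built this way should identify exactly one filing.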
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Identifier of the company in SEC database.
Name of company submitting SEC 10-K filing.
Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.
Date on which the filing was submitted.
Version of exhibit 21 submitted (if applicable).
Date reported.
URL pointing to the original source of the data in the record.
out_vcerare__hourly_available_capacity_factor
Hourly time series of estimated county-averaged capacity factors for wind and solar generating facilities across the contiguous United States (US), to be used as a tool and input for resource adequacy modeling and planning.
- Processing:
Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.
- Source:
Vibrant Clean Energy Resource Adequacy Renewable Energy (RARE) Power Dataset
- Primary key:
state, place_name, datetime_utc
Usage Warnings
The hourly capacity factors are normalized to unity for maximal power output. To convert to units of power, the user must multiply by the installed capacity within the county.
Hourly capacity factors are spatially averaged across each county over the contiguous USA. There are a handful of counties that are too small to pick up representation on the HRRR operational forecast grid. As such, these counties will have no wind or solar power production curves.
Due to power production performance being correlated with panel temperatures, during cold sunny periods, some solar capacity factor values are greater than 1 (but less than 1.1).
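The conversion from capacity factor to power mentioned in the first warning above is a single multiplication. The sketch below assumes the installed capacity is known in MW; the function name and the sample values are illustrative and do not come from the dataset.

```python
def capacity_factor_to_mw(capacity_factor: float, installed_capacity_mw: float) -> float:
    """Scale a normalized capacity factor by the county's installed capacity."""
    return capacity_factor * installed_capacity_mw


# Illustrative hourly capacity factors for one county with 200 MW installed.
# Solar values slightly above 1 are possible, so no clipping is applied here.
hourly_capacity_factors = [0.0, 0.25, 0.5]
hourly_power_mw = [capacity_factor_to_mw(cf, 200.0) for cf in hourly_capacity_factors]
print(hourly_power_mw)
```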
Additional Details
The data in this table were produced by Vibrant Clean Energy, and are licensed to the public under the Creative Commons Attribution 4.0 International license (CC-BY-4.0).
The technologies provided are:
Onshore wind assuming a 100m hub height and 120m rotor diameter;
Offshore wind assuming a 140m hub height and 120m rotor diameter;
Utility solar assuming a fixed axis panel tilted at latitude.
The foundation of the capacity factors provided here is the NOAA HRRR operational numerical weather prediction model. The HRRR covers the entire contiguous US at a horizontal resolution of 3 km. Forecasts are initialized each hour of the year. Forecast hour two (2) is used as the input data for the power algorithms. This forecast hour is chosen to trade off the impact of the measurement and data assimilation procedure of the HRRR against the physics of the model to derive the most complete picture of the atmosphere at the forecast time horizon.
For wind capacity factors: vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.
For solar capacity factors: pertinent surface weather variables are pulled such as incoming short wave radiation, direct normal irradiance (calculated in the HRRR 2016 forward), surface temperature and other parameters. These are used in a non-linear I-V curve translation to power capacity factors.
Columns
Two letter US state abbreviation.
County or lake name, sourced from the latest Census PEP vintage based on county FIPS ID. Lake names originate from VCE RARE directly, and a lake may appear several times, once for each state it touches. FIPS ID values for lakes have been nulled.
Date and time converted to Coordinated Universal Time (UTC).
Four-digit year in which the data was reported.
Integer between 1 and 8760 representing the hour in a given year.
County ID from the Federal Information Processing Standard Publication 6-4.
Latitude of the place centroid (e.g., county centroid).
Longitude of the place centroid (e.g., county centroid).
Estimated capacity factor (0-1) calculated for solar PV assuming a fixed axis panel tilted at latitude and DC power outputs. Due to power production performance being correlated with panel temperatures, during cold sunny periods, some solar capacity factor values are greater than 1 (but less than 1.1). All values are based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Pertinent surface weather variables are pulled such as incoming short wave radiation, direct normal irradiance (calculated in the HRRR 2016 forward), surface temperature and other parameters. These are used in a non-linear I-V curve translation to power capacity factors.
Estimated capacity factor (0-1) calculated for onshore wind assuming a 100m hub height and 120m rotor diameter. Based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.
Estimated capacity factor (0-1) calculated for offshore wind assuming a 140m hub height and 120m rotor diameter. Based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.
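For readers who want to relate the UTC timestamp column to the hour-of-year column described above, the following sketch computes a 1-based hour index from a timezone-aware datetime. It rests on assumptions: that the index counts whole hours since the start of the calendar year in UTC, and that leap days are not handled specially. The table itself may be built differently.

```python
from datetime import datetime, timezone


def hour_of_year(ts: datetime) -> int:
    """Return the 1-based hour index of a timestamp within its calendar year."""
    start_of_year = datetime(ts.year, 1, 1, tzinfo=ts.tzinfo)
    elapsed_hours = int((ts - start_of_year).total_seconds() // 3600)
    return elapsed_hours + 1


first_hour = hour_of_year(datetime(2023, 1, 1, 0, tzinfo=timezone.utc))
# In a 365-day year the last hour is 365 * 24.
last_hour = hour_of_year(datetime(2023, 12, 31, 23, tzinfo=timezone.utc))
print(first_hour, last_hour)
```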
core_eia176__yearly_gas_disposition
Annual time series of a company's natural and supplemental gas disposition for the report state.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition (Part 6, Lines 9, 12-20)
- Primary key:
operator_id_eia, report_year
Additional Details
The deliveries_out_of_state_volume_mcf (Line 14.0) are reported as one aggregated volume, calculated by summing the original granular data. Similarly, disposition_to_other_volume_mcf (Line 18.4) is summed from the original granular data and reported as one aggregate field.
The delivered_gas_heat_content_mmbtu_per_mcf is expected to be between 0.8 and 1.2 by the EIA. We find that less than 0.5 percent of data falls outside of these expected bounds.
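A simple way to flag values outside the expected range is sketched below. The bounds 0.8 and 1.2 come from the paragraph above; the function name and the sample values are hypothetical, and treating nulls as "not flagged" is an assumption.

```python
from typing import Optional


def heat_content_out_of_bounds(
    value: Optional[float], low: float = 0.8, high: float = 1.2
) -> bool:
    """Return True when a reported heat content falls outside [low, high]."""
    if value is None:
        # Missing values are not flagged in this sketch.
        return False
    return not (low <= value <= high)


# Illustrative values in MMBtu per Mcf, not taken from the table.
samples = [1.03, 0.75, None, 1.25]
flags = [heat_content_out_of_bounds(v) for v in samples]
print(flags)
```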
The total_disposition_mcf field includes disposition to consumers which is reported in core_eia176__yearly_gas_disposition_by_consumer. Note that the reported total disposition and the sum of values in this table and core_eia176__yearly_gas_disposition_by_consumer often don't match as would be expected. Thus, we have preserved both the total field and the sub-components in these two tables.
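To see how far a reported total departs from its sub-components, one could compute the gap directly. This is a sketch under assumptions: the function name is hypothetical, the component list stands in for the relevant volume columns of both tables, and nulls are skipped rather than treated as zero reports.

```python
from typing import Iterable, Optional


def disposition_gap(
    total_disposition_mcf: float, component_volumes_mcf: Iterable[Optional[float]]
) -> float:
    """Reported total minus the sum of the non-null component volumes."""
    known = [v for v in component_volumes_mcf if v is not None]
    return total_disposition_mcf - sum(known)


# Illustrative numbers only: a reported total of 1000 Mcf against components
# that sum to 950 Mcf leaves a gap of 50 Mcf.
gap = disposition_gap(1000.0, [400.0, 250.0, None, 300.0])
print(gap)
```

A non-zero gap is not necessarily an error in either table, which is why both the total and the components are preserved.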
Columns
The unique EIA identifier for an operator in a given state. The last two letters of the ID indicate the state.
Four-digit year in which the data was reported.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
The average annual heat content of gas delivered directly to consumers.
Volume of natural gas consumed as fuel for space heat of the operator's own facilities within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Volume of natural gas consumed as fuel for the operator's new pipeline fill within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Volume of natural gas consumed as fuel for the operator's own pipeline distribution or storage compressor use within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Volume of natural gas consumed by the operator for vaporization, liquefaction, and LNG fuel within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Volume of natural gas used in the operator's company-owned fleet within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Volume of natural gas consumed as fuel by the operator for other purposes within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Free-text detail describing the operator’s specified 'other purposes' for operational natural-gas consumption, corresponding to the volume reported in operational_consumption_other_mcf.
The total volume added to underground storage operated by the company within the report state, regardless of ownership of the gas. This includes new fields. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
The total volume added to LNG storage operated by the company within the report state, regardless of ownership of the gas. For LNG import and export marine terminals, this should exclude injections of LNG during the course of routine operations for handling imports, as well as LNG held in storage for future use. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
The total volume of gas used in the company's well, field and lease operations. This field should only be reported by producers. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
The volume of gas delivered directly from the operator's system to oil or gas fields located within the report state for repressuring, pressure maintenance, and cycling operations. This field should only be reported by producers. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Disposition to distribution companies within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Disposition to storage operators within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Disposition to other pipelines within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Total volume of the operator's deliveries across or to state lines or U.S. borders. This has been summed from the detailed data reported by each company on Line 14.0 of the original form in order to preserve the primary key of the table. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Other disposition within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. This has been summed from the detailed data reported by each company on Line 18.4 of the original form in order to preserve the primary key of the table. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Total disposition within the report state, as reported by the operator. This includes disposition to consumers which is reported in core_eia176__yearly_gas_disposition_by_consumer. Note that the reported total disposition and the sum of values in this table and core_eia176__yearly_gas_disposition_by_consumer often don't match as would be expected. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
Known loss volumes as a result of leaks, damage, accidents, migration and blow down within the report state where the events took place, as well as estimated losses from leaks occurring during distribution activities. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
The difference between gas supply and disposition. A positive entry indicates supply in excess of accounted-for disposition. A negative entry indicates accounted-for disposition exceeds reported supply. This is calculated as the difference between Part 4 Line 7.0 and Part 6 Line 19.0, and is reported as Line 20.0 in the original form.
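The sign convention for the supply and disposition difference described above can be illustrated as follows. The function and argument names are hypothetical; the arguments stand for Part 4 Line 7.0 (supply) and Part 6 Line 19.0 (disposition).

```python
def supply_disposition_difference(total_supply_mcf: float, total_disposition_mcf: float) -> float:
    """Supply minus disposition, matching the sign convention of Line 20.0."""
    return total_supply_mcf - total_disposition_mcf


def describe(difference_mcf: float) -> str:
    """Translate the sign of the difference into words."""
    if difference_mcf > 0:
        return "supply exceeds accounted-for disposition"
    if difference_mcf < 0:
        return "accounted-for disposition exceeds reported supply"
    return "supply and disposition balance"


# Illustrative volumes only.
diff = supply_disposition_difference(1200.0, 1150.0)
print(diff, describe(diff))
```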
core_eia176__yearly_gas_disposition_by_consumer
Annual time series of a company's natural gas deliveries to end-use consumers within the report state.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition (Part 6, Lines 10.0-11.6)
- Primary key:
report_year, operator_id_eia, customer_class, revenue_class
Columns
Four-digit year in which the data was reported.
The unique EIA identifier for an operator in a given state. The last two letters of the ID indicate the state.
State that the operator is reporting for.
High level categorization of customer type (e.g., commercial, residential).
Source of revenue: whether revenue originates from gas owned directly by the operator (sales) or gas transported by the operator (transport).
Number of end-use consumers within the report state.
Revenue including taxes, rounded to the nearest whole dollar.
Total volume of natural gas deliveries in the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
core_eia860__assn_boiler_cooling
Association table providing connections between EIA boiler IDs and EIA cooling system IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
report_date, plant_id_eia, boiler_id, cooling_id_eia
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
EIA Identification code for cooling system (if multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia860__assn_boiler_generator
Association table providing connections between boilers and generators.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 6, Part A)
- Primary key:
plant_id_eia, report_date, generator_id, boiler_id
Usage Warnings
Contains information from multiple raw inputs.
Additional Details
Augmented with various heuristics within PUDL.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Alphanumeric boiler ID.
EIA-assigned unit identification code.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Indicates whether boiler associations with generator during the year were actual or theoretical. Only available before 2013.
Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.
The source from which the unit_id_pudl is compiled. The unit_id_pudl comes directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is employed to find additional boiler generator connections).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
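The warning in the generator ID description above, to treat the ID as a string, matters in practice because numeric parsing can merge or reject IDs. The sketch below uses made-up rows and Python's standard csv module, which reads every field as text.

```python
import csv
import io

# Made-up rows for illustration; these are not real EIA records.
raw = "plant_id_eia,generator_id\n3,01\n3,1\n3,GT1\n"
rows = list(csv.DictReader(io.StringIO(raw)))

# Read as strings, the three IDs stay distinct.
ids_as_text = [row["generator_id"] for row in rows]


def try_int(value: str):
    """Return the integer form of value, or None if it is not numeric."""
    try:
        return int(value)
    except ValueError:
        return None


# Cast to integers, "01" and "1" collapse and "GT1" cannot be parsed at all.
ids_as_int = [try_int(value) for value in ids_as_text]
print(ids_as_text, ids_as_int)
```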
core_eia860__assn_boiler_stack_flue
Association table providing connections between EIA boiler IDs and EIA stack and/or flue system IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
report_date, plant_id_eia, boiler_id, stack_flue_id_pudl
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
The stack identification value reported to EIA. Stacks or chimneys are the place where emissions from the combustion process are released into the atmosphere. This field was reported in conjunction with flue_id_eia until 2013 when stack_flue_id_eia took their place.
The flue identification value reported to EIA. The flue is a duct, pipe, or opening that transports exhaust gases through the stack. This field was reported in conjunction with stack_id_eia until 2013 when stack_flue_id_eia took their place.
The stack or flue identification value reported to EIA. This denotes the place where emissions from the combustion process are released into the atmosphere. Prior to 2013, this was reported as stack_id_eia and flue_id_eia.
A stack and/or flue identification value created by PUDL for use as part of the primary key for the stack flue equipment and boiler association tables. For 2013 and onward, this value is equal to the value for stack_flue_id_eia. Prior to 2013, this value is equal to the value for stack_id_eia and the value for flue_id_eia separated by an underscore or just the stack_flue_eia in cases where flue_id_eia is NA.
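The rule for the PUDL stack/flue ID described above can be sketched as a small function. This is one reading of the description, not PUDL's actual code: it assumes that when the flue ID is missing before 2013 the stack ID alone is used, and it represents missing values with None.

```python
from typing import Optional


def stack_flue_id_pudl(
    report_year: int,
    stack_flue_id_eia: Optional[str],
    stack_id_eia: Optional[str],
    flue_id_eia: Optional[str],
) -> Optional[str]:
    """Build a combined stack/flue ID following the rule in the description."""
    if report_year >= 2013:
        # From 2013 onward the single reported ID is used directly.
        return stack_flue_id_eia
    if flue_id_eia is None:
        # Assumption: fall back to the stack ID alone when no flue ID exists.
        return stack_id_eia
    return f"{stack_id_eia}_{flue_id_eia}"


# Illustrative IDs only.
print(stack_flue_id_pudl(2010, None, "S1", "F2"))
print(stack_flue_id_pudl(2010, None, "S1", None))
print(stack_flue_id_pudl(2015, "SF9", None, None))
```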
core_eia860__assn_yearly_boiler_emissions_control_equipment
Association table providing connections between EIA boiler IDs and emissions control IDs for NOx, SO2, mercury, and particulate monitoring.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
report_date, plant_id_eia, boiler_id, emission_control_id_type, emission_control_id_eia
Usage Warnings
The relationship between the IDs is sometimes many to many.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
The type of emissions control id: SO2, NOx, particulate, or mercury.
The emission control ID used to collect SO2, NOx, particulate, and mercury emissions data. This column should be used in conjunction with emission_control_id_type as it's not guaranteed to be unique.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia860__scd_boilers
Slowly changing dimension (SCD) table describing attributes of boilers which may vary from year to year. Compiled from across all EIA-860 data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
plant_id_eia, boiler_id, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for boilers are set. It contains values which are expected to vary slowly, while core_eia__entity_boilers contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
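The 70% consistency rule described above can be illustrated with a short sketch. It is not PUDL's harvesting code: the function name is hypothetical, and excluding nulls from the count of reported instances is an assumption.

```python
from collections import Counter
from typing import Hashable, Iterable, Optional


def canonical_value(
    reported: Iterable[Optional[Hashable]], min_share: float = 0.7
) -> Optional[Hashable]:
    """Pick the most common reported value if it meets the consistency threshold."""
    values = [v for v in reported if v is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    # Null signals that no single value was reported consistently enough.
    return value if count / len(values) >= min_share else None


# Illustrative inputs: 3 of 4 instances agree (75%), then only 2 of 4 (50%).
print(canonical_value(["OP", "OP", "OP", "SB"]))
print(canonical_value(["OP", "OP", "SB", "SB"]))
```

The second call returns None, which corresponds to the null value that appears when an attribute was reported inconsistently.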
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
Date reported.
Date the boiler began or is planned to begin commercial operation.
EIA short code identifying boiler operational status.
Date of the scheduled or effected retirement of the boiler.
EIA short code indicating the standards under which the boiler is operating as described in the U.S. EPA regulation under 40 CFR.
EIA short code indicating the type of firing used by this boiler.
EIA short code indicating the type of firing used by this boiler.
EIA short code indicating the type of firing used by this boiler.
Design firing rate at maximum continuous steam flow for coal to the nearest 0.1 ton per hour.
Design firing rate at maximum continuous steam flow for pet coke to the nearest 0.1 barrels per hour.
Design firing rate at maximum continuous steam flow for gas to the nearest 0.1 cubic feet per hour.
Design firing rate at maximum continuous steam flow for energy sources other than coal, petroleum, or natural gas.
The code representing the most predominant type of energy that fuels the boiler.
The code representing the second most predominant type of energy that fuels the boiler.
The code representing the third most predominant type of energy that fuels the boiler.
The code representing the fourth most predominant type of energy that fuels the boiler.
Design waste-heat input rate at maximum continuous steam flow where a waste-heat boiler is a boiler that receives all or a substantial portion of its energy input from the noncombustible exhaust gases of a separate fuel-burning process (MMBtu per hour).
Wet or Dry Bottom where Wet Bottom is defined as slag tanks that are installed at furnace throat to contain and remove molten ash from the furnace, and Dry Bottom is defined as having no slag tanks at furnace throat area, throat area is clear, and bottom ash drops through throat to bottom ash water hoppers.
Indicates whether the boiler is capable of re-injecting fly ash.
Indicates whether the boiler is a heat recovery steam generator (HRSG).
Maximum continuous steam flow at 100 percent load.
The turndown ratio for the boiler.
Boiler efficiency percentage when burning at 100 percent load to the nearest 0.1 percent.
Boiler efficiency percentage when burning at 50 percent load to the nearest 0.1 percent.
Total air flow including excess air at 100 percent load, reported at standard temperature and pressure (i.e. 68 F and one atmosphere pressure).
Indicates whether the boiler is subject to New Source Review requirements.
Month of issued New Source Review permit.
New Source Review permit number.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for particulate matter control standards.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for sulfur dioxide control standards.
EIA short code for most stringent type of statute or regulation code under which the boiler is operating for nitrogen oxide control standards.
Numeric value for the unit of measurement specified for particulate matter.
Numeric value for the unit of measurement specified for sulfur dioxide.
Numeric value for the unit of measurement specified for nitrogen oxide.
Numeric value for the unit of measurement specified for particulate matter.
Numeric value for the unit of measurement specified for sulfur dioxide.
Numeric value for the unit of measurement specified for nitrogen oxide.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for particulate matter emissions.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for nitrogen oxide emissions.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for sulfur dioxide emissions.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
If boiler is not in compliance with particulate matter regulations, strategy for compliance.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.
Year boiler was or is expected to be in compliance with federal, state and/or local regulations for mercury emissions.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Proposed strategy to comply with the most stringent mercury regulation.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Existing strategy to comply with the most stringent nitrogen oxide regulation.
Name of nitrogen oxide control manufacturer.
Code indicating the nitrogen oxide control burner manufacturer.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Proposed strategy to comply with the most stringent nitrogen oxide regulation.
Nitrogen oxide control status code.
Most stringent type of statute or regulation code under which the boiler is operating for mercury control standards.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
Existing strategy to comply with the most stringent sulfur dioxide regulation.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
Proposed strategy to comply with the most stringent sulfur dioxide regulation.
The percent of sulfur dioxide to be scrubbed specified by the most stringent sulfur dioxide regulation.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia860__scd_emissions_control_equipment
Slowly changing dimension (SCD) table describing attributes of emissions control equipment reported to EIA, including cost, type, operating status, retirement date, and install year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
report_year, plant_id_eia, emission_control_id_pudl
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Additional Details
Includes control ids for sulfur dioxide (SO2), particulate matter, mercury, nitrogen oxide (NOX), and acid (HCl) gas monitoring.
This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
Columns
Four-digit year in which the data was reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A PUDL-generated ID used to distinguish emission control units in the same report year and plant id. This ID should not be used to track units over time or between plants.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Short code indicating the type of emission control equipment installed.
The operating status of the asset.
Mercury control identification number. This ID is not a unique identifier.
Nitrogen oxide control identification number. This ID is not a unique identifier.
Particulate matter control identification number. This ID is not a unique identifier.
Sulfur dioxide control identification number. This ID is not a unique identifier.
Indicates whether the emissions control equipment controls acid (HCl) gas.
The total cost to install a piece of emission control equipment.
The date a piece of emissions control equipment began operating. Derived from month and year columns in the raw data.
The expected or actual retirement date for a piece of emissions control equipment. Derived from month and year columns in the raw data.
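The two date columns above are described as derived from month and year columns in the raw data. A minimal sketch of such a derivation follows; the function name is hypothetical, and pinning the result to the first day of the month is an assumption rather than something this documentation states.

```python
from datetime import date


def date_from_year_month(year, month):
    """Combine separately reported year and month values into one date.

    Assumption: the day is set to the first of the month, and a missing
    year or month yields None instead of a guessed date.
    """
    if year is None or month is None:
        return None
    return date(int(year), int(month), 1)


operating_date = date_from_year_month(2015, 6)
missing_date = date_from_year_month(2015, None)
```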
core_eia860__scd_generators
Slowly changing dimension (SCD) table describing attributes of generators which may vary from year to year. Compiled from across EIA-860 and EIA-923 data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
plant_id_eia, generator_id, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for generators are set. It contains values which are expected to vary slowly, while core_eia__entity_generators contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
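The 70% rule described above can be illustrated with a short sketch. This is not PUDL's implementation: the function name is hypothetical, and excluding nulls from the denominator is an assumption made for the example.

```python
from collections import Counter


def pick_canonical(values, threshold=0.7):
    """Return the most frequently reported value if it accounts for at
    least `threshold` of the non-null reports; otherwise return None.

    Assumption: null reports are dropped before computing the share.
    """
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    return value if count / len(reported) >= threshold else None


# Three of four reports agree (75%), so a canonical value is chosen.
consistent = pick_canonical(["NG", "NG", "NG", "DFO"])
# An even split (50%) falls below the threshold and becomes null.
inconsistent = pick_canonical(["NG", "NG", "DFO", "DFO"])
```

Under these assumptions, an attribute reported inconsistently yields a null, which matches the behavior described for this table.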
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
The EIA Utility Identification number.
Date reported.
The operating status of the asset.
The operating status of the asset. For generators this is based on the EIA-860 tab in which the generator was listed.
Identifies the ownership for each generator.
Total installed (nameplate) capacity, in megawatts.
The net summer capacity.
Whether the summer capacity value was an estimate.
The net winter capacity.
Whether the winter capacity value was an estimate.
Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.
Energy storage capacity in MWh (e.g. for batteries).
Code for the type of prime mover (e.g. CT, CG).
The code representing the most predominant type of energy that fuels the generator.
The code representing the second most predominant type of energy that fuels the generator.
The code representing the third most predominant type of energy that fuels the generator.
The code representing the fourth most predominant type of energy that fuels the generator.
The code representing the fifth most predominant type of energy that fuels the generator.
The code representing the sixth most predominant type of energy that fuels the generator.
Primary mode of transport for energy source 1.
Secondary mode of transport for energy source 1.
Tertiary mode of transport for energy source 1.
Primary mode of transport for energy source 2.
Secondary mode of transport for energy source 2.
Tertiary mode of transport for energy source 2.
Simplified fuel type code used in PUDL
Whether the generator can burn multiple fuels.
Indicates whether the generator can deliver power to the transmission grid.
Whether the generator is considered distributed generation
Indicates whether standby generators (SB status) can be synchronized to the grid.
Number of wind turbines, or hydrokinetic buoys.
Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.
Increase in summer capacity expected to be realized from the modification to the equipment.
Increase in winter capacity expected to be realized from the uprate modification to the equipment.
Planned effective date that the generator is scheduled to enter operation after the uprate modification.
Decrease in summer capacity expected to be realized from the derate modification to the equipment.
Decrease in winter capacity expected to be realized from the derate modification to the equipment.
Planned effective month that the generator is scheduled to enter operation after the derate modification.
New prime mover for the planned repowered generator.
New energy source code for the planned repowered generator.
Planned effective date that the generator is scheduled to enter operation after the repowering is complete.
Indicates whether there are other modifications planned for the generator.
Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.
Planned effective date of the scheduled retirement of the generator.
Indicates whether the generator uses carbon capture technology.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.
High level description of the technology used by the generator to produce electricity.
Number of wind turbines, or hydrokinetic buoys.
The minimum amount of time required to bring the unit to full load from shutdown.
The expected new nameplate capacity for the generator.
Whether the generator can co-fire fuels.
Whether the generator can switch between oil and natural gas.
The nameplate power factor of the generator.
The minimum load at which the generator can operate continuously.
Was an uprate or derate completed on this generator during the reporting year?
The date when the uprate or derate was completed.
The most recently updated effective date on which the generator is scheduled to start operation
EIA estimated summer capacity (in MW).
EIA estimated winter capacity (in MW).
Date of the scheduled or effected retirement of the generator.
Whether any part of the generator is owned by a nonutility.
Reactive Power Output (MVAr)
Indicates whether or not a generator is a qualifying FERC cogeneration facility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
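The generator ID description above warns that the ID must be treated as a string. The sketch below shows one reason why, using made-up rows: numeric coercion can collapse IDs that differ only by leading zeros and so break the (plant_id_eia, generator_id, report_date) primary key. The sample values and the `coerce` helper are hypothetical.

```python
import csv
import io

# Hypothetical rows: three distinct generators at one plant.
raw = "plant_id_eia,generator_id\n3,001\n3,1\n3,GT1\n"
rows = list(csv.DictReader(io.StringIO(raw)))

# Keeping generator_id as text preserves all three identifiers.
as_strings = {(int(r["plant_id_eia"]), r["generator_id"]) for r in rows}


def coerce(generator_id):
    """Mimic a loader that turns digit-only IDs into integers."""
    return int(generator_id) if generator_id.isdigit() else generator_id


# After coercion, "001" and "1" become the same key.
as_numbers = {(int(r["plant_id_eia"]), coerce(r["generator_id"])) for r in rows}
```

When loading with a dataframe library, the equivalent precaution is to set the column's type to string explicitly rather than relying on type inference.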
core_eia860__scd_generators_energy_storage
Slowly changing dimension (SCD) table describing attributes of energy storage which may vary from year to year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 3)
- Primary key:
plant_id_eia, generator_id, report_date
Additional Details
This table includes only those values that are unique to energy storage. The rest of the columns that are reported in the EIA-860 Energy Storage tabs are included in core_eia860__scd_generators and core_eia__entity_generators.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Maximum charge rate in MW.
Maximum discharge rate in MW.
A code representing the enclosure type that best describes where the generator is located.
The electro-chemical storage technology used for this battery application.
The electro-chemical storage technology used for this battery application.
The electro-chemical storage technology used for this battery application.
The electro-chemical storage technology used for this battery application.
Whether the energy storage device served arbitrage applications during the reporting year.
Whether the energy storage device served backup power applications during the reporting year.
Whether the energy storage device served renewable firming applications during the reporting year.
Whether the energy storage device served frequency regulation applications during the reporting year.
Whether the energy storage device served load following applications during the reporting year.
Whether the energy storage device served load management applications during the reporting year.
Whether this energy storage device served ramping / spinning reserve applications during the reporting year.
Whether the energy storage device served system peak shaving applications during the reporting year.
Whether the energy storage device served renewable firming applications during the reporting year.
Whether the energy storage device served voltage or reactive power support applications during the reporting year.
Whether the energy storage device was used to store excess wind/solar generation during the reporting year.
Indicates if this energy storage device is AC-coupled (means the energy storage device and the PV system are not installed on the same side of an inverter).
Indicates if this energy storage device is DC-coupled (means the energy storage device and the PV system are on the same side of an inverter and the battery can still charge from the grid).
Indicates if this energy storage device is DC tightly coupled (means the energy storage device and the PV system are on the same side of an inverter and the battery cannot charge from the grid).
Indicates if this energy storage device is independent (not coupled with another generator).
Indicates if the energy storage system is intended to support a specific substation, transmission or distribution asset.
Indicates if this energy storage device is intended for dedicated generator firming or storing excess generation of other units.
The EIA Plant ID of the primary unit whose generation this energy storage device is intended to firm or store.
The EIA Generator ID of the primary unit whose generation this energy storage device is intended to firm or store.
The EIA Plant ID of the secondary unit whose generation this energy storage device is intended to firm or store.
The EIA Generator ID of the secondary unit whose generation this energy storage device is intended to firm or store.
The EIA Plant ID of the tertiary unit whose generation this energy storage device is intended to firm or store.
The EIA Generator ID of the tertiary unit whose generation this energy storage device is intended to firm or store.
core_eia860__scd_generators_multifuel
Slowly changing dimension (SCD) table describing attributes of generators that may vary from year to year, pertaining to fuel-switching and the use of multiple fuels.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 3)
- Primary key:
report_date, utility_id_eia, generator_id, plant_id_eia
Columns
Date reported.
The EIA Utility Identification number.
The name of the utility.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
Two letter US state abbreviation.
County name.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
The operating status of the asset.
High level description of the technology used by the generator to produce electricity.
Code for the type of prime mover (e.g. CT, CG)
EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
Total installed (nameplate) capacity, in megawatts.
The net summer capacity.
The net winter capacity.
The most recently updated effective date on which the generator is scheduled to start operation
The code representing the most predominant type of energy that fuels the generator.
The code representing the second most predominant type of energy that fuels the generator
Whether the generator can burn multiple fuels.
Whether the generator can co-fire fuels.
The codes representing the type of fuel that will be co-fired.
The codes representing the type of fuel that will be co-fired.
The codes representing the type of fuel that will be co-fired.
The codes representing the type of fuel that will be co-fired.
The codes representing the type of fuel that will be co-fired.
The codes representing the type of fuel that will be co-fired.
Whether the generator can switch between oil and natural gas.
The time required to switch the generator from running 100 percent oil to running 100 percent natural gas.
The time required to switch the generator from running 100 percent natural gas to running 100 percent oil.
Indicates whether a fuel switching generator can switch fuels while operating.
The maximum net summer output achievable when running on natural gas.
The maximum net summer output achievable when running on oil.
The maximum net winter output achievable when running on natural gas.
The maximum net winter output achievable when running on oil.
Whether there are factors that limit the generator's ability to switch between oil and natural gas.
Whether limited on-site fuel storage is a factor that limits the generator's ability to switch between oil and natural gas.
Whether air permit limits are a factor that limits the generator's ability to switch between oil and natural gas.
Whether there are factors other than air permit limits and storage that limit the generator's ability to switch between oil and natural gas.
Whether the generator can co-fire oil and gas.
Whether the generator can co-fire 100 percent oil.
The maximum oil heat input (percent of MMBtus) expected for proposed unit when co-firing with natural gas
The maximum output (net MW) expected for proposed unit, when making the maximum use of oil and co-firing natural gas.
Whether a unit is able to switch fuels.
Whether there are factors that limit the operation of the generator when running on 100 percent oil
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.
core_eia860__scd_generators_solar
Slowly changing dimension (SCD) table describing attributes of solar generators that may vary from year to year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 3.3)
- Primary key:
plant_id_eia, generator_id, report_date
Additional Details
This table includes only those values that are unique to solar generators. The rest of the columns that are reported in the EIA-860 Solar tabs are included in core_eia860__scd_generators and core_eia__entity_generators.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
The net capacity of this photovoltaic generator in direct current under standard test conditions (STC) of 1000 W/m^2 solar irradiance and 25 degrees Celsius PV module temperature. This was only reported in 2013 and 2014.
The DC megawatt capacity that is part of a net metering agreement.
Indicates if the output from this generator is part of a net metering agreement.
Indicates if the output from this generator is part of a virtual net metering agreement.
The DC capacity in MW that is part of a virtual net metering agreement.
Indicates the azimuth angle of the unit for fixed tilt or single-axis technologies.
Indicates the tilt angle of the unit for fixed tilt or single-axis technologies.
Indicates whether lenses or mirrors are used at this solar generating unit.
Indicates whether single-axis tracking technologies are used at this solar generating unit.
Indicates whether dual-axis tracking technologies are used at this solar generating unit.
Indicates whether fixed tilt technologies are used at this solar generating unit.
Indicates whether east west fixed tilt technologies are used at this solar generating unit.
Indicates whether parabolic trough technologies are used at this solar generating unit.
Indicates whether linear fresnel technologies are used at this solar generating unit.
Indicates whether power towers are used at this solar generating unit.
Indicates whether dish engines are used at this solar generating unit.
Indicates whether other solar technologies are used at this solar generating unit.
Indicates whether any solar photovoltaic panels at this generator are made of crystalline silicon.
Indicates whether any solar photovoltaic panels at this generator are made of thin-film amorphous silicon (A-Si).
Indicates whether any solar photovoltaic panels at this generator are made of thin-film cadmium telluride (CdTe).
Indicates whether any solar photovoltaic panels at this generator are made of thin-film copper indium gallium diselenide (CIGS).
Indicates whether any solar photovoltaic panels at this generator are made of other thin-film material.
Indicates whether any solar photovoltaic panels at this generator are made of other materials.
Indicates whether bifacial solar panels are used at this solar generating unit.
core_eia860__scd_generators_wind
Slowly changing dimension (SCD) table describing attributes of wind generators that may vary from year to year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 3.2)
- Primary key:
plant_id_eia, generator_id, report_date
Additional Details
This table includes only those values that are unique to wind generators. The rest of the columns that are reported in the EIA-860 Wind tabs are included in core_eia860__scd_generators and core_eia__entity_generators.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Average annual wind speed that turbines at this wind site were designed for.
The Federal Aviation Administration (FAA) obstacle number assigned to this generator. If more than one obstacle number exists, the one that best represents the turbines. References the obstacle numbers reported in the FAA's Digital Obstacle File: https://www.faa.gov/air_traffic/flight_info/aeronav/digital_products/dof/ This field was only reported from 2013 through 2015.
Name of predominant manufacturer of turbines at this generator.
Predominant model number of turbines at this generator.
The hub height of turbines at this generator. If more than one value exists, the one that best represents the turbines.
The wind quality class for turbines at this generator. See table core_eia__codes_wind_quality_class for specifications about each class.
core_eia860__scd_ownership
Slowly changing dimension (SCD) table describing attributes of generator ownership. Includes only jointly or third-party owned generators.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 4)
- Primary key:
report_date, plant_id_eia, generator_id, owner_utility_id_eia
Columns
Date reported.
The EIA utility Identification number for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility.
The EIA utility Identification number for the operator utility.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
The name of the EIA owner utility.
Two letter ISO-3166 political subdivision code.
City of owner.
Three letter ISO-3166 country code.
Street address of owner.
Zip code of owner.
Proportion of generator ownership attributable to this utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
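The ownership proportion described above is typically used to split a generator-level quantity among its owners. A minimal sketch under stated assumptions: the function name, the owner IDs, and the capacity figure are hypothetical, and the check that shares sum to one is an illustrative safeguard rather than a guarantee made by this table.

```python
import math


def split_by_ownership(capacity_mw, ownership_shares):
    """Allocate a generator's capacity to owners by ownership proportion.

    `ownership_shares` maps an owner utility ID to its proportion (0 to 1).
    Raises ValueError if the shares do not sum to approximately 1.
    """
    total = sum(ownership_shares.values())
    if not math.isclose(total, 1.0, abs_tol=1e-6):
        raise ValueError(f"ownership shares sum to {total}, expected 1.0")
    return {
        owner: capacity_mw * share
        for owner, share in ownership_shares.items()
    }


# Hypothetical 400 MW generator owned 75% / 25% by two utilities.
owned_mw = split_by_ownership(400.0, {1001: 0.75, 1002: 0.25})
```

Because this table includes only jointly or third-party owned generators, wholly owned generators would need to be handled separately before applying a split like this.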
core_eia860__scd_plants
Slowly changing dimension (SCD) table describing attributes of plants which may vary from year to year. Compiled from across all EIA-860 and EIA-923 data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
plant_id_eia, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for plants are set. It contains values which are expected to vary slowly, while core_eia__entity_plants contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
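The 70% rule can be illustrated with a short sketch. This is not PUDL's actual harvesting code; the function name `canonical_value` is hypothetical, and excluding nulls from the denominator is an assumption made for the example.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Return the most common non-null value if it meets the threshold, else None.

    Assumption: null (None) reports are ignored when computing the share.
    """
    values = [v for v in reported if v is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count / len(values) >= threshold else None


# Eight of ten reports agree, so the value is accepted.
consistent = canonical_value(["SERC"] * 8 + ["RFC"] * 2)
# Only two of three reports agree (about 67%), so the result is null.
inconsistent = canonical_value(["SERC", "SERC", "RFC"])
```

Under this sketch, an attribute reported inconsistently ends up as `None`, which corresponds to the null values described above.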
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
Is there an ash impoundment (e.g. pond, reservoir) at the plant?
If there is an ash impoundment at the plant, is the impoundment lined?
If there is an ash impoundment at the plant, the ash impoundment status as of December 31 of the reporting year.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
Geodetic coordinate system identifier (e.g. NAD27, NAD83, or WGS84).
Indicates if the facility has energy storage capabilities.
The docket number relating to the FERC cogenerator status. See FERC Form 556.
Indicates whether the plant has FERC qualifying facility cogenerator status. See FERC Form 556.
The docket number relating to the FERC qualifying facility exempt wholesale generator status.
Indicates whether the plant has FERC qualifying facility exempt wholesale generator status
The docket number relating to the FERC qualifying facility small power producer status. See FERC Form 556.
Indicates whether the plant has FERC qualifying facility small power producer status. See FERC Form 556.
The docket number relating to the FERC qualifying facility cogenerator status. See FERC Form 556.
Plant's grid voltage at point of interconnection to transmission or distribution facilities
Plant's grid voltage at point of interconnection to transmission or distribution facilities
Plant's grid voltage at point of interconnection to transmission or distribution facilities
The code of the plant's ISO or RTO. NA if not reported in that year.
Indicates whether the facility has the capability to store natural gas in the form of liquefied natural gas.
Names of the local distribution companies (LDCs) connected to natural gas burning power plants.
Indicates whether the facility has on-site storage of natural gas.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.
NERC region in which the plant is located
Whether the plant has a net metering agreement in effect during the reporting year. Only displayed for facilities that report the sun or wind as an energy source. This field was only reported through 2015.
Additional owner or operator of natural gas pipeline.
North American Industry Classification System (NAICS) code that best describes the primary purpose of the reporting plant
Indicates whether the plant is regulated or non-regulated.
Code that specifies what time period data has to be reported (i.e. monthly data or annual totals) and how often the power plant reports this data to EIA. See reporting_frequencies_eia for more details.
EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
Service area in which plant is located; for unregulated companies, it's the electric utility with which plant is interconnected
EIA-assigned code for owner of transmission/distribution system to which the plant is interconnected.
Name of the owner of the transmission or distribution system to which the plant is interconnected.
State location for owner of transmission/distribution system to which the plant is interconnected.
The EIA Utility Identification number.
Name of water source associated with the plant.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia860__scd_utilities
Slowly changing dimension (SCD) table describing attributes of utilities which may vary from year to year. Compiled from all EIA data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860 -- Annual Electric Generator Report
- Primary key:
utility_id_eia, report_date
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for utilities are set. It contains values which are expected to vary slowly, while core_eia__entity_utilities contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
Columns
The EIA Utility Identification number.
Date reported.
Physical street address.
Name of the city.
Two letter US state abbreviation.
Five digit US Zip Code.
Is the reporting entity an owner of power plants reported on Schedule 2 of the form?
Is the reporting entity an operator of power plants reported on Schedule 2 of the form?
Is the reporting entity an asset manager of power plants reported on Schedule 2 of the form?
Does the reporting entity have any other relationship to the power plants reported on Schedule 2 of the form?
Entity type of principal owner.
Mail attention name of the operator/owner.
Second line of the address.
Four digit US Zip Code suffix.
First name of utility contact 1.
Last name of utility contact 1.
Title of utility contact 1.
Phone number for utility contact 1.
Phone extension for utility contact 1
First name of utility contact 2.
Last name of utility contact 2.
Title of utility contact 2.
Phone number for utility contact 2.
Phone extension for utility contact 2
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia860m__changelog_generators
Changelog table tracking changes in generator characteristics reported in EIA-860M.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 860M -- Monthly Update to the Annual Electric Generator Report
- Primary key:
plant_id_eia, generator_id, report_date
Additional Details
EIA-860M includes generator tables with the most up-to-date catalog of EIA generators and their operational status and other generator characteristics. EIA-860M is reported monthly, although for the vast majority of generators nothing changes from month to month. This table is a changelog of that monthly reported generator data. The first record for each generator captures its characteristics as first reported, with a report_date column and a valid_until_date column. Whenever any of the reported EIA-860M data changes for a generator, a new changelog record is added with a new report_date.
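One common use of a changelog like this is to look up the record that was in effect on a given date. The sketch below assumes each record is valid from report_date up to, but not including, valid_until_date. The sample rows are invented, `operational_status` is an assumed column name, and treating a missing valid_until_date as "still current" is an assumption rather than documented behavior.

```python
from datetime import date

# Invented changelog rows for one generator.
CHANGELOG = [
    {"plant_id_eia": 3, "generator_id": "1", "report_date": date(2021, 1, 1),
     "valid_until_date": date(2021, 6, 1), "operational_status": "proposed"},
    {"plant_id_eia": 3, "generator_id": "1", "report_date": date(2021, 6, 1),
     "valid_until_date": None, "operational_status": "existing"},
]


def record_as_of(changelog, plant_id_eia, generator_id, as_of):
    """Return the changelog record in effect on as_of, or None if there is none."""
    for rec in changelog:
        if (rec["plant_id_eia"], rec["generator_id"]) != (plant_id_eia, generator_id):
            continue
        end = rec["valid_until_date"]
        # Half-open interval: report_date <= as_of < valid_until_date.
        if rec["report_date"] <= as_of and (end is None or as_of < end):
            return rec
    return None


march = record_as_of(CHANGELOG, 3, "1", date(2021, 3, 15))
june = record_as_of(CHANGELOG, 3, "1", date(2021, 6, 1))
```

Because the interval is half-open, a lookup on the exact date of a change returns the newer record, and no date can match two records for the same generator.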
Columns
Date reported.
The record in the changelog is valid until this date. The record is valid from the report_date up until but not including the valid_until_date.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
The EIA Utility Identification number.
The name of the utility.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Total installed (nameplate) capacity, in megawatts.
County name.
The most recently updated effective date on which the generator is scheduled to start operation
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
The code representing the most predominant type of energy that fuels the generator.
Energy storage capacity in MWh (e.g. for batteries).
Simplified fuel type code used in PUDL
Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.
Date of the scheduled or effected retirement of the generator.
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.
The operating status of the asset. For generators this is based on which EIA 860 tab the generator was listed in.
The operating status of the asset.
Planned effective month that the generator is scheduled to enter operation after the derate modification.
Planned effective date of the scheduled retirement of the generator.
Decrease in summer capacity expected to be realized from the derate modification to the equipment.
Increase in summer capacity expected to be realized from the modification to the equipment.
Planned effective date that the generator is scheduled to enter operation after the uprate modification.
Planned effective date that the generator is scheduled to enter operation after the repowering is complete.
Code for the type of prime mover (e.g. CT, CG)
EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status
Two letter US state abbreviation.
The net summer capacity.
High level description of the technology used by the generator to produce electricity.
The net winter capacity.
core_eia861__assn_balancing_authority
Association table providing connections between state, balancing authority, and utility in a given year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
report_date, balancing_authority_id_eia, utility_id_eia, state
Columns
Date reported.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
The EIA Utility Identification number.
Two letter US state abbreviation.
core_eia861__assn_utility
Association table providing connections between utility and state in a given year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
report_date, utility_id_eia, state
Columns
Date reported.
The EIA Utility Identification number.
Two letter US state abbreviation.
core_eia861__yearly_advanced_metering_infrastructure
Annual time series of advanced metering infrastructure (AMI) and automated meter reading (AMR) by state, sector, and balancing authority.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
balancing_authority_code_eia, customer_class, report_date, state, utility_id_eia
Additional Details
The energy served (in MWh) for AMI systems is provided. Form EIA-861 respondents also report the number of standard meters (non-AMR/AMI) in their system. Historical changes: EIA started collecting the number of standard meters in 2013. The monthly survey collected these data from January 2011 to January 2017.
Columns
Number of meters that measure and record usage data, at a minimum, in hourly intervals and provide usage data at least daily to energy companies, and that may also provide data to consumers. Data are used for billing and other purposes. Advanced meters include basic hourly interval meters and extend to real-time meters with built-in two-way communication capable of recording and transmitting instantaneous data.
Number of meters that collect data for billing purposes only and transmit this data one way, usually from the customer to the distribution utility.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
High level categorization of customer type (e.g., commercial, residential).
Number of customers able to access daily energy usage through a web portal or other electronic means.
Number of customers with direct load control: a demand response activity by which the program sponsor remotely shuts down or cycles a customer’s electrical equipment (e.g. air conditioner, water heater) on short notice.
Amount of energy served through AMI meters. AMI meters can transmit data in both directions, between the delivery entity and the customer.
Entity type of principal owner.
Number of AMI meters with home area network (HAN) gateway enabled.
Number of non-AMR/AMI meters. Usually electromechanical or solid state meters measuring aggregated kWh where data are manually retrieved over monthly billing cycles for billing purposes only. Standard meters may also include functions to measure time-of-use and/or demand with data manually retrieved over monthly billing cycles.
Date reported.
Whether the reported information comes from the short form. A shorter version of Form EIA-861 was created in 2012 to reduce respondent burden on smaller utilities and increase EIA's processing efficiency.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_balancing_authority
Annual time series of balancing authorities.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
report_date, balancing_authority_id_eia
Columns
Date reported.
EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Name of the balancing authority.
core_eia861__yearly_demand_response
Annual time series of demand response programs by state, sector, and balancing authority.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
balancing_authority_code_eia, customer_class, report_date, state, utility_id_eia
Additional Details
The EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.
Columns
Demand reduction actually achieved by demand response activities. Measured at the time of the company's annual system peak hour.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
High level categorization of customer type (e.g., commercial, residential).
Total cost of customer incentives in a given report year. Customer incentives are the total financial value provided to a customer for program participation, whether as a cash payment, lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.
Number of customers.
The energy savings incurred in a given reporting year by participation in demand response programs.
Additional costs.
The total demand savings that could occur at the time of the system peak hour assuming all demand response is called.
Date reported.
Whether the reported information comes from the short form. A shorter version of Form EIA-861 was created in 2012 to reduce respondent burden on smaller utilities and increase EIA's processing efficiency.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_demand_response_water_heater
Annual time series of grid-connected water heaters enrolled in demand response programs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
balancing_authority_code_eia, report_date, state, utility_id_eia
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
The number of grid-enabled water heaters added to the respondent's program this year, if the respondent has a DSM program for grid-enabled water heaters (as defined by DOE’s Office of Energy Efficiency and Renewable Energy).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_demand_side_management_ee_dr
Annual time series of the impact of energy efficiency and load management programs on total energy sold (MWh) and peak demand (MW) by customer class.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.
Columns
Costs that have not been included in any program category, but could be meaningfully identified with operating the company’s DSM programs (e.g., Administrative, Marketing, Monitoring & Evaluation, Company-Earned Incentives, Other).
The sum of direct program costs, indirect program costs, and incentive payments associated with utility demand side management programs.
High level categorization of customer type (e.g., commercial, residential).
The peak reduction incurred in a given reporting year by all participants in efficiency programs.
The sum of actual direct costs, incentive payments, and indirect costs incurred in a given reporting year from energy efficiency programs.
The change in energy use incurred in a given reporting year by all participants in energy efficiency programs.
The cost of incentive payments incurred in a given reporting year from energy efficiency programs. Incentives are the total financial value provided to a customer for program participation, whether cash payment, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.
The peak reduction incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.
The change in energy use incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.
The peak reduction incurred in a given reporting year by all participants in load management programs.
The sum of actual direct costs, incentive payments, and indirect costs incurred in a given reporting year from load management programs.
The change in energy use incurred in a given reporting year by all participants in load management programs.
The cost of incentive payments incurred in a given reporting year from load management programs. Incentives are the total financial value provided to a customer for program participation, whether cash payment, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.
The potential amount of peak reduction that could be incurred in a given reporting year by all participants in load management programs.
The peak reduction incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.
The change in energy use incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.
The potential amount of peak reduction that could be incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.
NERC region in which the plant is located
The number of customers participating in the respondent's incentive-based demand response programs.
Date reported.
Two letter US state abbreviation.
The number of customers participating in the respondent's time-based rate programs.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_demand_side_management_misc
Annual time series of demand-side management (DSM) program information.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
Includes boolean fields indicating whether the energy savings estimates or calculations were independently verified and whether the utility runs time- and/or price-responsive programs. Also contains information on whether any of the respondent's DSM activities are reported under another company, and if so which one.
The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.
Columns
Whether savings estimates are based on a forecast or the report of one or more independent evaluators.
Whether reported energy savings were verified through an independent evaluation.
Entity type of principal owner.
Whether there have been any major changes to the respondent's demand-side management programs (e.g., terminated programs, new information or financing programs, or a shift to programs with dual load building objectives and energy efficiency objectives), program tracking procedures, or reporting methods that affect the comparison of demand-side management data reported on this schedule to data from previous years.
NERC region in which the plant is located
Whether the respondent operates any incentive-based demand response programs (e.g., market incentives, financial incentives, direct load control, interruptible programs, demand bidding/buyback, emergency demand response, capacity market programs, and ancillary service market programs).
Date reported.
The name of the company if a respondent's demand-side management activities are reported on Schedule 6 of another company’s form.
Whether the reported information comes from the short form. A shorter version of Form EIA-861 was created in 2012 to reduce respondent burden on smaller utilities and increase EIA's processing efficiency.
Two letter US state abbreviation.
Whether the respondent operates any time-based rate programs (e.g., real-time pricing, critical peak pricing, variable peak pricing and time-of-use rates administered through a tariff).
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_demand_side_management_sales
Annual time series of electricity sales related to demand-side management (DSM).
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.
Columns
NERC region in which the plant is located
Date reported.
The amount of electricity sold for resale purposes. This entry should include sales for resale to power marketers (reported separately in previous years), full and partial requirements customers, firm power customers and nonfirm customers.
The amount of electricity sold to customers purchasing electricity for their own use and not for resale.
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_distributed_generation_fuel
Annual time series of the energy sources used for utility or customer-owned distributed generation capacity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
From 2016 on, there's no great match for the data found in this table. You may be able to compute an analog for fuel percents using the capacity and tech class figures in core_eia861__yearly_non_net_metering_customer_fuel_class, but no corresponding consumption or generation information is available in the new format.
The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
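As a rough sketch of the analog mentioned above, capacity shares by technology class could be computed per utility and year. The column names (utility_id_eia, report_date, tech_class, capacity_mw) and the sample rows are assumptions for illustration, not taken from this page. A capacity share is only an approximation of the fuel percent reported before 2016, and the 'total' class is skipped on the assumption that it is an aggregate row.

```python
from collections import defaultdict

# Hypothetical rows shaped like
# core_eia861__yearly_non_net_metering_customer_fuel_class.
# Column names and values are assumptions for illustration only.
rows = [
    {"utility_id_eia": 1, "report_date": "2016-01-01", "tech_class": "pv", "capacity_mw": 6.0},
    {"utility_id_eia": 1, "report_date": "2016-01-01", "tech_class": "wind", "capacity_mw": 2.0},
    {"utility_id_eia": 1, "report_date": "2016-01-01", "tech_class": "total", "capacity_mw": 8.0},
    {"utility_id_eia": 1, "report_date": "2016-01-01", "tech_class": "hydro", "capacity_mw": None},
]


def capacity_shares(rows):
    """Return {(utility_id_eia, report_date): {tech_class: percent of capacity}}."""
    by_key = defaultdict(lambda: defaultdict(float))
    for r in rows:
        # Skip the assumed aggregate class and null capacities.
        if r["tech_class"] == "total" or r["capacity_mw"] is None:
            continue
        key = (r["utility_id_eia"], r["report_date"])
        by_key[key][r["tech_class"]] += r["capacity_mw"]

    shares = {}
    for key, caps in by_key.items():
        total = sum(caps.values())
        if total > 0:
            shares[key] = {tech: 100.0 * mw / total for tech, mw in caps.items()}
    return shares


shares = capacity_shares(rows)
```

Because the Non-Net Metering data are also split by sector, this sketch sums capacity across sectors before taking shares.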
Columns
Whether the reported fuel data is estimated or actual.
Fuel types specific to EIA 861 distributed generation table: ['gas', 'oil', 'other', 'renewable', 'water', 'wind', 'wood']
Percent of fuel
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_distributed_generation_misc
Annual time series of the capacity and quantity of utility or customer-owned distributed generation.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
From 2016 on, data similar to that found in this table can be found in core_eia861__yearly_non_net_metering_misc.
The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
Columns
The total nameplate capacity of generators that are used only for emergency backup service.
Amount of distributed generation capacity owned by the respondent.
Whether the reported capacity data is estimated or actual.
Total number of generators less than 1 MW.
Total number of generators
Date reported.
Two letter US state abbreviation.
The total amount of capacity from generators with less than 1 MW of nameplate capacity.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_distributed_generation_tech
Annual time series of the technology used for utility or customer-owned distributed generation.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
From 2016 on, data similar to that found in this table can be found in core_eia861__yearly_non_net_metering_customer_fuel_class.
The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
Columns
Total installed (nameplate) capacity, in megawatts.
Whether the reported technology data is estimated or actual.
Date reported.
Two letter US state abbreviation.
Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_distribution_systems
Annual time series of distribution circuits and circuits with voltage optimization by state.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Columns
Number of distribution circuits that employ voltage/VAR optimization (VVO).
Total number of distribution circuits.
Date reported.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_dynamic_pricing
Annual time series of enrollment in dynamic pricing programs by state, sector, and balancing authority.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Additional Details
Respondents check if one or more customers are enrolled in time-of-use pricing, real time pricing, variable peak pricing, critical peak pricing, and critical peak rebates.
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Whether customers are participating in critical peak pricing, a program in which rate and/or price structure is designed to encourage reduced consumption during periods of high wholesale market prices or system contingencies, by imposing a pre-specified high rate or price for a limited number of days or hours.
Whether customers are participating in critical peak rebates, a program in which rate and/or price structure is designed to encourage reduced consumption during periods of high wholesale market prices or system contingencies, by providing a rebate to the customer on a limited number of days and for a limited number of hours, at the request of the energy provider.
High level categorization of customer type (e.g., commercial, residential).
Number of customers.
Whether the respondent has customers participating in a real time pricing (RTP) program. RTP is a program of rate and price structure in which the retail price for electricity typically fluctuates hourly or more often, to reflect changes in the wholesale price of electricity on either a day-ahead or hour-ahead basis.
Date reported.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
Whether the respondent has customers participating in a time-of-use (TOU) pricing program. TOU is a program in which customers pay different prices at different times of the day. On-peak prices are higher and off-peak prices are lower than a “standard” rate. The price schedule is fixed and predefined, based on season, day of week, and time of day.
The EIA Utility Identification number.
The name of the utility.
Whether the respondent has customers participating in a variable peak pricing program (VPP). VPP is a program in which a form of TOU pricing allows customers to purchase their generation supply at prices set on a daily basis with varying on-peak and constant off-peak rates. Under the VPP program, the on-peak price for each weekday becomes available the previous day (typically late afternoon) and the customer is billed for actual consumption during the billing cycle at these prices.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_energy_efficiency
Annual time series of incremental energy savings, peak demand savings, weighted average life cycle, and associated costs for the reporting year and life cycle of energy efficiency programs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Additional Details
The EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. We were discouraged from doing so after contacting a representative from EIA.
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
High level categorization of customer type (e.g., commercial, residential).
The cost of customer incentives resulting from new participants in existing energy efficiency programs and all participants in new energy efficiency programs. Customer incentives are the total financial value provided to a customer for program participation, whether, for example, cash payment, or lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.
All anticipated costs of the customer incentives including reporting year incremental costs and all future costs. Customer incentives are the total financial value provided to a customer for program participation, whether, for example, cash payment, or lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.
All anticipated costs other than customer incentives. Includes reporting year incremental costs and all future costs.
Energy savings in the given report year resulting from new participants in existing demand response programs and all participants in new demand response programs.
The estimated total changes in energy use for incremental programs and participants over the life of the programs. DSM programs have a useful life, and the net effects of these programs will diminish over time. Considers the useful life of energy efficiency technology by accounting for building demolition, equipment degradation, and program attrition.
The estimated total changes in peak load for incremental programs and participants over the life of the programs. DSM programs have a useful life, and the net effects of these programs will diminish over time. Considers the useful life of energy efficiency technology by accounting for building demolition, equipment degradation, and program attrition.
The peak reduction incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.
Costs resulting from new participants in existing energy efficiency programs and all participants in new energy efficiency programs that aren't directly associated with customer incentives.
Date reported.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
The weighted average life of the respondent's portfolio of energy efficiency programs.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_green_pricing
Annual time series of green pricing program revenue, sales, and customer count by sector and state.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
The data is no longer being collected or reported in this way.
Columns
High level categorization of customer type (e.g., commercial, residential).
Number of customers.
The money derived from the premium green pricing rate of the respondent's program.
Amount of revenue collected from Renewable Energy Certificates (RECs).
Amount of sales collected from Renewable Energy Certificates (RECs).
Date reported.
Quantity of electricity sold in MWh.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_mergers
Annual time series of utility mergers and acquisitions.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Columns
Entity type of principal owner.
Address of new parent company.
City of new parent company.
Name of the company merged with or acquired.
Date of merger or acquisition.
Two letter US state abbreviations and three letter ISO-3166-1 country codes for international mines.
Name of the new parent company post merger.
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Five digit US Zip Code.
Four digit US Zip Code suffix.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_net_metering_customer_fuel_class
Annual time series of net metering by customer and fuel class.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Usage Warnings
Some years use a slightly different data definition.
Additional Details
The amount of energy sold back to the grid. From 2007 to 2009 the data are reported as a lump sum of total energy dispatched by sector. After 2009, the data are broken down by sector and technology type.
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Total installed (nameplate) capacity, in megawatts.
The total amount of energy the system can supply before recharging is necessary, in megawatt-hours.
High level categorization of customer type (e.g., commercial, residential).
Number of customers.
Date reported.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
The amount of electric energy sold back to the utility through the net metering application.
Two letter US state abbreviation.
Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_net_metering_misc
Annual time series of PV current flow type for net metered capacity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Current flow type for photovoltaics: AC or DC
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_non_net_metering_customer_fuel_class
Annual time series of non-net metered distributed generation by sector and technology type.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Additional Details
This table's data starts in 2016. For 2015 and earlier, data similar to that found in this table can be found in core_eia861__yearly_distributed_generation_tech.
The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
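A minimal sketch of how the 2016 split could be handled when choosing which table to read for a given report year. The table names come from the notes above; the helper name, the "tech"/"misc" labels, and the constant are hypothetical.

```python
# Pairs of (through-2015 table, 2016-onward table), as described in the
# Additional Details notes. The "tech"/"misc" labels are hypothetical.
_TABLES = {
    "tech": (
        "core_eia861__yearly_distributed_generation_tech",
        "core_eia861__yearly_non_net_metering_customer_fuel_class",
    ),
    "misc": (
        "core_eia861__yearly_distributed_generation_misc",
        "core_eia861__yearly_non_net_metering_misc",
    ),
}

FIRST_NON_NET_METERING_YEAR = 2016


def table_for_year(kind, report_year):
    """Return the table name that covers report_year for the given kind."""
    old, new = _TABLES[kind]
    return new if report_year >= FIRST_NON_NET_METERING_YEAR else old


early = table_for_year("tech", 2015)
late = table_for_year("misc", 2016)
```

The fuel table is left out on purpose, since the notes say it has no close match from 2016 on.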
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Total installed (nameplate) capacity, in megawatts.
The total amount of energy the system can supply before recharging is necessary, in megawatt-hours.
High level categorization of customer type (e.g., commercial, residential).
Date reported.
Two letter US state abbreviation.
Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_non_net_metering_misc
Annual time series of non-net metered distributed generation: number of generators, PV current flow type, backup capacity, and utility-owned capacity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key.
Additional Details
This table's data starts in 2016. For 2015 and earlier, data similar to that found in this table can be found in core_eia861__yearly_distributed_generation_misc.
The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
Columns
The total nameplate capacity of generators that are used only for emergency backup service.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Total number of generators
Current flow type for photovoltaics: AC or DC
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
Total non-net-metered capacity owned by the respondent.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_operational_data_misc
Annual time series of megawatt-hours (MWh) for the sources and disposition of a utility's electricity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report (Schedule 2B)
- Primary key:
utility_id_eia, nerc_region, report_date
Usage Warnings
Some values have been redacted.
Contains rows where missing values were imputed.
Additional Details
Respondents are required to report this information to the EIA, but are not required to disclose utility-level data to the public. When a respondent chooses to keep its utility-level data proprietary, it files using EIA utility id 88888. For more details, see EIA-861 Notable Irregularities.
Rows where data_observed is False were labeled as imputed in the raw EIA data. The EIA documentation does not specify what imputation applies to operational data; it describes imputation only for net metering and non-net metering distributed data. Fewer than 1% of rows are labeled as imputed, all of which occur in 2004 or 2005. Imputed rows make up no more than 25% of the rows for any utility.
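One way to act on these two warnings is to separate withheld and imputed rows before analysis. The sketch below assumes rows are available as dictionaries; utility_id_eia and data_observed are named in this section, while net_generation_mwh and the sample values are assumptions for illustration.

```python
WITHHELD_UTILITY_ID = 88888  # ID used when utility-level data is kept proprietary

# Hypothetical sample rows; values are made up for illustration.
rows = [
    {"utility_id_eia": 101, "data_observed": True, "net_generation_mwh": 1200.0},
    {"utility_id_eia": 88888, "data_observed": True, "net_generation_mwh": 900.0},
    {"utility_id_eia": 102, "data_observed": False, "net_generation_mwh": 300.0},
]


def split_rows(rows):
    """Split rows into (usable, withheld, imputed) lists."""
    usable, withheld, imputed = [], [], []
    for r in rows:
        if r["utility_id_eia"] == WITHHELD_UTILITY_ID:
            # Not attributable to a single named utility.
            withheld.append(r)
        elif r["data_observed"] is False:
            # Labeled as imputed in the raw EIA data.
            imputed.append(r)
        else:
            usable.append(r)
    return usable, withheld, imputed


usable, withheld, imputed = split_rows(rows)
```

Whether to drop or keep the withheld and imputed rows depends on the analysis; state-level totals, for example, may still want the 88888 rows.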
Columns
The amount of electricity used by the facility.
The amount of electricity used by the electric utility in its electric and other departments without charge.
Is the value observed (True) or imputed (False).
Entity type of principal owner.
The amount of exchange energy delivered. Does not include power delivered as part of a tolling arrangement.
The amount of exchange energy received. Does not include power received through tolling arrangements.
The amount of electricity furnished by the electric utility without charge, such as to a municipality under a franchise agreement or for street and highway lighting.
NERC region in which the plant is located
Net electricity generation for the specified period in megawatt-hours (MWh).
The net amount of energy exchanged. Net exchange is the difference between the amount of exchange received and the amount of exchange delivered. This entry should not include wholesale energy purchased from or sold to regulated companies or unregulated companies for other systems.
The difference between the amount of energy entering the respondent's system (wheeled received) for transmission through the respondent's system and the amount of energy leaving the respondent's system (wheeled delivered). Wheeled net represents the energy losses on the respondent's system associated with the wheeling of energy for other systems.
Date reported.
MWh of sales to end-use customers in areas where the customer has been given the legal right to select a power supplier other than the traditional, vertically integrated electric utility.
The amount of electricity sold for resale purposes. This entry should include sales for resale to power marketers (reported separately in previous years), full and partial requirements customers, firm power customers and nonfirm customers.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
The maximum hourly summer load (for the months of June through September) based on net energy for the system during the reporting year. Net energy for the system is the sum of energy an electric utility needs to satisfy their service area and includes full and partial wholesale requirements customers, and the losses experienced in delivery. The maximum hourly load is determined by the interval in which the 60-minute integrated demand is the greatest.
Sum of all disposition of electricity listed. Includes sales to ultimate customers, sales for resale, energy furnished without charge, energy consumed by respondent without charge and total energy losses.
The total amount of electricity lost from transmission, distribution, and/or unaccounted for. Should be expressed as a positive number.
Sum of all sources of electricity listed. Includes net generation, purchases from electricity suppliers, net exchanges (received - delivered), net wheeled (received - delivered), transmission by others, and losses.
The amount of energy losses associated with the wheeling of electricity provided to the respondent's system by other utilities. Transmission by others, losses should always be a negative value.
The EIA Utility Identification number.
The name of the utility.
The total amount of energy leaving the respondent's system that was transmitted through the respondent's system for delivery to other systems. If wheeling delivered is not precisely known, the value is an estimate based on the respondent's system's known percentage of losses for wheeling transactions.
The total amount of energy entering the respondent's system from other systems for transmission through the respondent's system (wheeling) for delivery to other systems. Does not include energy purchased or exchanged for consumption within the respondent's system, which was wheeled to the respondent by others.
Purchases from electricity suppliers.
The maximum hourly winter load (for the months of January through March) based on net energy for the system during the reporting year. Net energy for the system is the sum of energy an electric utility needs to satisfy their service area and includes full and partial wholesale requirements customers, and the losses experienced in delivery. The maximum hourly load is determined by the interval in which the 60-minute integrated demand is the greatest.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
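The source and disposition totals described in the columns above can be cross-checked against their components. The sketch below is an illustration only: every dictionary key is a hypothetical stand-in for the corresponding column, the values are made up, and "transmission by others, losses" is treated as a single negative term, as its description suggests.

```python
def net(received, delivered):
    """Net exchange or net wheeled: received minus delivered."""
    return received - delivered


def total_sources(r):
    """Sum the source components listed in the total sources description."""
    return (
        r["net_generation"]
        + r["purchases"]
        + net(r["exchange_received"], r["exchange_delivered"])
        + net(r["wheeled_received"], r["wheeled_delivered"])
        + r["transmission_by_others_losses"]  # expected to be negative
    )


def total_disposition(r):
    """Sum the disposition components listed in the total disposition description."""
    return (
        r["sales_to_ultimate_customers"]
        + r["sales_for_resale"]
        + r["furnished_without_charge"]
        + r["consumed_by_respondent_without_charge"]
        + r["total_energy_losses"]  # expressed as a positive number
    )


# Hypothetical record, all values in MWh.
record = {
    "net_generation": 1000, "purchases": 500,
    "exchange_received": 50, "exchange_delivered": 30,
    "wheeled_received": 200, "wheeled_delivered": 190,
    "transmission_by_others_losses": -5,
    "sales_to_ultimate_customers": 1200, "sales_for_resale": 250,
    "furnished_without_charge": 5,
    "consumed_by_respondent_without_charge": 10,
    "total_energy_losses": 60,
}

sources = total_sources(record)
disposition = total_disposition(record)
balanced = sources == disposition
```

Reported data may not balance exactly, so a tolerance is likely more appropriate than strict equality when applying this to real records.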
core_eia861__yearly_operational_data_revenue
Annual time series of utility revenue by type of electric operating revenue.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report (Schedule 2C)
- Primary key:
utility_id_eia, nerc_region, report_date, revenue_class
Usage Warnings
Some values have been redacted.
Additional Details
A utility's revenue by type of electric operating revenue. Includes electric operating revenue from sales to ultimate customers, revenue from unbundled (delivery) customers, revenue from sales for resale, electric credits/other adjustments, revenue from transmission, other electric operating revenue, and total electric operating revenue.
Respondents are required to report this information to the EIA, but are not required to disclose utility-level data to the public. When a respondent chooses to keep its utility-level data proprietary, it files using EIA utility id 88888. For more details, see EIA-861 Notable Irregularities.
Columns
NERC region in which the plant is located
Date reported.
Amount of revenue.
Source of revenue (e.g., retail sales, transmission).
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_reliability
Annual time series of electricity system reliability and outage impacts.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report (Schedules 3B and 3C)
- Primary key:
utility_id_eia, state, report_date, standard
Note: standard is included because while respondents are asked to only fill out one of parts B or C, sometimes they fill out both.
Additional Details
Contains information on non-momentary electrical interruptions. Includes the system average interruption duration index (SAIDI), the system average interruption frequency index (SAIFI), and the customer average interruption duration index (CAIDI, equal to SAIDI/SAIFI), each with and without major event days and loss of supply. Includes the standard (IEEE/other) and other relevant information.
SAIDI is measured in minutes, and represents the sum over the year of all customers interrupted times the number of minutes they experienced an interruption, divided by total number of customers.
SAIFI is measured in interruptions per customer, and represents the sum over the year of the total number of customers that experienced an interruption, divided by the total number of customers.
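The two definitions above, together with CAIDI = SAIDI/SAIFI, can be sketched as a small calculation. This is an illustrative example only: the function name `reliability_indices` and the event list are hypothetical, and the sketch ignores the major event day and loss of supply adjustments.

```python
def reliability_indices(interruptions, total_customers):
    """Return (SAIDI, SAIFI, CAIDI) for one year.

    interruptions: iterable of (customers_interrupted, minutes) pairs,
    one pair per non-momentary interruption event.
    """
    interruptions = list(interruptions)
    # Sum of customers interrupted times minutes of interruption.
    customer_minutes = sum(c * m for c, m in interruptions)
    # Sum of customers that experienced an interruption.
    customer_interruptions = sum(c for c, _ in interruptions)

    saidi = customer_minutes / total_customers        # minutes per customer
    saifi = customer_interruptions / total_customers  # interruptions per customer
    caidi = saidi / saifi if saifi else None          # minutes per interruption
    return saidi, saifi, caidi


# Hypothetical events: (customers interrupted, duration in minutes).
events = [(500, 60), (200, 30), (1000, 90)]
saidi, saifi, caidi = reliability_indices(events, total_customers=10_000)
print(f"SAIDI={saidi:.2f} min, SAIFI={saifi:.3f}, CAIDI={caidi:.1f} min")
```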
The IEEE standards which can be used to calculate SAIDI and SAIFI include IEEE 1366-2003 and IEEE 1366-2012. These standards define momentary interruptions as having a duration of five minutes or less. If one of these IEEE standards is used, respondents fill out part B and are required to specify:
SAIDI major event days minus loss of supply
SAIFI major event days minus loss of supply
The "major event days minus loss of supply" values for SAIDI and SAIFI exclude reliability events caused by a loss of supply, that is, events that originated on the high-voltage supply system rather than on the distribution system. The voltage that distinguishes the distribution system from the supply system is given in highest_distribution_voltage_kv.
If a method other than these IEEE standards is used for calculating SAIDI and SAIFI indexes, respondents fill out part C and are required to specify:
whether inactive accounts are included
how they define a momentary interruption (less than 1 minute, less than or equal to 5 minutes, or other)
In this table, column standard is "ieee_standard" for respondents who have filled out part B, and "other_standard" for respondents who have filled out part C.
Columns
Average number of minutes per interruption (SAIDI/SAIFI) including major event days and excluding reliability events caused by a loss of supply.
Average number of minutes per interruption (SAIDI/SAIFI) including major event days.
Average number of minutes per interruption (SAIDI/SAIFI) excluding major event days.
Number of customers.
Entity type of principal owner.
The highest voltage that's part of the distribution system.
Whether the respondent includes inactive accounts in its definition of customers used to determine SAIDI and SAIFI.
How the respondent defines momentary service interruptions: less than 1 min, equal to or less than 5 min, or some other way.
Whether the information about customer outages is recorded automatically.
Date reported.
Cumulative duration (minutes) of interruption for the average customer during the report year including major event days and excluding reliability events caused by a loss of supply.
Cumulative duration (minutes) of interruption for the average customer during the report year including major event days.
Cumulative duration (minutes) of interruption for the average customer during the report year excluding major event days.
Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year including major event days.
Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year including major event days and excluding reliability events caused by a loss of supply.
Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year excluding major event days.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Whether the respondent calculates SAIDI/SAIFI, and major event days according to the IEEE or an Other standard.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_sales
Annual time series of electricity sales to ultimate customers by utility, balancing authority, state, and customer class.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
utility_id_eia, state, report_date, balancing_authority_code_eia, customer_class, business_model, service_type
Columns
The EIA Utility Identification number.
Two letter US state abbreviation.
Date reported.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
High level categorization of customer type (e.g., commercial, residential).
Business model.
Is the value observed (True) or imputed (False).
Entity type of principal owner.
The type of service the respondent provides to a given customer class. Bundled: both energy and delivery; energy: just the energy consumed; delivery: just the billing and energy delivery services.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
The name of the utility.
Number of customers.
Quantity of electricity sold in MWh.
Revenue from electricity sold.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_service_territory
Annual time series of counties in utility service territories.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
report_date, utility_id_eia, county_id_fips
Columns
County name.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
The EIA Utility Identification number.
The name of the utility.
Date reported.
Two digit state FIPS code.
County ID from the Federal Information Processing Standard Publication 6-4.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_short_form
Annual time series of data from the short form (EIA-861S).
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
utility_id_eia, state, report_date, balancing_authority_code_eia
Usage Warnings
Some years are missing from the data record.
Additional Details
The data started being reported in 2012. However, the 2019 data is not available. Respondents are expected to submit the completed Form EIA-861S to EIA by April 30th, following the end of the prior calendar year. Utilities report on Form EIA-861S if they:
Reported less than 200,000 megawatthours on their most recent Form EIA-861.
Provide only bundled service (generation and distribution).
Are not needed to ensure acceptable quality of statistical estimates.
Are not part of the aggregate TVA or WPPI.
Do not report on Form EIA-861M.
Columns
Date reported.
The EIA Utility Identification number.
The name of the utility.
Entity type of principal owner.
Two letter US state abbreviation.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Revenue from electricity sold.
Quantity of electricity sold in MWh.
Number of customers.
Whether the plant has a net metering agreement in effect during the reporting year. (Only displayed for facilities that report the sun or wind as an energy source). This field was only reported up until 2015.
Whether there were strategies or measures used to control electricity demand by customers.
Whether the respondent operates any time-based rate programs (e.g., real-time pricing, critical peak pricing, variable peak pricing and time-of-use rates administered through a tariff).
Whether a green pricing program was associated with this utility during the reporting year.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_utility_data_misc
Annual time series of utility business activities.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, and nerc_region, but there are nulls in the state column across several years of reporting.
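A quick way to see why a candidate key fails is to look for null key values and repeated key tuples. The sketch below is plain Python over a list of dicts. The helper name `primary_key_problems` and the sample rows are hypothetical and are not taken from the actual table.

```python
def primary_key_problems(records, key_columns):
    """Return (rows_with_null_keys, duplicated_keys) for a candidate primary key."""
    # A primary key may not contain nulls in any of its columns.
    null_rows = [r for r in records if any(r.get(c) is None for c in key_columns)]

    # A primary key must also be unique across rows.
    seen, duplicates = set(), set()
    for r in records:
        key = tuple(r.get(c) for c in key_columns)
        if key in seen:
            duplicates.add(key)
        seen.add(key)
    return null_rows, duplicates


candidate_key = ["utility_id_eia", "state", "report_date", "nerc_region"]

# Invented rows: one has a null state, and two share the same key values.
rows = [
    {"utility_id_eia": 10, "state": "CO", "report_date": "2015-01-01", "nerc_region": "WECC"},
    {"utility_id_eia": 10, "state": None, "report_date": "2016-01-01", "nerc_region": "WECC"},
    {"utility_id_eia": 11, "state": "TX", "report_date": "2015-01-01", "nerc_region": "TRE"},
    {"utility_id_eia": 11, "state": "TX", "report_date": "2015-01-01", "nerc_region": "TRE"},
]

null_rows, duplicates = primary_key_problems(rows, candidate_key)
print(len(null_rows), "rows with null key columns;", len(duplicates), "duplicated keys")
```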
Additional Details
This includes whether they operate alternative fuel vehicles, whether they provide transmission, distribution, or generation services (bundled or unbundled), and whether they engage in wholesale and/or retail markets.
Columns
Whether the utility plans to operate alternative-fueled vehicles this coming year.
Whether the utility operates alternative-fueled vehicles during the year.
Whether a utility engaged in combined utility services (electricity plus other services such as gas or water) during the year.
Whether a utility bought any distribution on other electrical systems during the year.
Whether a utility bought any transmission services on other electrical systems during the year.
Whether a utility engaged in any distribution using owned/leased electrical wires during the year.
Entity type of principal owner.
Whether a utility utilized generation from company owned plant during the year.
NERC region in which the plant is located
Whether the respondent operated at least one generating plant during the reporting period.
Date reported.
Whether a utility engaged in retail power marketing during the year.
Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.
Two letter US state abbreviation.
Whether a utility engaged in any transmission activities during the year.
The EIA Utility Identification number.
The name of the utility.
Whether a utility engages in wholesale power marketing during the year.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_utility_data_nerc
Annual time series of the NERC regions that utilities operate in.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, nerc_region, and nerc_regions_of_operation, but there are nulls in the state column across several years of reporting.
Columns
NERC region in which the plant is located
All the regional entities within the North American Electric Reliability Corporation (NERC) in which the respondent conducts operations.
Date reported.
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia861__yearly_utility_data_rto
Annual time series of the RTOs that utilities operate in.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 861 -- Annual Electric Power Industry Report
- Primary key:
This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, nerc_region, and rtos_of_operation, but there are nulls in the state column across several years of reporting.
Columns
NERC region in which the plant is located
Date reported.
The ISOs/RTOs in which the respondent conducts operations.
Two letter US state abbreviation.
The EIA Utility Identification number.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__entity_coalmine
Entity table containing static information about coal mines reporting deliveries in the Fuel Receipts and Costs table.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 2, Part C)
- Primary key:
mine_id_pudl
Usage Warnings
Mine IDs are not as well defined and consistent as other EIA IDs.
Additional Details
This table is produced during the transformation of fuel delivery data, in order to produce a better normalized database. The same coalmines report many individual deliveries, and repeating their attributes many times in the fuel receipts and costs table is duplicative. Unfortunately the coalmine attributes do not generally use a controlled vocabulary or well defined IDs and so in practice there are many distinct records in this table that correspond to the same mines in reality.
We have not yet taken the time to rigorously clean this data, but it could be linked with both Mine Safety and Health Administration (MSHA) and USGS data to provide more insight into where coal is coming from, and what the employment and geological context is for those supplies.
Columns
Dynamically assigned PUDL mine identifier.
Coal mine name.
Type of coal mine.
Two letter US state abbreviation.
County ID from the Federal Information Processing Standard Publication 6-4.
MSHA issued mine identifier.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__monthly_boiler_fuel
Monthly time series of boiler fuel consumption and emissions.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
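One rough way to spot the annual-in-December pattern is to flag boiler-years whose only nonzero consumption falls in December. The sketch below is a heuristic, not the method used to build this table. The field name `fuel_consumed_units`, the helper name, and the sample data are assumptions. A boiler that really did run only in December would also be flagged.

```python
from collections import defaultdict
from datetime import date


def december_only_reporters(records):
    """Return (plant_id_eia, boiler_id, year) keys whose only nonzero fuel is in December."""
    months_with_fuel = defaultdict(set)
    for r in records:
        # Treat None and 0 alike: neither counts as reported consumption.
        if r["fuel_consumed_units"]:
            key = (r["plant_id_eia"], r["boiler_id"], r["report_date"].year)
            months_with_fuel[key].add(r["report_date"].month)
    return {key for key, months in months_with_fuel.items() if months == {12}}


# Invented data: boiler "B1" reports everything in December, boiler "1" reports monthly.
records = []
for month in range(1, 13):
    records.append({
        "plant_id_eia": 1, "boiler_id": "B1", "report_date": date(2020, month, 1),
        "fuel_consumed_units": 1200.0 if month == 12 else 0.0,
    })
    records.append({
        "plant_id_eia": 2, "boiler_id": "1", "report_date": date(2020, month, 1),
        "fuel_consumed_units": 100.0,
    })

flagged = december_only_reporters(records)
print(flagged)
```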
Additional Details
Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.
This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
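As a rough illustration of the heat rate calculation mentioned above, heat input is the fuel quantity times its heat content, and the heat rate is that heat input divided by net generation. The sketch assumes the boiler fuel records have already been matched to the generator or unit whose net generation is supplied, which is the hard part in practice. The field names `fuel_consumed_units` and `fuel_mmbtu_per_unit` are assumed, and the numbers are invented.

```python
def heat_rate_mmbtu_per_mwh(fuel_records, net_generation_mwh):
    """Return heat rate in MMBtu per MWh, or None if generation is not positive."""
    # Heat input: physical fuel units times heat content per unit, summed over fuels.
    total_mmbtu = sum(
        r["fuel_consumed_units"] * r["fuel_mmbtu_per_unit"] for r in fuel_records
    )
    if net_generation_mwh is None or net_generation_mwh <= 0:
        return None
    return total_mmbtu / net_generation_mwh


# Hypothetical month for one unit: coal in tons plus a little gas in Mcf.
fuel = [
    {"energy_source_code": "BIT", "fuel_consumed_units": 1000.0, "fuel_mmbtu_per_unit": 20.0},
    {"energy_source_code": "NG", "fuel_consumed_units": 500.0, "fuel_mmbtu_per_unit": 1.03},
]
heat_rate = heat_rate_mmbtu_per_mwh(fuel, net_generation_mwh=2000.0)
print(f"{heat_rate:.3f} MMBtu/MWh")
```

Returning `None` for zero or negative net generation avoids reporting a meaningless or infinite heat rate for months in which a unit consumed more power than it produced.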
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Code for the type of prime mover (e.g. CT, CG)
Simplified fuel type code used in PUDL
Date reported.
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Heat content of the fuel in millions of Btus per physical unit.
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__monthly_energy_storage
Monthly time series of energy consumed and discharged ("generated") by energy storage units.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, prime_mover_code, energy_source_code
Usage Warnings
Date column arbitrarily uses the first of the month.
Additional Details
The total MWh discharged from the energy storage unit during the reporting period is the gross generation and the difference between gross generation and consumption is the net generation.
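The relationship described above is a simple difference. A tiny sketch follows, assuming both quantities are expressed in MWh. The function and argument names are hypothetical, and the values are invented.

```python
def storage_net_generation_mwh(gross_generation_mwh, consumed_mwh):
    """Net generation of a storage unit: energy discharged minus energy consumed."""
    return gross_generation_mwh - consumed_mwh


# A unit that discharged 800 MWh after consuming 1,000 MWh of charging energy.
net = storage_net_generation_mwh(gross_generation_mwh=800.0, consumed_mwh=1000.0)
print(net)  # negative: the unit consumed more energy than it discharged
```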
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
Code for the type of prime mover (e.g. CT, CG)
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Reported unit of measure for fuel.
Consumption for electric generation of the fuel type in physical unit.
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Gross electricity generation for the specified period in megawatt-hours (MWh).
Net electricity generation for the specified period in megawatt-hours (MWh).
core_eia923__monthly_fuel_receipts_costs
Monthly time series of fuel deliveries to power plants.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)
- Primary key:
This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.
Usage Warnings
Date column arbitrarily uses the first of the month.
Some values have been redacted.
Time of fuel deliveries is not necessarily connected with time of fuel consumption.
Additional Details
There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.
Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
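When combining individual deliveries into an average fuel price, one reasonable approach is to weight each delivery by its heat content and skip deliveries whose price is redacted. The sketch below illustrates that idea and is not necessarily how PUDL computes its own fuel costs. The field names `fuel_received_units`, `fuel_mmbtu_per_unit`, and `fuel_cost_per_mmbtu` are assumptions, and the deliveries are invented.

```python
def average_fuel_cost_per_mmbtu(deliveries):
    """Heat-content-weighted average cost per MMBtu, ignoring redacted prices."""
    total_cost = 0.0
    total_mmbtu = 0.0
    for d in deliveries:
        if d["fuel_cost_per_mmbtu"] is None:
            # Redacted price: the delivery cannot contribute to the average.
            continue
        mmbtu = d["fuel_received_units"] * d["fuel_mmbtu_per_unit"]
        total_cost += mmbtu * d["fuel_cost_per_mmbtu"]
        total_mmbtu += mmbtu
    return total_cost / total_mmbtu if total_mmbtu else None


# Three hypothetical coal deliveries to one plant in one month; one price is redacted.
deliveries = [
    {"fuel_received_units": 1000.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 2.0},
    {"fuel_received_units": 500.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 3.0},
    {"fuel_received_units": 300.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": None},
]
avg_cost = average_fuel_cost_per_mmbtu(deliveries)
print(f"{avg_cost:.4f} USD/MMBtu")
```

Because redactions are not randomly distributed, an average computed this way can be biased for fuels and regions with heavy redaction.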
Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
Purchase type under which receipts occurred in the reporting month. C: Contract, NC: New Contract, S: Spot Purchase, T: Tolling Agreement.
Date contract expires. Format: MMYY.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
Fuel groups used in the Electric Power Monthly
Dynamically assigned PUDL mine identifier.
Company that sold the fuel to the plant or, in the case of Natural Gas, pipeline owner.
Quantity of fuel received in tons, barrels, or Mcf.
Heat content of the fuel in millions of Btus per physical unit.
Sulfur content percentage by weight to the nearest 0.01 percent.
Ash content percentage by weight to the nearest 0.1 percent.
Mercury content in parts per million (ppm) to the nearest 0.001 ppm.
Average fuel cost per mmBTU of heat content in nominal USD.
Transportation mode for the longest distance transported.
Transportation mode for the second longest distance transported.
Contract type for natural gas transportation service.
Contract type for natural gas delivery service.
For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.
For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__monthly_generation
Monthly time series of net electricity generated by power plant generators.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, generator_id, report_date
Usage Warnings
Date column arbitrarily uses the first of the month.
Not all IDs are present.
Table has known low coverage - either geographic or temporal or otherwise.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Additional Details
This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel appears in this table due to the different reporting requirements imposed on different types and sizes of generators.
Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Date reported.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__monthly_generation_fuel
Monthly time series of generation and fuel consumption by fuel and prime mover.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, prime_mover_code, energy_source_code
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Additional Details
This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.
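To give a feel for capacity-based attribution, the sketch below splits a plant-level net generation value across generators in proportion to their capacity. This is a deliberately simplified illustration and not the actual PUDL allocation procedure. It assumes the generators passed in already share the record's prime mover and energy source. The field name `capacity_mw`, the function name, and the numbers are hypothetical.

```python
def allocate_by_capacity(net_generation_mwh, generators):
    """Split net generation across generators in proportion to capacity.

    generators: list of dicts with "generator_id" (a string) and "capacity_mw".
    Returns a dict mapping generator_id to allocated MWh.
    """
    total_capacity = sum(g["capacity_mw"] for g in generators)
    if total_capacity <= 0:
        # Nothing sensible to allocate against.
        return {}
    return {
        g["generator_id"]: net_generation_mwh * g["capacity_mw"] / total_capacity
        for g in generators
    }


# Hypothetical plant with two gas turbines reporting 900 MWh for the month.
gens = [
    {"generator_id": "1", "capacity_mw": 100.0},
    {"generator_id": "2A", "capacity_mw": 200.0},
]
allocated = allocate_by_capacity(900.0, gens)
print(allocated)
```

The allocated values always sum back to the reported total, which makes the result easy to check against the plant-level record.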
The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.
Code for the type of prime mover (e.g. CT, CG)
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Consumption for electric generation of the fuel type in physical unit.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical unit, year to date.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia923__monthly_generation_fuel_nuclear
Monthly time series of generation and fuel consumption of nuclear generation units by fuel and prime mover.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 3)
- Primary key:
plant_id_eia, report_date, nuclear_unit_id, energy_source_code, prime_mover_code
Usage Warnings
Date column arbitrarily uses the first of the month.
A small number of respondents only report annual fuel consumption, and all of it is reported in December.
Additional Details
This data is originally reported alongside similar information for fossil fuel plants, but the nuclear data is reported by (nuclear) generation unit rather than fuel type and prime mover, and so has a different primary key.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Date reported.
For nuclear plants only, the unit number. One-digit numeric. Nuclear plants are the only type of plants for which data are shown explicitly at the generating unit level.
A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.
Simplified fuel type code used in PUDL
A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.
Code for the type of prime mover (e.g. CT, CG)
Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Consumption of the fuel type for electric generation, in physical units.
Heat content of the fuel in millions of Btus per physical unit.
Total consumption of fuel in physical units, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Total consumption of fuel to produce electricity, in physical units, year to date.
Net electricity generation for the specified period in megawatt-hours (MWh).
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
core_eia930__hourly_interchange
Hourly time series of balancing authority interchange.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia, balancing_authority_code_adjacent_eia
Additional Details
Interchange is the net metered tie line flow from one BA to another directly interconnected BA. Total net interchange is the net sum of all interchange occurring between a BA and its directly interconnected neighboring BAs. Negative interchange values indicate net inflows, and positive interchange values indicate net outflows.
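As a rough illustration of the sign convention and of how pairwise interchange rolls up into total net interchange, the sketch below sums hypothetical rows keyed like this table. The BA codes and the value field name `interchange_mwh` are assumptions for the example, not names taken from the table.

```python
from collections import defaultdict

# Hypothetical rows: primary key fields plus one assumed value field.
rows = [
    {"datetime_utc": "2024-01-01T01:00", "balancing_authority_code_eia": "AAA",
     "balancing_authority_code_adjacent_eia": "BBB", "interchange_mwh": 120.0},
    {"datetime_utc": "2024-01-01T01:00", "balancing_authority_code_eia": "AAA",
     "balancing_authority_code_adjacent_eia": "CCC", "interchange_mwh": -200.0},
]


def total_net_interchange(rows):
    """Sum pairwise interchange over all neighbors for each (hour, BA)."""
    totals = defaultdict(float)
    for row in rows:
        key = (row["datetime_utc"], row["balancing_authority_code_eia"])
        totals[key] += row["interchange_mwh"]
    return dict(totals)


def flow_direction(mwh):
    """Negative values are net inflows; positive values are net outflows."""
    if mwh > 0:
        return "net outflow"
    if mwh < 0:
        return "net inflow"
    return "balanced"


totals = total_net_interchange(rows)
for (hour, ba), mwh in totals.items():
    print(hour, ba, mwh, flow_direction(mwh))
```

Here the hypothetical BA exports 120 MWh to one neighbor and imports 200 MWh from another, so its total net interchange for the hour is a net inflow.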
Columns
Timestamp at the end of the hour for which the data is reported.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
EIA short code for the other adjacent balancing authority, with which interchange is occurring. Includes Canadian and Mexican BAs.
Original reported energy interchange between adjacent balancing authorities.
core_eia930__hourly_net_generation_by_energy_source
Hourly time series of balancing authority net generation by energy source.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia, generation_energy_source
Usage Warnings
Contains known calculation discrepancies.
Additional Details
BAs report generation from dual-fuel (switching from one fuel to another) and multiple-fuel (using multiple fuels simultaneously) generators under the actual energy source used, if known, and under the generator's primary energy source, if not known.
To maintain generator confidentiality, generation may sometimes be reported in the Other category if too few generators are reported for a particular energy source category.
In theory, the sum of net generation across all energy sources should equal the total net generation reported in the balancing authority operations table. In practice, these values diverge significantly in many cases, and those cases require further investigation.
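One way to find the diverging cases is to compare the by-source sum against the reported total for each BA-hour. The following is a minimal sketch under stated assumptions: the function names, the energy source labels, and the 1% relative tolerance are all chosen for illustration and are not part of PUDL or EIA guidance.

```python
import math


def summed_generation(by_source_mwh):
    """Sum net generation across energy sources, skipping missing values."""
    return sum(value for value in by_source_mwh.values() if value is not None)


def generation_discrepancy(by_source_mwh, total_mwh):
    """Reported total minus the by-source sum, in MWh."""
    return total_mwh - summed_generation(by_source_mwh)


def diverges(by_source_mwh, total_mwh, rel_tol=0.01):
    """True when the two figures differ by more than the assumed tolerance."""
    return not math.isclose(
        summed_generation(by_source_mwh), total_mwh, rel_tol=rel_tol
    )


# Hypothetical BA-hour: one source is missing, so the sum falls short.
by_source = {"coal": 100.0, "gas": 250.0, "wind": None}
print(generation_discrepancy(by_source, 400.0), diverges(by_source, 400.0))
```

Skipping missing values rather than treating the whole hour as unknown is a design choice; a stricter check could return no result whenever any source is missing.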
Columns
Timestamp at the end of the hour for which the data is reported.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
High level energy source used to produce electricity.
Unaltered originally reported net generation for the specified period.
Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Reported net generation with outlying values removed and missing values imputed by EIA.
core_eia930__hourly_operations
Hourly time series of balancing authority net generation, interchange, and demand.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia
Additional Details
Net generation represents the metered output of electric generating units in a BA's electric system. This generation only includes generating units that are managed by a BA or whose operations are visible to a BA.
Generators on the distribution system—both large-scale resources and small-scale distributed resources, such as rooftop solar photovoltaic (PV) systems—are typically not included.
In some electricity publications, EIA reports generation from all utility-scale generating units in the United States. BAs meter only a subset of those utility-scale generating units. As a result, when hourly generation from the EIA-930 is aggregated to monthly or annual levels, the results will be lower than monthly and annual aggregations in other EIA electricity publications.
Interchange is the net metered tie line flow from one BA to another directly interconnected BA. This table includes the net sum of all interchange occurring between a BA and all of its directly interconnected neighboring BAs. For a detailed breakdown of interchange between each pair of adjacent BAs see core_eia930__hourly_interchange.
Negative interchange values indicate net inflows, and positive interchange values indicate net outflows.
Demand is a calculated value representing the amount of electricity load within a BA's electric system. A BA derives its demand value by taking the total metered net electricity generation within its electric system and subtracting the total metered net electricity interchange occurring between the BA and its neighboring BAs.
Each BA produces a day-ahead electricity demand forecast for every hour of the next day. These forecasts help BAs plan for and coordinate the reliable operation of their electric system.
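The demand derivation described above reduces to a single subtraction once the sign convention for interchange is applied. The sketch below uses a hypothetical function name and treats missing inputs as yielding no result, which is an assumption rather than documented EIA behavior.

```python
def derived_demand(net_generation_mwh, interchange_mwh):
    """Demand = net generation minus total net interchange.

    Interchange is negative for net inflows, so an importing BA ends up
    with demand greater than its own generation.
    """
    if net_generation_mwh is None or interchange_mwh is None:
        return None
    return net_generation_mwh - interchange_mwh


# Importing 150 MWh (interchange = -150) raises demand above generation.
print(derived_demand(1000.0, -150.0))
# Exporting 200 MWh (interchange = +200) lowers demand below generation.
print(derived_demand(1000.0, 200.0))
```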
Columns
Timestamp at the end of the hour for which the data is reported.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Unaltered originally reported net generation for the specified period.
Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Reported net generation with outlying values removed and missing values imputed by EIA.
Original reported energy interchange between adjacent balancing authorities.
Energy interchange between adjacent balancing authorities, adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Energy interchange between adjacent balancing authorities, with outliers and missing values imputed by EIA.
Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.
Electricity demand adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.
Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed by EIA.
Day ahead demand forecast.
core_eia930__hourly_subregion_demand
Hourly time series of balancing authority subregion demand.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report
- Primary key:
datetime_utc, balancing_authority_code_eia, balancing_authority_subregion_code_eia
Additional Details
For select BAs, balancing authority subregion demand provides a further level of geographic disaggregation (for example, load zones, weather zones, operating company areas, local BAs, etc.) within a BA's electric system.
Columns
Timestamp at the end of the hour for which the data is reported.
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Code identifying subregions of larger balancing authorities.
Originally reported electricity demand for the balancing area subregion. Note that different BAs have different methods of calculating and allocating subregion demand.
core_eia__codes_averaging_periods
Code table containing descriptions of categorical codes for the averaging period specified by emissions statutes and regulations for the EIA 860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_balancing_authorities
Code table containing descriptions of categorical codes for balancing authorities in EIA 860, EIA 923, and EIA 930.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
EIA balancing authority region code.
Human-readable name of the EIA balancing region.
Timezone used by the reporting entity. For use in localizing UTC times.
Date on which the balancing authority ceased independent operation.
Indicates whether the balancing authority is generation-only, meaning it does not serve retail customers and thus reports only net generation and interchange, but not demand.
EIA interconnect code.
core_eia__codes_balancing_authority_subregions
Code table containing descriptions of categorical codes for balancing authority subregions in EIA 930.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
balancing_authority_code_eia, balancing_authority_subregion_code_eia
Columns
EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.
Code identifying subregions of larger balancing authorities.
Name of the balancing authority subregion.
core_eia__codes_boiler_generator_assn_types
Code table containing descriptions of categorical codes for boiler-generator associations in the EIA 860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_boiler_status
Code table containing descriptions of categorical codes for boiler status in the EIA 860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_boiler_types
Code table containing descriptions of categorical codes for boiler regulatory types in the EIA 860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_coalmine_types
Code table containing descriptions of categorical codes for coalmines reported as fuel sources in the EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_contract_types
Code table containing descriptions of categorical codes for fuel supply contracts reported in EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_cooling_system_types
Code table containing descriptions of categorical codes for cooling systems.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_cooling_tower_types
Code table containing descriptions of categorical codes for cooling towers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_cooling_water_sources
Code table containing descriptions of categorical codes for cooling water sources in EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_cooling_water_types
Code table containing descriptions of categorical codes for cooling water.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_emission_control_equipment_types
Code table containing descriptions of categorical codes for emissions control equipment installed on a boiler.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_energy_sources
Code table containing descriptions of categorical codes for energy sources reported to EIA.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Additional Details
Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Reported unit of measure for fuel.
Minimum heat content per physical unit of fuel in MMBtu.
Maximum heat content per physical unit of fuel in MMBtu.
High level fuel group defined in the 2021-2023 EIA Form 860 instructions, Table 28.
Original fuel from which this refined fuel was derived.
Physical phase of matter of the fuel.
Simplified fuel type code used in PUDL
Long human-readable description of the meaning of a code/label.
core_eia__codes_environmental_equipment_manufacturers
Code table containing descriptions of categorical codes for manufacturers of boilers and environmental control equipment in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_firing_types
Code table containing descriptions of categorical codes for boiler firing types in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_fuel_transportation_modes
Code table containing descriptions of categorical codes for fuel transportation modes reported in the EIA-860 and EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_fuel_types_agg
Code table containing descriptions of categorical codes for aggregated fuel types used in the Annual Energy Review or Monthly Energy Review.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Additional Details
See EIA-923 Fuel Code table for additional information.
Columns
Originally reported short code.
Long human-readable description of the meaning of a code/label.
core_eia__codes_mercury_compliance_strategies
Code table containing descriptions of categorical codes for compliance strategies used to control mercury in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_momentary_interruptions
Code table containing descriptions of categorical codes for utility definitions of momentary service interruptions.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_nox_compliance_strategies
Code table containing descriptions of categorical codes for compliance strategies used to control nitrogen oxide in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_nox_control_status
Code table containing descriptions of categorical codes for the operational status of nitrogen oxide control units associated with boilers in the EIA-860 data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_nox_units
Code table containing descriptions of categorical codes for units of measurement for nitrogen oxide in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_operational_status
Code table containing descriptions of categorical codes for operational status reported to EIA.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Additional Details
Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
The operating status of the asset. For generators this is based on the EIA 860 tab in which the generator was listed.
core_eia__codes_particulate_compliance_strategies
Code table containing descriptions of categorical codes for compliance strategies used to control particulate matter in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_particulate_units
Code table containing descriptions of categorical codes for units of measurement for particulate matter in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_prime_movers
Code table containing descriptions of categorical codes for prime movers reported in the EIA-860 and EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_regulations
Code table containing descriptions of categorical codes for levels of statutes and codes under which boilers operate in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_reporting_frequencies
Code table containing descriptions of categorical codes for the reporting frequencies used by plants in the EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_sector_consolidated
Code table containing descriptions of categorical codes for EIA consolidated NAICS sectors.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Additional Details
Codes and descriptions taken from the EIA-923 File Layout spreadsheet.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_so2_compliance_strategies
Code table containing descriptions of categorical codes for compliance strategies used to control sulfur dioxide in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_so2_units
Code table containing descriptions of categorical codes for units of measurement for sulfur dioxide in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_sorbent_types
Code table containing descriptions of categorical codes for flue gas desulfurization sorbent types used in EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_steam_plant_types
Code table containing descriptions of categorical codes for steam plants in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_storage_enclosure_types
Code table containing descriptions of categorical codes for energy storage enclosures.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_storage_technology_types
Code table containing descriptions of categorical codes for energy storage technologies.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_wet_dry_bottom
Code table containing descriptions of categorical codes for boiler bottoms in the EIA-860.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_eia__codes_wind_quality_class
Code table containing descriptions of categorical codes for wind quality classes.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
code
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
Average wind speed in meters per second.
The extreme 50-year wind gusts at this generator in meters per second.
The upper bounds of the turbulence intensity at the wind site (ratio of standard deviation of fluctuating wind velocity to the mean wind speed).
The lower bounds of the turbulence intensity at the wind site (ratio of standard deviation of fluctuating wind velocity to the mean wind speed).
core_eia__entity_boilers
Entity table containing static information about boilers compiled from the EIA-860 and EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
plant_id_eia, boiler_id
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for boilers are set. It contains values which are expected to remain fixed, while core_eia860__scd_boilers contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
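The selection rule described above can be sketched as a most-common-value vote with a consistency threshold. This is an illustrative simplification, not PUDL's actual harvesting code: the function name is hypothetical, and excluding missing values from the denominator is an assumption.

```python
from collections import Counter


def canonical_value(reported, min_share=0.7):
    """Pick the most frequently reported value if it is consistent enough.

    Returns None when no value reaches `min_share` of the non-missing
    instances, mirroring the null that appears for inconsistent attributes.
    """
    observed = [value for value in reported if value is not None]
    if not observed:
        return None
    value, count = Counter(observed).most_common(1)[0]
    return value if count / len(observed) >= min_share else None


# Hypothetical manufacturer names harvested from ten source records.
print(canonical_value(["ACME"] * 8 + ["Acme Co"] * 2))  # 80% agreement
print(canonical_value(["ACME", "ACME", "Other", "Other"]))  # 50% agreement
```

The first call clears the 70% threshold and keeps the majority value; the second does not, so the attribute would be left null.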
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
Name of boiler manufacturer.
EIA short code for boiler manufacturer.
core_eia__entity_generators
Entity table containing static information about generators compiled from across the EIA-860 and EIA-923.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
plant_id_eia, generator_id
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for generators are set. It contains values which are expected to remain fixed, while core_eia860__scd_generators contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported too inconsistently across the original EIA tables to meet this threshold, it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas
Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.
If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle
Indicates whether the generator is part of a solid fuel gasification system
Indicates whether the generator uses pulverized coal technology
Indicates whether the generator uses fluidized bed technology
Indicates whether the generator uses subcritical technology
Indicates whether the generator uses supercritical technology
Indicates whether the generator uses ultra-supercritical technology
Indicates whether the generator uses stoker technology
Indicates whether the generator uses other combustion technologies
Can this generator operate while bypassing the heat recovery steam generator?
The designation used to identify the price node in RTO/ISO Locational Marginal Price reports
The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report
Indicates whether the generator is associated with a combined heat and power system
The date the generator was originally scheduled to be operational
Indicates whether a fuel switching generator can switch fuels while operating.
Indicates whether the generator was previously reported as indefinitely postponed or canceled
core_eia__entity_plants
Entity table containing static information about plants, compiled from across all EIA-860 and EIA-923 data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
plant_id_eia
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for plants are set. It contains values which are expected to remain fixed, while core_eia860__scd_plants contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported too inconsistently across the original EIA tables to meet this threshold, it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
Name of the city.
County name.
Latitude of the plant's location, in degrees.
Longitude of the plant's location, in degrees.
Two letter US state abbreviation.
Physical street address.
Five digit US Zip Code.
IANA timezone name
core_eia__entity_utilities
Entity table containing static information about utilities, compiled from all EIA data.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
utility_id_eia
Usage Warnings
Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
Contains information from multiple raw inputs.
Additional Details
This is one of two tables where canonical values for utilities are set. It contains values which are expected to remain fixed, while core_eia860__scd_utilities contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported too inconsistently across the original EIA tables to meet this threshold, it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
Columns
The EIA Utility Identification number.
The name of the utility.
core_eia__yearly_fuel_receipts_costs_aggs
Annual time series of aggregated fuel receipts and costs from the EIA API.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA -- Mix of multiple EIA Forms
- Primary key:
fuel_agg, geo_agg, sector_agg, temporal_agg, report_date
Columns
Category of fuel aggregation in EIA bulk electricity data.
Category of geographic aggregation in EIA bulk electricity data.
Category of sectoral aggregation in EIA bulk electricity data.
Category of temporal aggregation in EIA bulk electricity data.
Date reported.
Aggregated fuel receipts, in MMBtu, in EIA bulk electricity data.
Average fuel cost per mmBTU of heat content in nominal USD.
core_eiaaeo__yearly_projected_electric_sales
Annual time series of projected electricity sales by region and customer class.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Annual Energy Outlook (AEO)
- Primary key:
report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, customer_class
Columns
Four-digit year in which the data was reported.
AEO projection region.
Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.
The year of the projected value.
High level categorization of customer type (e.g., commercial, residential).
Quantity of electricity sold in MWh.
core_eiaaeo__yearly_projected_energy_use_by_sector_and_type
Annual time series of projected energy use for commercial, electric power, industrial, residential, and transportation sectors, drawn from AEO Table 2.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Annual Energy Outlook (AEO)
- Primary key:
report_year, region_name_eiaaeo, model_case_eiaaeo, projection_year, energy_use_sector, energy_use_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Additional Details
The series in Table 2 which track energy use by sector do not always define each type of use the same way across sectors. There is detailed information about what is included or excluded in each use type for each sector in the footnotes of the EIA's online AEO data browser:
https://www.eia.gov/outlooks/aeo/data/browser/#/?id=2-AEO2023
Use caution when aggregating across use types! Energy Use has a tricky system of subtotals, and summing all types within a sector will result in double-counting. Consult the EIA's data browser for visibility into which use types are subtotals, and what they contain: subtotal series are displayed indented, and include all lines above them which are one level out, up to the next indented line. Delivered Energy and Total are special cases which include those plus all subtotals above. In this way, "Delivered Energy" includes purchased electricity, renewable energy, and an array of fuels based on sector, and explicitly excludes electricity-related losses.
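The double-counting risk described above can be illustrated with a small sketch. The rows, the energy_use_type labels, and the numbers below are invented for illustration and are not real AEO values. Which labels count as subtotals is something you would determine yourself from the EIA data browser.

```python
# Hypothetical rows: (energy_use_sector, energy_use_type, energy_use_mmbtu).
rows = [
    ("residential", "purchased_electricity", 5.0),
    ("residential", "natural_gas", 4.0),
    ("residential", "delivered_energy", 9.0),  # subtotal of the two rows above
    ("residential", "electricity_related_losses", 8.0),
    ("residential", "total", 17.0),  # delivered energy plus losses
]

# Assumed set of subtotal labels, compiled by hand from the data browser.
SUBTOTAL_TYPES = {"delivered_energy", "total"}


def sum_without_subtotals(rows, subtotal_types):
    """Sum energy use over rows whose use type is not a known subtotal."""
    return sum(value for _, use_type, value in rows if use_type not in subtotal_types)


naive_sum = sum(value for _, _, value in rows)  # counts subtotals too
careful_sum = sum_without_subtotals(rows, SUBTOTAL_TYPES)
```

In this toy data the naive sum is 43.0 while the careful sum is 17.0, matching the reported total row.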
AEO Energy Use figures are variously referred to as delivered energy, energy consumption, energy use, and energy demand, depending on which use types are being discussed, and which org and which document is describing them. In PUDL we say energy use or energy consumption.
Columns
Four-digit year in which the data was reported.
EIA AEO region for energy consumption. Includes US Census Divisions plus United States.
Region type for EIA AEO energy consumption, indicating whether region_name_eiaaeo is a US Census Division or country (United States)
Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.
The year of the projected value.
Sector for energy use figures in AEO Table 2. Similar to customer class, but with some missing and some extra values.
Type of energy use, indicating the name of the series from AEO Table 2. Includes fuels, electricity, losses, and various subtotals; consult table documentation for aggregation guidelines.
Energy use, in MMBtu; also referred to as energy consumption, energy demand, or delivered energy, depending on type.
core_eiaaeo__yearly_projected_fuel_cost_in_electric_sector_by_type
Annual time series of projected fuel prices for the electric power sector, including electricity-only and combined-heat-and-power plants that have a regulatory status.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Annual Energy Outlook (AEO)
- Primary key:
report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, fuel_type_eiaaeo
Columns
Four-digit year in which the data was reported.
AEO projection region.
Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.
The year of the projected value.
Fuel type reported for AEO end-use sector generation data.
Average fuel cost per mmBTU of heat content in nominal USD.
Average fuel cost per mmBTU of heat content in real USD, standardized to the value of a USD in the year defined by real_cost_basis_year.
Four-digit year which is the basis for any 'real cost' monetary values (as opposed to nominal values).
core_eiaaeo__yearly_projected_generation_in_electric_sector_by_technology
Annual time series of projected generation capacity and total generation in the electric sector, broken out by technology.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Annual Energy Outlook (AEO)
- Primary key:
report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, technology_description_eiaaeo
Columns
Four-digit year in which the data was reported.
AEO projection region.
Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.
The year of the projected value.
Generation technology reported for AEO.
The net summer capacity.
The total planned additions to net summer generating capacity.
The total unplanned additions to net summer generating capacity.
The total retirements from net summer generating capacity.
Gross electricity generation for the specified period in megawatt-hours (MWh).
core_eiaaeo__yearly_projected_generation_in_end_use_sectors_by_fuel_type
Annual time series of projected generation capacity and total generation in the end-use sector, broken out by fuel type.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EIA Annual Energy Outlook (AEO)
- Primary key:
report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, fuel_type_eiaaeo
Usage Warnings
The solar photovoltaic capacity portion of Renewable Sources is reported in megawatts direct current; other technologies are reported in megawatts of alternating current.
Additional Details
Includes combined-heat-and-power plants and electricity-only plants in the commercial and industrial sectors; and small on-site generating systems in the residential, commercial, and industrial sectors used primarily for own-use generation, but which may also sell some power to the grid.
Columns
Four-digit year in which the data was reported.
AEO projection region.
Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.
The year of the projected value.
Fuel type reported for AEO end-use sector generation data.
The net summer capacity.
Gross electricity generation for the specified period in megawatt-hours (MWh).
core_epa__assn_eia_epacamd
Association table providing connections between EPA units and EIA plants, boilers, and generators.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EPA -- Mix of multiple EPA sources
- Primary key:
This table has no primary key.
Additional Details
This crosswalk table comes from the PUDL fork of the EPA camd-eia-crosswalk Github repo: https://github.com/catalyst-cooperative/camd-eia-crosswalk-latest.
The camd-eia-crosswalk README and our EPA CEMS data source documentation page (../data_sources/epacems) describe the complicated relationship between EIA and EPA data, specifically the nature of EPA vs. EIA "units" and the level of granularity at which one can connect the two sources.
The original EPA crosswalk runs on 2018 EIA data. We adapted the crosswalk code to run on each new year of EIA data, capturing changes in plant information over time.
Our version of the crosswalk clarifies some of the column names and removes unmatched rows. The pudl.etl.glue_assets.core_epa__assn_eia_epacamd function doc strings explain what changes are made from the EPA's version.
Columns
Four-digit year in which the data was reported.
The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.
Emissions (smokestack) unit monitored by EPA CEMS.
Generator ID used by the EPA.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Alphanumeric boiler ID.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
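The warning about treating generator IDs as strings can be demonstrated with a short sketch. The CSV content below is made up for illustration. The point is only that numeric coercion can merge IDs that differ as text.

```python
import csv
import io

# Hypothetical extract: three distinct generator IDs at one plant.
raw = "plant_id_eia,generator_id\n3,01\n3,1\n3,GT1\n"
records = list(csv.DictReader(io.StringIO(raw)))

# The csv module yields strings, so all three IDs stay distinct.
ids_as_text = {r["generator_id"] for r in records}


def coerce_like_a_number(text):
    """Mimic a loader that turns digit-only fields into integers."""
    return int(text) if text.isdigit() else text


# After coercion, "01" and "1" both become the integer 1 and collide.
ids_coerced = {coerce_like_a_number(r["generator_id"]) for r in records}
```

Here ids_as_text has three members but ids_coerced has only two. With pandas, the equivalent precaution is passing dtype={"generator_id": str} to read_csv.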
core_epa__assn_eia_epacamd_subplant_ids
Association table providing connections between EPA units and EIA units/generators, at the subplant level.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EPA -- Mix of multiple EPA sources
- Primary key:
This table has no primary key. The primary key would have been plant_id_eia, generator_id, subplant_id and emissions_unit_id_epa, but there are some null records in the generator_id column. About 2 percent of all EPA CAMD records are not successfully mapped to EIA generators.
Additional Details
This table is an augmented version of the core_epa__assn_eia_epacamd crosswalk table which initially comes from the EPA's Github repo camd-eia-crosswalk: https://github.com/USEPA/camd-eia-crosswalk.
This table identifies subplants within plant_ids, which are the smallest coherent units for aggregation. A plant_id refers to a legal entity that often contains multiple distinct power plants, even of different technology or fuel types.
EPA CEMS data combines information from several parts of a power plant:
emissions from smokestacks
fuel use from combustors
electricity production from generators
But smokestacks, combustors, and generators can be connected in complex, many-to-many relationships. This complexity makes attribution difficult, for example when allocating pollution to energy producers. Furthermore, heterogeneity within plant_ids makes aggregation to the parent entity difficult or inappropriate.
This table draws on the EPA's crosswalk, the IDs from the EPA CAMD core_epacems__hourly_emissions table itself, the core_eia860__assn_boiler_generator table, and the core_eia860__scd_generators table. While the core_epa__assn_eia_epacamd table is the backbone of this table, the EPA CAMD IDs ensure there is complete coverage of EPA CAMD reporting units. The added EIA-860 tables ensure there is also complete coverage of the units reported there.
For more information about how this subplant_id is made, see the documentation for pudl.etl.glue_assets.make_subplant_ids and pudl.etl.glue_assets.update_subplant_ids.
By analyzing the relationships between combustors and generators, as provided in the core_epa__assn_eia_epacamd crosswalk, we can nevertheless identify distinct power plants within a plant_id. These are the smallest coherent units of aggregation.
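One way to picture how many-to-many links resolve into distinct groups is to treat emissions units and generators as nodes in a graph and find its connected components. The sketch below does this with a small union-find structure. It is not the implementation in pudl.etl.glue_assets: the function name group_into_subplants and the sample IDs are hypothetical, and the real procedure may differ in important ways.

```python
def group_into_subplants(links):
    """Assign a group label to every node in a set of unit-generator links.

    links: iterable of (emissions_unit_id, generator_id) pairs for one plant.
    Returns a dict mapping ("unit", id) or ("gen", id) to an integer label.
    Hypothetical helper illustrating connected components only.
    """
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    # Union the two endpoints of every link.
    for unit_id, gen_id in links:
        root_unit = find(("unit", unit_id))
        root_gen = find(("gen", gen_id))
        if root_unit != root_gen:
            parent[root_unit] = root_gen

    # Number the components in order of first appearance.
    labels = {}
    groups = {}
    for node in list(parent):
        root = find(node)
        groups[node] = labels.setdefault(root, len(labels))
    return groups


# Units 1 and 2 share generator GT2, so they fall into one group with GT1;
# unit 3 and generator ST1 form a separate group.
example = group_into_subplants([("1", "GT1"), ("1", "GT2"), ("2", "GT2"), ("3", "ST1")])
```

Each resulting label plays the role of a subplant in this toy setting: everything inside a group is connected, and nothing crosses between groups.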
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.
Sub-plant ID links EPA CEMS emissions units to EIA units.
Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.
Emissions (smokestack) unit monitored by EPA CEMS.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
core_epacems__hourly_emissions
Hourly time series of emissions and plant operational data as required by 40 CFR Part 75.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
EPA Hourly Continuous Emission Monitoring System (CEMS)
- Primary key:
plant_id_epa, emissions_unit_id_epa, operating_datetime_utc
Usage Warnings
Large table; do not attempt to open with Excel.
Not all IDs are present.
Contains information from multiple raw inputs.
Additional Details
Continuous Emissions Monitoring Systems (CEMS) are used to determine the rate of gas or particulate matter exiting a point source of emissions. The EPA Clean Air Markets Division (CAMD) has collected data on power plant emissions from CEMS units stretching back to 1995. The CEMS dataset includes hourly gross load, SO2, CO2, and NOx emissions associated with a given emissions-unit during startup, shutdown, and instances of malfunction. An EPA CEMS emissions-unit or smokestack unit is not the same as an EIA unit. See core_epa__assn_eia_epacamd for details about how to connect CEMS data to corresponding EIA units.
Only fossil-combustion units over 25 MW are required to install and use CEMS, so some units that report in EIA-860 or EIA-923 do not have any CEMS data.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.
Emissions (smokestack) unit monitored by EPA CEMS.
Date and time measurement began (UTC).
Year the data was reported in, used for partitioning EPA CEMS.
Two letter US state abbreviation.
Length of time interval measured.
Average power in megawatts delivered during time interval measured.
The energy contained in fuel burned, measured in million BTU.
Total steam pressure produced by a unit during the reported hour.
Sulfur dioxide emissions in pounds.
Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.
NOx emissions in pounds.
Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.
Carbon dioxide emissions in short tons.
Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.
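Because gross load is described as average power over the measured interval, a rough gross energy figure can be derived by multiplying it by the interval length. The sketch below assumes that interpretation. The key names (operating_time_hours, gross_load_mw, co2_mass_tons) and all values are assumptions for illustration, and treating missing values as zero is a simplification.

```python
# Hypothetical hourly records for one emissions unit.
hours = [
    {"operating_time_hours": 1.0, "gross_load_mw": 100.0, "co2_mass_tons": 50.0},
    {"operating_time_hours": 0.5, "gross_load_mw": 80.0, "co2_mass_tons": 20.0},
    {"operating_time_hours": 0.0, "gross_load_mw": None, "co2_mass_tons": None},
]


def gross_generation_mwh(record):
    """Average power (MW) times interval length (hours); missing load counts as 0."""
    load = record["gross_load_mw"]
    return 0.0 if load is None else load * record["operating_time_hours"]


total_mwh = sum(gross_generation_mwh(r) for r in hours)
total_co2_tons = sum(r["co2_mass_tons"] or 0.0 for r in hours)

# Short tons of CO2 per MWh of gross generation over the whole period.
co2_tons_per_mwh = total_co2_tons / total_mwh
```

With these made-up numbers the unit produces 140.0 MWh and 70.0 short tons of CO2, or 0.5 tons per MWh. Aggregating energy and emissions first and dividing afterwards avoids averaging hourly rates that carry very different weights.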
core_ferc1__codes_power_purchase_types
Code table containing descriptions of categorical codes for electric power purchase types.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities
- Primary key:
code
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Originally reported short code.
Longer human-readable code using snake_case
Long human-readable description of the meaning of a code/label.
core_ferc1__yearly_balance_sheet_assets_sched110
Annual time series of utility assets and other debits.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)
- Primary key:
utility_id_ferc1, report_year, asset_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Type of asset being reported to the core_ferc1__yearly_balance_sheet_assets_sched110 table.
Account balance at end of year.
Account balance at beginning of year.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
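The source record identifier format listed above can be split back into its parts. Since the table name may itself contain underscores, the sketch below splits from the right. It assumes all five trailing fields are present and contain no underscores, which may not hold for every record. The function name and the sample identifier are hypothetical.

```python
FIELD_NAMES = (
    "table_name",
    "report_year",
    "report_prd",
    "respondent_id",
    "spplmnt_num",
    "row_number",
)


def parse_record_id(record_id):
    """Split an identifier of the documented form into named string fields.

    Splits from the right so underscores inside table_name are preserved.
    Raises ValueError if fewer than six parts are found.
    """
    parts = record_id.rsplit("_", 5)
    if len(parts) != len(FIELD_NAMES):
        raise ValueError(f"unexpected record_id format: {record_id!r}")
    return dict(zip(FIELD_NAMES, parts))


# Made-up identifier following the documented pattern.
parsed = parse_record_id("example_table_2005_12_17_0_3")
```

For the sample identifier, table_name comes back as "example_table" and report_year as "2005". All fields are returned as strings.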
core_ferc1__yearly_balance_sheet_liabilities_sched110
Annual time series of utility liabilities and other credits.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)
- Primary key:
utility_id_ferc1, report_year, liability_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Account balance at beginning of year.
Account balance at end of year.
Type of liability being reported to the core_ferc1__yearly_balance_sheet_liabilities_sched110 table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
core_ferc1__yearly_cash_flows_sched120
Annual time series of utility cash flow.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 120)
- Primary key:
utility_id_ferc1, report_year, amount_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Label describing the type of amount being reported. This could be a balance or a change in value.
Reported amount of dollars. This could be a balance or a change in value.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_depreciation_by_function_sched219
Annual time series of ending balances in accumulated provision for depreciation of electric utility plant.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section B)
- Primary key:
utility_id_ferc1, report_year, depreciation_type, plant_function, plant_status, utility_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric utility plant refers to FERC Account 108. Section B includes the Balances at End of Year According to Functional Classification (plant_function).
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Utility plant financial status (in service, future, leased, total).
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Account balance at end of year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_depreciation_changes_sched219
Annual time series of changes in accumulated provision for depreciation of electric utility plant.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section A)
- Primary key:
utility_id_ferc1, report_year, depreciation_type, plant_status, utility_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric utility plant refers to FERC Account 108.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.
Utility plant financial status (in service, future, leased, total).
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Dollar value of reported income, expense, asset, or liability.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
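The record_id format described above can be split back into its components. The following is a minimal sketch, assuming the five trailing fields are always integers and that only table_name may contain underscores; the example ID and the names RecordId and parse_record_id are hypothetical, and IDs that do not follow this layout will raise an error.

```python
from typing import NamedTuple


class RecordId(NamedTuple):
    table_name: str
    report_year: int
    report_prd: int
    respondent_id: int
    spplmnt_num: int
    row_number: int


def parse_record_id(record_id: str) -> RecordId:
    """Split a record_id of the documented form into named fields."""
    # table_name can itself contain underscores, so split from the right
    # and keep everything left of the last five separators as the name.
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"unexpected record_id layout: {record_id!r}")
    table_name, *numbers = parts
    # Assumption: the trailing fields are numeric. int() raises ValueError otherwise.
    return RecordId(table_name, *(int(n) for n in numbers))


# Hypothetical ID, for illustration only.
example = parse_record_id("f1_example_table_2019_12_145_0_3")
```

Splitting from the right avoids guessing how many underscores the table name contains.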
core_ferc1__yearly_depreciation_summary_sched336
Annual time series of depreciation and amortization of electric plant.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 336 - Section A)
- Primary key:
utility_id_ferc1, report_year, plant_function, ferc_account_label
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Plant refers to FERC Accounts 403, 404, and 405. This table only contains information from Section A: Summary of depreciation and amortization changes.
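Because some rows hold subtotals, a naive sum over the value column can double count. The sketch below illustrates one way to exclude calculated rows before aggregating. The column names row_type_xbrl and dollar_value, the marker string "calculated_value", and the sample rows are all assumptions made for illustration; check the actual column names and values before relying on this.

```python
# Hypothetical rows: two reported accounts and one calculated subtotal.
rows = [
    {"ferc_account_label": "account_a", "row_type_xbrl": "reported_value", "dollar_value": 100.0},
    {"ferc_account_label": "account_b", "row_type_xbrl": "reported_value", "dollar_value": 50.0},
    {"ferc_account_label": "subtotal_ab", "row_type_xbrl": "calculated_value", "dollar_value": 150.0},
]


def sum_reported(records):
    """Sum dollar values, skipping rows flagged as calculated (assumed marker)."""
    return sum(
        r["dollar_value"]
        for r in records
        if r["row_type_xbrl"] != "calculated_value"
    )


naive_total = sum(r["dollar_value"] for r in rows)  # counts the subtotal too
reported_total = sum_reported(rows)                 # reported rows only
```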
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Four-digit year in which the data was reported.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).
Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Dollar value of reported income, expense, asset, or liability.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_energy_dispositions_sched401
Annual time series of dispositions of electric energy sold, exchanged, or stored.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)
- Primary key:
utility_id_ferc1, report_year, energy_disposition_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Energy Account, dispositions only. Schedule 401a. Covers electricity that utilities delivered to end users, internal losses, etc.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Type of energy disposition reported in the core_ferc1__yearly_energy_dispositions_sched401 table. Dispositions include sales to customers, re-sales of energy, energy used internally, losses, etc.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Sources and uses of energy in MWh.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
core_ferc1__yearly_energy_sources_sched401
Annual time series of sources of electric energy generated or purchased, exchanged and wheeled.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)
- Primary key:
utility_id_ferc1, report_year, energy_source_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Electric Energy Account, sources only. Schedule 401a. Amount of electricity the utility obtained from each of several sources.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Type of energy source reported in the core_ferc1__yearly_energy_sources_sched401 table. There are three groups of energy sources: generation, power exchanges and transmission.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Sources and uses of energy in MWh.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
core_ferc1__yearly_hydroelectric_plants_sched406
Annual time series of plant statistics for large hydroelectric generating plants.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 406)
- Primary key:
This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
The plant_type and construction_type are standardized into categorical values from free-form strings.
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Large plants have an installed nameplate capacity of more than 10 MW.
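Since respondents sometimes report individual units and then a plant total, it can help to flag likely total rows before aggregating. This is a rough sketch that only searches plant_name_ferc1 for the word "total"; the plant names are invented, and a keyword match will miss totals that are labeled differently or not labeled at all.

```python
import re

# Match "total" as a whole word, ignoring case.
TOTAL_PATTERN = re.compile(r"\btotal\b", re.IGNORECASE)


def looks_like_total(plant_name: str) -> bool:
    """Return True if the free-form plant name mentions a total."""
    return bool(TOTAL_PATTERN.search(plant_name))


# Invented plant names, for illustration only.
names = ["Example Dam unit 1", "Example Dam unit 2", "Example Dam Total"]
flagged = [name for name in names if looks_like_total(name)]
```

Flagged rows still need manual review, because the name field is free-form.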
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
FERC Licensed Project Number.
Type of plant.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Total installed (nameplate) capacity, in megawatts.
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Hours the plant was connected to load while generating in the report year.
Net plant capability under the most favorable operating conditions, in megawatts.
Net plant capability under the least favorable operating conditions, in megawatts.
The average number of employees assigned to each plant.
Net electricity generation for the specified period in megawatt-hours (MWh).
Cost of plant: land and land rights (USD).
Cost of plant: structures and improvements (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: equipment (USD).
Cost of plant: roads, railroads, and bridges (USD).
Asset retirement cost (USD).
Total cost of plant (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Production expenses: operations, supervision, and engineering (USD).
Production expenses: water for power (USD).
Production expenses: hydraulic expenses (USD).
Production expenses: electric expenses (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: rents (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: maintenance of structures (USD).
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: maintenance of electric plant (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Total production expenses, excluding fuel (USD).
Total production expenses (USD per MWh generated).
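The two per-unit columns above are ratios of other columns in this table. As a hedged sketch, assuming cost per MW is total plant cost divided by installed capacity and expenses per MWh is total production expenses divided by net generation, they could be recomputed as follows. The helper name safe_ratio and all input values are hypothetical.

```python
from typing import Optional


def safe_ratio(numerator: Optional[float], denominator: Optional[float]) -> Optional[float]:
    """Divide two values, returning None for missing inputs or a zero denominator."""
    if numerator is None or denominator is None or denominator == 0:
        return None
    return numerator / denominator


# Invented figures: a 25 MW plant costing 50 million USD.
capex_per_mw = safe_ratio(50_000_000.0, 25.0)
# Invented figures: 1.2 million USD of expenses over 80,000 MWh.
opex_per_mwh = safe_ratio(1_200_000.0, 80_000.0)
```

Returning None rather than raising keeps plants with zero or missing generation from breaking a batch calculation.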
core_ferc1__yearly_income_statements_sched114
Annual time series of utility income statements.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 114)
- Primary key:
utility_id_ferc1, report_year, utility_type, income_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Type of income reported in core_ferc1__yearly_income_statements_sched114 table.
Dollar value of reported income, expense, asset, or liability.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_operating_expenses_sched320
Annual time series of operating and maintenance costs associated with producing electricity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 320)
- Primary key:
utility_id_ferc1, report_year, expense_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Dollar value of reported income, expense, asset, or liability.
The type of expense.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_operating_revenues_sched300
Annual time series of utilities' electric operating revenues.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 300)
- Primary key:
utility_id_ferc1, report_year, revenue_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table includes only the structured part of Schedule 300. A number of revenue_type values do not have sales_mwh or avg_customers_per_month provided, in which case these columns will be NULL.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Label describing types of revenues.
Dollar value of reported income, expense, asset, or liability.
Quantity of electricity sold in MWh.
Average number of customers per month.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
core_ferc1__yearly_other_regulatory_liabilities_sched278
Annual time series of utilities' other regulatory liabilities, including rate order docket number.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 278)
- Primary key:
This table has no primary key. Respondents are able to enter any number of liabilities across many rows. There are no IDs or set fields enforced in the original table.
Usage Warnings
The description column is a free-form string.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Long human-readable description of the meaning of a code/label.
Account balance at end of year.
Account balance at beginning of year.
The increase during the reporting period of other regulatory liabilities.
Description of the account number credited from making debit adjustment to other regulatory liabilities.
The decrease during the reporting period of other regulatory liabilities.
core_ferc1__yearly_plant_in_service_sched204
Annual time series of utilities' balances and changes to FERC Electric Plant in Service accounts.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 204)
- Primary key:
utility_id_ferc1, report_year, ferc_account_label
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Account numbers correspond to the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Each FERC respondent reports starting and ending balances for each account annually. Balances are organization-wide, and are not broken down on a per-plant basis. The end-of-year balance should equal the beginning-of-year balance plus the sum of additions, retirements, adjustments, and transfers.
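The balance identity described above can be checked row by row. The sketch below follows the text literally and adds all four change columns, which assumes retirements are stored as signed (negative) amounts; if they are stored as positive numbers they would need to be negated first. The function name, tolerance, and figures are illustrative only.

```python
import math


def balance_gap(starting, additions, retirements, adjustments, transfers, ending):
    """Return the ending balance minus the balance implied by the changes."""
    # Assumption: retirements are signed, so a plain sum applies.
    expected = starting + additions + retirements + adjustments + transfers
    return ending - expected


# Invented figures: 1000 + 200 - 50 + 10 + 0 = 1160.
gap = balance_gap(1000.0, 200.0, -50.0, 10.0, 0.0, 1160.0)
# Allow a small absolute tolerance for rounding in reported values.
is_consistent = math.isclose(gap, 0.0, abs_tol=0.5)
```

A nonzero gap does not say which column is wrong, only that the reported row does not reconcile.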
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Account balance at beginning of year.
Cost of acquisition of items classified within the account.
Cost of disposal of items classified within the account.
Cost of adjustments to the account.
Cost of transfers into (out of) the account.
Account balance at end of year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Utility plant financial status (in service, future, leased, total).
core_ferc1__yearly_pumped_storage_plants_sched408
Annual time series of plant statistics for hydroelectric pumped storage plants with an installed nameplate capacity of 10+ MW.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 408)
- Primary key:
This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
As reported in Schedule 408 of FERC Form 1.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
FERC Licensed Project Number.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Total installed (nameplate) capacity, in megawatts.
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Hours the plant was connected to load while generating in the report year.
Net plant capability in megawatts.
The average number of employees assigned to each plant.
Net electricity generation for the specified period in megawatt-hours (MWh).
Energy used for pumping, in megawatt-hours.
Net output for load (net generation - energy used for pumping) in megawatt-hours.
Cost of plant: land and land rights (USD).
Cost of plant: structures and improvements (USD).
Cost of plant: reservoirs, dams, and waterways (USD).
Cost of plant: water wheels, turbines, and generators (USD).
Cost of plant: accessory electric equipment (USD).
Cost of plant: miscellaneous power plant equipment (USD).
Cost of plant: roads, railroads, and bridges (USD).
Asset retirement cost (USD).
Total cost of plant (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Production expenses: operations, supervision, and engineering (USD).
Production expenses: water for power (USD).
Production expenses: pumped storage (USD).
Production expenses: electric expenses (USD).
Production expenses: miscellaneous power generation expenses (USD).
Production expenses: rents (USD).
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: maintenance of structures (USD).
Production expenses: maintenance of reservoirs, dams, and waterways (USD).
Production expenses: maintenance of electric plant (USD).
Production expenses: maintenance of miscellaneous hydraulic plant (USD).
Total production expenses before pumping (USD).
Production expenses: pumping expenses (USD).
Total production expenses, excluding fuel (USD).
Total production expenses (USD per MWh generated).
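The net output for load column above is defined as net generation minus energy used for pumping. A small sketch of that arithmetic, with invented figures and a hypothetical function name:

```python
def net_load_mwh(net_generation_mwh: float, energy_used_for_pumping_mwh: float) -> float:
    """Net output for load: net generation minus energy used for pumping."""
    return net_generation_mwh - energy_used_for_pumping_mwh


# Invented figures: the plant used more energy for pumping than it generated.
example_net_load = net_load_mwh(120_000.0, 150_000.0)
```

A negative result is plausible for pumped storage, since pumping generally consumes more energy than the plant later generates.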
core_ferc1__yearly_purchased_power_and_exchanges_sched326
Annual time series of purchased power (Account 555) including power exchanges (transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 326)
- Primary key:
This table has no primary key. The primary key for this table would be record_id; however, a small number of these values are duplicated in 2021, 2022, and 2024.
Usage Warnings
Contains columns which may appear categorical, but are actually free text.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table has data about inter-utility power purchases. This includes how much electricity was purchased, how much it cost, and who it was purchased from. Unfortunately the field describing which other utility the power was being bought from (seller_name) is poorly standardized, making it difficult to correlate with other data.
Purchased Power is considered FERC Account 555 according to FERC's Uniform System of Accounts. Reported on pages 326-327 of FERC Form 1.
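Because a few record_id values are duplicated, it is worth checking for repeats before treating record_id as a key. A minimal sketch using only the standard library; the IDs shown are invented.

```python
from collections import Counter


def duplicated_ids(record_ids):
    """Return the sorted record_id values that appear more than once."""
    counts = Counter(record_ids)
    return sorted(rid for rid, n in counts.items() if n > 1)


# Invented IDs, for illustration only.
ids = ["tbl_2021_12_1_0_1", "tbl_2021_12_1_0_2", "tbl_2021_12_1_0_1"]
dupes = duplicated_ids(ids)
```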
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the seller, or the other party in an exchange transaction.
Categorization based on the original contractual terms and conditions of the service. Must be one of 'requirements', 'long_firm', 'intermediate_firm', 'short_firm', 'long_unit', 'intermediate_unit', 'electricity_exchange', 'other_service', or 'adjustment'. Requirements service is ongoing high reliability service, with load integrated into system resource planning. 'Long term' means 5+ years. 'Intermediate term' is 1-5 years. 'Short term' is less than 1 year. 'Firm' means not interruptible for economic reasons. 'unit' indicates service from a particular designated generating unit. 'exchange' is an in-kind transaction.
FERC Rate Schedule Number or Tariff. (Note: may be incomplete if originally reported on multiple lines.)
Monthly average billing demand (for requirements purchases, and any transactions involving demand charges). In megawatts.
Average monthly non-coincident peak (NCP) demand (for requirements purchases, and any transactions involving demand charges). Monthly NCP demand is the maximum metered hourly (60-minute integration) demand in a month. In megawatts.
Average monthly coincident peak (CP) demand (for requirements purchases, and any transactions involving demand charges). Monthly CP demand is the metered demand during the hour (60-minute integration) in which the supplier's system reaches its monthly peak. In megawatts.
Megawatt-hours shown on bills rendered to the respondent. Includes both electricity purchased for storage and non-storage purposes, which were lumped together prior to 2021.
Number of megawatt hours purchased during the period for energy storage.
Number of megawatt hours purchased during the period for other than energy storage.
Gross megawatt-hours received in power exchanges and used as the basis for settlement.
Gross megawatt-hours delivered in power exchanges and used as the basis for settlement.
Demand charges (USD).
Energy charges (USD).
Other charges, including out-of-period adjustments (USD).
Sum of demand, energy, and other charges (USD). For power exchanges, the settlement amount for the net receipt of energy. If more energy was delivered than received, this amount is negative.
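The service categorization column above is described as taking one of nine values. As a sketch, the allowed set can be written down and used to spot unexpected values; the constant and function names are hypothetical, and the sample input is invented.

```python
# The nine categories listed in the column description.
ALLOWED_PURCHASE_TYPES = frozenset({
    "requirements",
    "long_firm",
    "intermediate_firm",
    "short_firm",
    "long_unit",
    "intermediate_unit",
    "electricity_exchange",
    "other_service",
    "adjustment",
})


def unexpected_purchase_types(values):
    """Return the sorted non-null values that are not in the documented set."""
    return sorted({v for v in values if v is not None} - ALLOWED_PURCHASE_TYPES)


# Invented sample with one value outside the documented set and one null.
unexpected = unexpected_purchase_types(["long_firm", "short_unit", None, "adjustment"])
```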
core_ferc1__yearly_retained_earnings_sched118
Annual time series of utilities' statements of retained earnings.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 118)
- Primary key:
utility_id_ferc1, report_year, earnings_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
Label describing types of earnings.
Account balance at beginning of year.
Account balance at end of year.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
core_ferc1__yearly_sales_by_rate_schedules_sched304
Annual time series of utilities' electric sales from all rate schedules in effect throughout the year.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 304)
- Primary key:
This table has no primary key.
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
Values in rate_schedule_description are free-form strings.
Data prior to 2021 does not include information in columns: rate_schedule_type and billing_status.
Units of revenue_per_kwh are suspected to include a mix of dollars and possibly cents.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
The pre-2021 data in this table (extracted from FoxProDB vs. XBRL) is extremely unstructured. Where the post-2020 data (from XBRL) sorts the data into rate schedule types: residential, industrial, commercial, public_lighting, public_authorities, railroads, interdepartmental, provision_for_rate_refund, commercial_and_industrial, total, and billing status: billed, unbilled, total, the pre-2021 data stuffs all of that information (if you're lucky) into the rate_schedule_description column. There's no point trying to parse through the pre-2021 rate_schedule_description column en masse because it's just too messy. The contents of rate_schedule_description often contain numbers and acronyms that have little to no meaning out of context. The table is structured somewhat like the FERC1 small generators table with headings about rate structure type also embedded into the rate_schedule_description column. To all who dare, beware.
This table is a combination of one pre-2021 (DBF) table and nine post-2020 (XBRL) tables (one for each rate schedule type plus totals), hence the increase in data clarity post-2020. The rate_schedule_type and billing_status columns are only relevant for post-2020 data as they can be reliably parsed from each of the tables and incorporated into columns. The rate_schedule_description is supposed to contain sub-rate_schedule_type names for charges (Ex: Residential 1, Residential 2, etc.). However, the pre-2021 data contains a little bit of everything (or nothing) and the post-2020 data has some totals or wonky data thrown in. That's to say, even when working with post-2020 data, be wary of aggregating the data. That's what the "total" rows are for.
The values that come from the totals table are marked with the string "total" in the rate_schedule_description column. The totals table is a product of the transition to XBRL, so these distinguishable totals are only available for data post-2020 (otherwise you could try keyword searching for "total" in rate_schedule_description). The totals table contains two types of totals: the utility totals across all rate schedules in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "total") and each of the utility's individual rate schedule totals in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "residential" or any other rate schedule type).
The rate schedule based XBRL tables only report billed values whereas the total tables report billed, unbilled, and total values. (See the column description for more info on the difference between billed and unbilled). This is important to consider if you're endeavoring to compare the subtotal values with the total values. We have not attempted to fix or verify any subtotals or totals that don't add up.
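The total markers described above can be used to separate detail rows from total rows in post-2020 data. This is an illustrative sketch on invented rows; the sales_mwh column name is an assumption, and as noted, reported subtotals and totals are not guaranteed to add up.

```python
# Invented post-2020 rows for one utility and year.
rows = [
    {"rate_schedule_type": "residential", "rate_schedule_description": "Residential 1", "sales_mwh": 100.0},
    {"rate_schedule_type": "residential", "rate_schedule_description": "Residential 2", "sales_mwh": 40.0},
    {"rate_schedule_type": "residential", "rate_schedule_description": "total", "sales_mwh": 140.0},
    {"rate_schedule_type": "total", "rate_schedule_description": "total", "sales_mwh": 140.0},
]


def is_total(row) -> bool:
    """Rows from the totals table carry the literal description "total"."""
    return row["rate_schedule_description"] == "total"


# Sum only the detail rows, so totals are not double counted.
detail_sum = sum(r["sales_mwh"] for r in rows if not is_total(r))
# Utility-wide totals are marked "total" in both columns.
utility_totals = [r for r in rows if is_total(r) and r["rate_schedule_type"] == "total"]
```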
Another important note is the possibility of unit discrepancies in certain columns. The revenue_per_kwh column does not specify reporting units, and closer inspection of the data reveals two clear peaks approximately two orders of magnitude apart. This indicates that values may be reported in both dollars and cents. However, because the price of energy per kWh varies so much regionally, we cannot guarantee which is which and have not put any cleaning mechanisms in place to account for this.
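One way to look for the two peaks mentioned above is to bin values by order of magnitude. The sketch below counts positive values per power of ten; the sample numbers are invented and are not drawn from the table, and this only helps inspect the distribution, it does not decide which unit a given row uses.

```python
import math
from collections import Counter


def log10_histogram(values):
    """Count positive values by floor(log10(value)), skipping nulls and non-positives."""
    return Counter(
        math.floor(math.log10(v))
        for v in values
        if v is not None and v > 0
    )


# Invented values: three that look like dollars per kWh, three like cents.
sample = [0.08, 0.12, 0.15, 9.5, 11.0, 14.0]
hist = log10_histogram(sample)
```

Two well-separated clusters of bins about two apart would be consistent with a dollars and cents mix.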
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Categorization of rate schedule type.
Whether an amount is billed, unbilled, or both. Billed amounts pertain to the exchange of energy and unbilled amounts pertain to other sources of revenue such as contracts with peaker plants to keep them on standby or charging rent to host cell antennas on transmission towers.
Free-form description of what the rate schedule name is. Not standardized. Often a sub-category of rate_schedule_type.
Quantity of electricity sold in MWh.
Dollar value of reported income, expense, asset, or liability.
Average number of customers per month.
kWh per customer.
The amount of revenue per kWh by rate schedule acquired in the given report year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
core_ferc1__yearly_small_plants_sched410
Annual time series of plant statistics for internal combustion plants, gas turbine-plants, conventional hydro plants, and pumped storage plants with less than 10 MW installed nameplate capacity and steam plants with less than 25 MW installed nameplate capacity.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 410)
- Primary key:
This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
As reported on FERC Form 1 Schedule 410 (pages 410-411) and extracted from the FERC Visual FoxPro and XBRL. See our pudl.extract.ferc1.TABLE_NAME_MAP_FERC1 for links to the raw tables.
The raw version of this table is more like a digitized PDF than an actual data table. The rows contain lots of information in addition to what the columns might suggest. For instance, a single column may contain header rows, note rows, and total rows. This extraneous information is useful, but it prevents proper analysis when mixed in with the rest of the values data in the column. We employ a couple of data transformations to extract these rows from the data and preserve some of the information they contain (fuel type, plant type, FERC license, or general notes about the plant) in separate columns.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Type of plant.
FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.
Year the plant's oldest still operational unit was built.
Total installed (nameplate) capacity, in megawatts.
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Net electricity generation for the specified period in megawatt-hours (MWh).
Total cost of plant (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Production expenses: operations, supervision, and engineering (USD).
Production expenses: fuel (USD).
Production expenses: Maintenance (USD).
Type of fuel.
Average fuel cost per mmBTU of heat content in nominal USD.
core_ferc1__yearly_steam_plants_fuel_sched402
Annual time series of fuel cost and quantity for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)
- Primary key:
This table has no primary key.
Usage Warnings
The fuel_type_code_pudl is inferred from a free-form string field.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table is a subset of the steam plant table reported on page 402 of FERC Form 1.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Simplified fuel type code used in PUDL
Reported unit of measure for fuel.
Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.
Heat content of the fuel in millions of Btus per physical unit.
Average cost of fuel consumed in the report year per reported fuel unit (USD).
Average cost of fuel delivered in the report year per reported fuel unit (USD).
Average fuel cost per mmBTU of heat content in nominal USD.
core_ferc1__yearly_steam_plants_sched402
Annual time series of plant statistics for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)
- Primary key:
This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.
Usage Warnings
FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
Type of plant.
Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.
Year the plant's oldest still operational unit was built.
Year the plant's most recently built unit was installed.
Total installed (nameplate) capacity, in megawatts.
Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.
Hours the plant was connected to load while generating in the report year.
Net plant capability in megawatts.
Plant capacity in MW when limited by condenser water.
Plant capacity in MW when not limited by condenser water.
The average number of employees assigned to each plant.
Net electricity generation for the specified period in megawatt-hours (MWh).
Cost of plant: land and land rights (USD).
Cost of plant: structures and improvements (USD).
Cost of plant: equipment (USD).
Total cost of plant (USD).
Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.
Production expenses: operations, supervision, and engineering (USD).
Production expenses: fuel (USD).
Cost of coolants and water (nuclear plants only)
Steam expenses.
Steam from other sources.
Steam transferred (Credit).
Production expenses: electric expenses (USD).
Miscellaneous steam (or nuclear) expenses.
Production expenses: rents (USD).
Allowances.
Production expenses: maintenance, supervision, and engineering (USD).
Production expenses: maintenance of structures (USD).
Maintenance of boiler (or reactor) plant.
Maintenance of electrical plant.
Maintenance of miscellaneous steam (or nuclear) plant.
Total operating expenses.
Total production expenses (USD per MWh generated).
Asset retirement cost (USD).
core_ferc1__yearly_transmission_lines_sched422
Annual time series of statistics about transmission lines.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 422)
- Primary key:
This table has no primary key. Each record of this table is supposed to represent one stretch of a transmission line, but there are no IDs and many nulls in the fields which would nominally distinguish unique transmission lines.
Usage Warnings
Contains columns which may appear categorical, but are actually free text.
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
Information describing transmission lines, the cost of lines, annual operating and capital expenses, etc. This table includes transmission lines having nominal voltage of 132 kilovolts or greater. Transmission lines below these voltages are required to be reported in group totals only for each voltage.
Columns
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
The starting point of a transmission line.
The end point of a transmission line.
The operating voltage, expressed in kilovolts, for three-phase 60 cycle alternating current transmission lines.
Manufactured (designed) voltage, expressed in kilovolts, for three-phase 60 cycle alternating current transmission lines.
Supporting structure of the transmission line.
Length (in pole miles, or circuit miles if transmission lines are underground) for lines that are stand-alone structures (whose costs are reported on a stand-alone basis).
Length (in pole miles, or circuit miles if transmission lines are underground) for lines that are aggregated with other lines / structures (whose costs are aggregated and combined with other structures).
Number of circuits in a transmission line.
Size of transmission conductor and material of the transmission line.
Cost of Land and land rights for the transmission line.
Construction and other costs for the transmission line.
Total costs for the transmission line.
Operating expenses for the transmission line.
Maintenance expenses for the transmission line.
Rent expenses for the transmission line.
Overall expenses for the transmission line.
core_ferc1__yearly_utility_plant_summary_sched200
Annual time series of utility plant and accumulated provisions for depreciation, amortization and depletion of utility plant assets.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 200)
- Primary key:
utility_id_ferc1, report_year, utility_type, utility_plant_asset_type
Usage Warnings
Some columns contain subtotals; use caution when choosing columns to aggregate.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Four-digit year in which the data was reported.
Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.
Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).
Type of utility plant asset reported in the core_ferc1__yearly_utility_plant_summary_sched200 table. Assets include those leased to others, held for future use, construction work-in-progress and details of accumulated depreciation.
Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
Account balance at end of year.
Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
core_ferc714__hourly_planning_area_demand
Hourly time series of electricity demand by planning area.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2a)
- Primary key:
respondent_id_ferc714, datetime_utc
Usage Warnings
The datetime_utc timestamps have been cleaned due to inconsistent datetime reporting. See below for additional details.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table includes data from the pre-2021 CSV raw source as well as the newer 2021 through present XBRL raw source.
This table includes three respondent ID columns: one from the CSV raw source, one from the XBRL raw source, and a PUDL-derived one that links those two source IDs together. Source IDs have been filled in for all records, so you can select the full time series for a given respondent using any of these three IDs.
An important caveat to note is that there was some cleaning done to the datetime_utc timestamps. The Form 714 includes sparse documentation for respondents for how to interpret timestamps - the form asks respondents to provide 24 instances of hourly demand for each day. The form is labeled with hour 1-24. There is no indication if hour 1 begins at midnight.
The XBRL data contained several formats of timestamps. Most records corresponding to hour 1 of the Form have a timestamp with hour 1 as T1. About two thirds of the records in the hour 24 location of the form have a timestamp with an hour reported as T24 while the remaining third report this as T00 of the next day. T24 is not a valid format for the hour of a datetime, so we convert these T24 hours into T00 of the next day. A smaller subset of the respondents reports the 24th hour as the last second of the day - we also convert these records to the T00 of the next day.
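The T24 and last-second conversions described above can be sketched as follows. This is an illustration only, not PUDL's actual implementation: the normalize_hour_24 helper and the assumed "YYYY-MM-DDTHH:MM:SS" string format are hypothetical.

```python
from datetime import datetime, timedelta


def normalize_hour_24(timestamp):
    """Parse a timestamp string, rolling the 24th hour over to T00 of the next day."""
    date_part, time_part = timestamp.split("T")
    if time_part.startswith("24"):
        # Hour 24 is not a valid datetime hour, so parse the date and add a day.
        return datetime.strptime(date_part, "%Y-%m-%d") + timedelta(days=1)
    parsed = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%S")
    if (parsed.hour, parsed.minute, parsed.second) == (23, 59, 59):
        # The last second of the day also stands in for the 24th hour.
        return parsed + timedelta(seconds=1)
    return parsed


print(normalize_hour_24("2021-12-31T24:00:00"))
print(normalize_hour_24("2021-06-01T23:59:59"))
print(normalize_hour_24("2021-06-01T01:00:00"))
```

Handling T24 before calling strptime matters because the %H directive only accepts hours 00 through 23.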
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
Date reported.
Date and time converted to Coordinated Universal Time (UTC).
IANA timezone name
Electricity demand (energy) within a given timeframe.
core_ferc714__respondent_id
Respondent identification.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part I, Schedule 1)
- Primary key:
respondent_id_ferc714
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.
EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.
core_ferc714__yearly_planning_area_demand_forecast
Annual time series of 10-year forecasted summer and winter peak demand and annual net energy per planning area.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2b)
- Primary key:
respondent_id_ferc714, report_year, forecast_year
Usage Warnings
This table includes data from the pre-2021 CSV raw source as well as the newer 2021 through present XBRL raw source. We created the respondent_id_ferc714 field to blend disparate IDs from the CSV and XBRL data over time. See the core_ferc714__respondent_id table for links to the original source IDs.
A small handful of respondents (~11) report more than 10 forecast years, and an even smaller handful (~9) report fewer than 10.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table contains forecasted net demand (MWh) as well as summer and winter peak demand (MW) for the next ten years after the report_year.
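Since a few respondents report more or fewer than ten forecast years, it can be worth checking the forecast horizon before assuming a fixed length. A minimal sketch using made-up records keyed by the primary key columns (the sample values are hypothetical):

```python
from collections import defaultdict

# Hypothetical (respondent_id_ferc714, report_year, forecast_year) records.
records = [(1, 2020, year) for year in range(2021, 2031)]   # 10 forecast years
records += [(2, 2020, year) for year in range(2021, 2033)]  # 12 forecast years
records += [(3, 2020, year) for year in range(2021, 2026)]  # 5 forecast years

# Collect the distinct forecast years for each respondent and report year.
horizons = defaultdict(set)
for respondent_id, report_year, forecast_year in records:
    horizons[(respondent_id, report_year)].add(forecast_year)

# Keep only the filings whose horizon is not exactly ten years.
unusual = {key: len(years) for key, years in horizons.items() if len(years) != 10}
print(unusual)
```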
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
Four-digit year in which the data was reported.
Four-digit year that applies to a particular forecasted value.
The maximum forecasted hourly summer load (for the months of June through September).
The maximum forecasted hourly winter load (for the months of January through March).
Net forecasted electricity demand for the specific period in megawatt-hours (MWh).
core_ferc__codes_accounts
Code table containing descriptions of categorical codes for account numbers from the FERC Uniform System of Accounts for Electric Plant.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC -- Mix of multiple FERC Forms
- Primary key:
ferc_account_id
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
These codes are defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101).
Columns
Account identifier from FERC's Uniform System of Accounts for Electric Plant. Includes higher level labeled categories.
Description of the FERC account.
core_ferc__entity_companies
Entity table containing static information about the companies that submit required filings to FERC.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC -- Mix of multiple FERC Forms
- Primary key:
company_id_ferc
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
The Company Identifier (CID) obtained through FERC's Company Registration system.
The name of the FERC-reporting organization or company.
The specific program or initiative associated with the FERC organization, which can provide context for the company's activities.
The website URL of the company, which can provide additional information about the organization.
Physical street address.
Second line of the address.
Name of the city.
Two letter US state abbreviation.
Five digit US Zip Code.
Four digit US Zip Code suffix.
core_gridpathratoolkit__assn_generator_aggregation_group
Association table providing connections between individual generator IDs and their corresponding aggregation group when creating aggregated profiles.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
GridPath Resource Adequacy Toolkit Data
- Primary key:
plant_id_eia, generator_id
Usage Warnings
The data is no longer being collected or reported in this way.
Additional Details
This table defines which individual generator profiles are combined when creating aggregated capacity factors / generation profiles. Generator capacity is used to weight the contribution of each generator in the resulting aggregated profiles, and is made available in this table for convenience and legibility. The resulting aggregated profiles are stored in out_gridpathratoolkit__hourly_available_capacity_factor.
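The capacity-weighted averaging described above can be sketched for a single hour as follows. The sample generators, the tuple layout, and the aggregate_capacity_factors helper are hypothetical, and the real profiles are hourly series rather than single values.

```python
from collections import defaultdict

# Hypothetical generators for one hour:
# (aggregation group, capacity in MW, included in aggregation?, capacity factor)
generators = [
    ("group_a", 100.0, True, 0.50),
    ("group_a", 300.0, True, 0.30),
    ("group_a", 50.0, False, 0.90),  # excluded from the aggregation
    ("group_b", 20.0, True, 0.25),
]


def aggregate_capacity_factors(generators):
    """Return the capacity-weighted average capacity factor for each group."""
    weighted_sum = defaultdict(float)
    total_capacity = defaultdict(float)
    for group, capacity_mw, include, capacity_factor in generators:
        if not include:
            continue  # generators flagged as excluded do not contribute
        weighted_sum[group] += capacity_mw * capacity_factor
        total_capacity[group] += capacity_mw
    return {group: weighted_sum[group] / total_capacity[group] for group in total_capacity}


aggregated = aggregate_capacity_factors(generators)
print(aggregated)
```

In this example the larger generator pulls the group_a average toward its own capacity factor, which is the point of weighting by capacity.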
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
A label identifying a group of aggregated generator capacity factors.
Total installed (nameplate) capacity, in megawatts.
Every row in the aggregation table describes a single generator. Groups of rows with the same aggregation are combined using a capacity weighted average to produce an aggregate generation profile. A few generators are not included in that aggregation process. This column determines whether a generator is included.
core_nrelatb__yearly_projected_cost_performance
Annual time series of projections of costs and performance for various technologies.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
NREL Annual Technology Baseline (ATB) for Electricity
- Primary key:
This table has no primary key. The fields technology_description_detail_1 and technology_description_detail_2 occasionally contain nulls, preventing this table from having a primary key.
Additional Details
This table includes NREL's projections for capital and operational costs as well as operational attributes for a variety of technologies for each of the NREL ATB scenarios. Each new year of ATB data includes projections for the next 25 years.
All of the NREL tables are related. If you want to understand the financial assumptions of a given scenario in this table, see core_nrelatb__yearly_projected_financial_cases. The other two core_nrelatb tables include attributes about technologies in these projections.
Columns
Four-digit year in which the data was reported.
NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods
NREL's tax credit assumption cases. There are two types of tax credits: production tax credit (PTC) and investment tax credit (ITC). For more detail, see: https://atb.nrel.gov/electricity/2024/financial_cases_&_methods
The year of the projected value.
High level description of the technology used by the generator to produce electricity.
The period over which the initial capital investment to build a plant is recovered.
Technology innovation scenarios. https://atb.nrel.gov/electricity/2023/definitions#scenarios
Technology details indicate resource levels and specific technology subcategories.
Technology details indicate resource levels and specific technology subcategories.
Fraction of potential generation that was actually reported for a plant part.
Capital cost (USD). Expenditures required to achieve commercial operation of the generation plant.
capex if plant could be constructed overnight (i.e., excludes construction period financing); includes on-site electrical equipment (e.g., switchyard), a nominal-distance spur line (<1 mi), and necessary upgrades at a transmission substation.
capex for retrofits if plant could be constructed overnight (i.e., excludes construction period financing); includes on-site electrical equipment (e.g., switchyard), a nominal-distance spur line (<1 mi), and necessary upgrades at a transmission substation.
Overnight capital cost includes a nominal-distance spur line (<1 mi) for all technologies, and for offshore wind, it includes export cable and construction period transit costs for a 30-km distance from shore. Project-specific costs lines that are based on distance to existing transmission are not included. This only applies to offshore wind.
Portion of all-in capital cost associated with construction period financing. This factor is applied to an overnight capital cost to represent the financing costs incurred during the construction period.
Fuel costs in USD/MWh. NREL-derived values using heat rates.
Fuel content per unit of electricity generated.
Heat rate penalty for retrofitting. This column is only populated for retrofit technologies. It seems to be a rate between 0.09 and 0.35.
Levelized cost of energy (LCOE) is a summary metric that combines the primary technology cost and performance parameters: capital expenditures, operations expenditures, and capacity factor.
Net output penalty for retrofitting. This column is only populated for retrofit technologies. It seems to be a rate between -0.25 and -0.08.
Fixed operation and maintenance expenses. Annual expenditures to operate and maintain equipment that are not incurred on a per-unit-energy basis.
Operation and maintenance costs incurred on a per-unit-energy basis.
core_nrelatb__yearly_projected_financial_cases
Annual time series of financial assumptions for each model case (model_case_nrelatb), and sub-type of technology (technology_description).
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
NREL Annual Technology Baseline (ATB) for Electricity
- Primary key:
report_year, model_case_nrelatb, projection_year, technology_description
Columns
Four-digit year in which the data was reported.
NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods
The year of the projected value.
High level description of the technology used by the generator to produce electricity.
Rate of inflation. All dollar values are given in 2021 USD, using the Consumer Price Index for All Urban Consumers for dollar year conversions where the source year dollars do not match 2021.
Also referred to as construction finance cost. Portion of all-in capital cost associated with construction period financing. It is a function of construction duration, capital fraction during construction, and interest during construction.
Calculated real interest rate.
Nominal interest rate.
Calculated real rate of return on equity.
Nominal rate of return on equity.
Combined federal and state tax rate. The R&D model_case_nrelatb holds tax and inflation rates constant at assumed long-term values (21 percent federal tax rate, 6 percent state tax rate, though actual state tax rates vary, and 2.5 percent inflation rate) and excludes the effects of tax credits. The Market + Policy model_case_nrelatb applies federal tax credits and expires them as consistent with existing law and guidelines.
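The column list above pairs nominal rates with "calculated real" rates. A common way to relate the two is the Fisher relation, sketched below. Whether NREL uses exactly this formula is an assumption here, and the real_rate helper and the sample inputs are hypothetical.

```python
def real_rate(nominal_rate, inflation_rate):
    """Convert a nominal rate to a real rate using the Fisher relation."""
    return (1.0 + nominal_rate) / (1.0 + inflation_rate) - 1.0


# Example: a 7 percent nominal rate with 2.5 percent inflation.
example = real_rate(0.07, 0.025)
print(round(example, 4))
```

Note that the real rate is slightly lower than the simple difference between the nominal rate and the inflation rate.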
core_nrelatb__yearly_projected_financial_cases_by_scenario
Annual time series of additional financial assumptions for NREL ATB projections that also vary by technology innovation scenario (scenario_atb), tax credit case (model_tax_credit_case_nrelatb), and cost recovery period (cost_recovery_period_years).
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
NREL Annual Technology Baseline (ATB) for Electricity
- Primary key:
This table has no primary key. There are a small number of records which have nulls in the cost_recovery_period_years column. Based on NREL's documentation, this seems to indicate that those records apply to any relevant cost_recovery_period_years. If those records were non-null, the primary keys of this table would be: ['report_year', 'model_case_nrelatb', 'model_tax_credit_case_nrelatb', 'projection_year', 'technology_description', 'scenario_atb', 'cost_recovery_period_years']
Columns
Four-digit year in which the data was reported.
NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods
NREL's tax credit assumption cases. There are two types of tax credits: production tax credit (PTC) and investment tax credit (ITC). For more detail, see: https://atb.nrel.gov/electricity/2024/financial_cases_&_methods
The year of the projected value.
High level description of the technology used by the generator to produce electricity.
Technology innovation scenarios. https://atb.nrel.gov/electricity/2023/definitions#scenarios
The period over which the initial capital investment to build a plant is recovered.
Ratio of a constant annuity to the present value of receiving that annuity for a given length of time.
Fraction of capital financed with debt; the remainder (1 - debt fraction) is assumed financed with equity. Also referred to as the leverage ratio.
Amount of revenue per dollar of investment required that must be collected annually from customers to pay the carrying charges on that investment.
Nominal weighted average cost of capital - average expected rate that is paid to finance assets.
Real weighted average cost of capital - average expected rate that is paid to finance assets.
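The capital recovery factor described above can be computed from a weighted average cost of capital and a cost recovery period. The sketch below assumes the standard annuity formula, CRF = WACC / (1 - (1 + WACC)^-n); the source does not spell out the formula, and the function name and inputs are hypothetical.

```python
def capital_recovery_factor(wacc: float, recovery_years: int) -> float:
    """Ratio of a constant annuity to the present value of that annuity.

    Assumption: standard annuity formula, with the zero-rate case handled
    as straight-line recovery (1 / n).
    """
    if recovery_years <= 0:
        raise ValueError("recovery_years must be positive")
    if wacc == 0:
        return 1.0 / recovery_years
    return wacc / (1.0 - (1.0 + wacc) ** -recovery_years)


# Hypothetical inputs: 5 percent WACC recovered over 30 years.
example_crf = capital_recovery_factor(0.05, 30)
```

Whether to pass the nominal or the real weighted average cost of capital depends on whether the cash flows being annualized are expressed in nominal or real dollars.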
core_nrelatb__yearly_technology_status
Annual time series of technology statuses, indicating whether technologies in the ATB scenarios are mature, and whether they are the default technologies.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
NREL Annual Technology Baseline (ATB) for Electricity
- Primary key:
This table has no primary key.
Columns
Four-digit year in which the data was reported.
High level description of the technology used by the generator to produce electricity.
Technology details indicate resource levels and specific technology subcategories.
Technology details indicate resource levels and specific technology subcategories.
Indicator of whether the technology is mature. Technologies are defined as mature if a representative plant is operating or under construction in the United States in the Base Year.
Indicator of whether the technology is default.
core_phmsagas__yearly_distribution_operators
Annual time series of distribution operator information.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part A)
- Primary key:
report_id, report_date, operator_id_phmsa
Additional Details
This table contains operator-level information, including office and headquarters locations.
Columns
Report number of the PHMSA Gas utility submission.
Date reported.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
PHMSA operator name.
Street address of an operator's office.
City where an operator's office is located.
County where an operator's office is located.
Zipcode where an operator's office is located.
State where an operator's office is located.
Street address for an operator's headquarters.
City where an operator's headquarters are located.
County where an operator's headquarters are located.
State where an operator's headquarters are located.
Zipcode where an operator's headquarters are located.
Any additional information which will assist in clarifying or classifying the reported data.
core_pudl__assn_eia_pudl_plants
Association table providing connections between EIA plant IDs and manually assigned PUDL plant IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
plant_id_eia
Usage Warnings
Contains information from multiple raw inputs.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
A manually assigned PUDL plant ID. May not be constant over time.
core_pudl__assn_eia_pudl_utilities
Association table providing connections between EIA utility IDs and manually assigned PUDL utility IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_eia
Usage Warnings
Contains information from multiple raw inputs.
Columns
The EIA Utility Identification number.
The name of the utility.
A manually assigned PUDL utility ID. May not be stable over time.
core_pudl__assn_ferc1_dbf_pudl_utilities
Association table providing connections between PUDL-assigned FERC1 utility IDs and the native FERC1 DBF utility IDs originally reported as respondent_id.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_ferc1_dbf
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.
core_pudl__assn_ferc1_pudl_plants
Association table providing connections between FERC 1 plants and their manually assigned PUDL plant IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_ferc1, plant_name_ferc1
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
FERC does not assign IDs to plants, so each FERC 1 plant is identified by a utility_id_ferc1 and a plant_name_ferc1.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.
A manually assigned PUDL plant ID. May not be constant over time.
core_pudl__assn_ferc1_pudl_utilities
Association table providing connections between PUDL utility IDs and PUDL-assigned FERC1 utility IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_ferc1
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
This table maps two manually assigned utility IDs: a PUDL ID and a FERC1 ID. The PUDL IDs link EIA and FERC1 utilities. The PUDL FERC1 IDs link records from older DBF respondent IDs and new XBRL entity IDs via core_pudl__assn_ferc1_dbf_pudl_utilities and core_pudl__assn_ferc1_xbrl_pudl_utilities respectively. This table is generated from a table stored in the PUDL repository: src/pudl/package_data/glue/utility_id_pudl.csv
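The two-step lookup described above, from a native DBF or XBRL ID to a PUDL-assigned FERC1 ID and then to a PUDL utility ID, can be sketched with plain dictionaries standing in for the three association tables. All ID values and the function name below are hypothetical.

```python
# Hypothetical stand-ins for the association tables (values are made up).
ferc1_to_pudl = {1: 100, 2: 101}              # utility_id_ferc1 -> utility_id_pudl
dbf_to_ferc1 = {17: 1}                        # utility_id_ferc1_dbf -> utility_id_ferc1
xbrl_to_ferc1 = {"C000123": 1, "C000456": 2}  # utility_id_ferc1_xbrl -> utility_id_ferc1


def pudl_id_for_native(native_id, native_to_ferc1, ferc1_to_pudl):
    """Resolve a native FERC1 ID to a PUDL utility ID, or None if unmapped."""
    ferc1_id = native_to_ferc1.get(native_id)
    if ferc1_id is None:
        return None
    return ferc1_to_pudl.get(ferc1_id)


# A DBF respondent and an XBRL entity that share one PUDL-assigned FERC1 ID
# resolve to the same PUDL utility ID.
from_dbf = pudl_id_for_native(17, dbf_to_ferc1, ferc1_to_pudl)
from_xbrl = pudl_id_for_native("C000123", xbrl_to_ferc1, ferc1_to_pudl)
```

In practice the same result would come from joining the association tables on utility_id_ferc1.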
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
Name of the responding utility, as it is reported in FERC Form 1. For human readability only.
A manually assigned PUDL utility ID. May not be stable over time.
core_pudl__assn_ferc1_xbrl_pudl_utilities
Association table providing connections between PUDL-assigned FERC1 utility IDs and the native FERC1 XBRL utility IDs originally reported as entity_id.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_ferc1_xbrl
Usage Warnings
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.
FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.
core_pudl__assn_ferc714_csv_pudl_respondents
Association table providing connections between PUDL-assigned FERC714 respondent IDs and native FERC714 CSV respondent IDs originally reported as respondent_id.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
respondent_id_ferc714, respondent_id_ferc714_csv
Usage Warnings
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
core_pudl__assn_ferc714_pudl_respondents
Entity table containing static information about PUDL-derived FERC 714 respondent IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
respondent_id_ferc714
Usage Warnings
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Additional Details
These IDs are used to connect older CSV data, which uses different respondent IDs than the newer XBRL entity IDs. These IDs are manually assigned when new FERC 714 data is integrated, and any newly found utilities are added to the list with a new ID. This table is read in from a file stored in the PUDL repository: src/pudl/package_data/glue/respondent_id_ferc714.xlsx
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
core_pudl__assn_ferc714_xbrl_pudl_respondents
Association table providing connections between PUDL-assigned FERC714 respondent IDs and native FERC714 XBRL respondent IDs originally reported as entity_id.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
respondent_id_ferc714, respondent_id_ferc714_xbrl
Usage Warnings
Contains information from multiple raw inputs.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.
FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.
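Because the CSV respondent IDs cover 2006-2020 and the XBRL IDs cover 2021 onward, the native ID that applies to a record depends on its report year. A minimal sketch of that choice is below. The function name is hypothetical, and the column names are taken from the primary keys of the two FERC 714 association tables.

```python
def native_id_column(report_year: int) -> str:
    """Name the native FERC 714 respondent ID column that applies to a year."""
    if report_year >= 2021:
        return "respondent_id_ferc714_xbrl"
    if report_year >= 2006:
        return "respondent_id_ferc714_csv"
    raise ValueError(f"No FERC 714 respondent IDs described for {report_year}")
```

Either native ID can then be mapped to the PUDL-assigned respondent_id_ferc714 to follow a respondent across the 2020/2021 format change.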
core_pudl__assn_utilities_plants
Association table providing connections between PUDL utility IDs and PUDL plant IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_pudl, plant_id_pudl
Additional Details
This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx
Columns
A manually assigned PUDL utility ID. May not be stable over time.
A manually assigned PUDL plant ID. May not be constant over time.
core_pudl__codes_data_maturities
Code table containing descriptions of categorical codes for maturity levels of data records. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
code
Usage Warnings
Contains information from multiple raw inputs.
Columns
Originally reported short code.
Long human-readable description of the meaning of a code/label.
core_pudl__codes_datasources
Code table containing descriptions of categorical codes for the data sources that PUDL integrates.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
datasource
Columns
Code identifying a dataset available within PUDL.
The data partitions used to generate this instance of the database.
Unique digital object identifier of Zenodo archive.
The version of PUDL used to generate this database.
core_pudl__codes_imputation_reasons
Code table containing descriptions of categorical codes for the reasons a value might be flagged for imputation.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
code
Additional Details
Any time a column is imputed, there should be a corresponding column which contains codes for every value in the column that was imputed.
Columns
Originally reported short code.
Long human-readable description of the meaning of a code/label.
core_pudl__codes_subdivisions
Code table containing descriptions of categorical codes for states, provinces, and other sub-national political subdivisions.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
country_code, subdivision_code
Columns
Three-letter ISO-3166 country code (e.g. USA or CAN).
Full country name (e.g. United States of America).
Two-letter ISO-3166 political subdivision code (e.g. US state or Canadian province abbreviations like CA or AB).
Full name of political subdivision (e.g. US state or Canadian province names like California or Alberta).
ISO-3166 political subdivision type. E.g. state, province, outlying_area.
IANA timezone name of the timezone which encompasses the largest portion of the population in the associated geographic area.
Two-digit state FIPS code.
Longer human-readable name describing the US Census division.
Three-letter US Census division code as it appears in the bulk electricity data published by the EIA. Note that EIA splits the Pacific division into distinct contiguous (CA, OR, WA) and non-contiguous (AK, HI) states. For reference see this US Census region and division map: https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf
Human-readable name of a US Census region.
Indicates whether the associated state reports data within the EPA's Continuous Emissions Monitoring System.
core_pudl__entity_plants_pudl
Entity table containing static information about PUDL-assigned plant IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
plant_id_pudl
Additional Details
These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly identified plants are added to the list with a new ID. Each ID maps to a power plant which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx
Columns
A manually assigned PUDL plant ID. May not be constant over time.
Plant name, chosen arbitrarily from the several possible plant names available in the plant matching process. Included for human readability only.
core_pudl__entity_utilities_pudl
Entity table containing static information about PUDL-assigned utility IDs.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
The Public Utility Data Liberation (PUDL) Project
- Primary key:
utility_id_pudl
Additional Details
These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly found utilities are added to the list with a new ID. Each ID maps to a power plant owning or operating entity which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx
Columns
A manually assigned PUDL utility ID. May not be stable over time.
Utility name, chosen arbitrarily from the several possible utility names available in the utility matching process. Included for human readability only.
core_rus12__scd_borrowers
Slowly changing dimension (SCD) table describing attributes of active RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Additional Details
This table contains all of the Active Distribution Borrowers as of each report year who were eligible to report to RUS Form 12. Borrowers that reported to RUS have records in the other RUS Form 12 data tables. A small number of eligible Borrowers did not report, for various reasons, and are not represented in any of the other tables.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Two letter US state abbreviation.
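The borrower ID structure described above (a two-character state abbreviation followed by four digits) lends itself to a simple format check. The sketch below assumes uppercase ASCII letters and no separator between the two parts, which the source does not confirm. The function name and example IDs are hypothetical.

```python
import re

# Assumption: two uppercase ASCII letters immediately followed by four digits.
BORROWER_ID_PATTERN = re.compile(r"[A-Z]{2}[0-9]{4}")


def looks_like_borrower_id(value: str) -> bool:
    """Return True if the whole string matches the assumed borrower ID format."""
    return BORROWER_ID_PATTERN.fullmatch(value) is not None
```

For example, a made-up ID such as "AL0001" passes this check, while "AL001" and "al0001" do not.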
core_rus12__yearly_balance_sheet_assets
Annual time series of assets and other debits from the balance sheet.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part A - Section B)
- Primary key:
report_date, borrower_id_rus, asset_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Type of asset being reported to the core_rus12__yearly_balance_sheet_assets table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Whether or not this record represents a total.
core_rus12__yearly_balance_sheet_liabilities
Annual time series of liabilities and other credits from the balance sheet.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part A - Section B)
- Primary key:
report_date, borrower_id_rus, liability_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Type of liability being reported to the core_rus12__yearly_balance_sheet_liabilities table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Whether or not this record represents a total.
core_rus12__yearly_lines_stations_labor_materials_cost
Annual time series of labor and material cost for lines and stations operated by RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part I - Section C)
- Primary key:
report_date, borrower_id_rus, labor_or_material, operation_or_maintenance, lines_or_stations
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Indicates whether the cost reported is for labor or material.
Indicates whether the expenditure is for operation or maintenance.
Indicates whether the cost reported pertains to lines or stations.
Cost value.
core_rus12__yearly_long_term_debt
Annual time series of long-term debt and debt service requirements for RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part H - Section H)
- Primary key:
This table has no primary key because some borrowers report multiple debt values from the same entity in a given year.
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Description of debt or loan. On the original form, there are nine provided descriptions and a section to add other free-form descriptions.
The amount of principal still owed on the debt at the end of the report year.
The interest expense on the debt for the report year.
The principal paid on the debt during the report year.
The total amount of debt.
core_rus12__yearly_meeting_and_board
Annual time series of annual meeting and board information for RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part H - Section I)
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
The date of the last annual meeting.
The total number of members.
The number of members present at the last annual meeting.
Whether or not quorum was met.
The number of members voting by mail or by proxy.
The total number of board members.
The total amount of fees and expenses for board members.
Whether or not the RUS borrower's manager has a written contract.
core_rus12__yearly_renewable_plants
Annual time series of renewable energy plant generation information for RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 12 -- Financial and Operating Report: Electric Power Supply (Part C RE)
- Primary key:
report_date, borrower_id_rus, plant_name_rus, prime_mover_id
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Name of the plant as reported to RUS.
Unique numeric identifier for each prime mover type used by RUS borrowers.
Type of prime mover (e.g. Hydro, Internal Combustion).
Unique numeric identifier for each renewable fuel type.
Primary renewable fuel type used by the plant.
Percentage of renewable fuel used.
Total installed (nameplate) capacity, in megawatts.
Net electricity generation for the specified period in megawatt-hours (MWh).
Fraction of potential generation that was actually reported for a plant part.
Number of employees.
Total production expenses (USD per MWh generated).
The cost of power per MWh.
The amount of money invested.
Percentage of the plant owned by the respondent.
Amount of funding received from the Rural Utilities Service (RUS).
General comments field.
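The capacity, net generation and capacity factor columns above are related by a simple ratio. The sketch below assumes an annual record with 8,760 hours and uses hypothetical argument names; it is an illustration of the usual definition, not necessarily how the reported values were derived.

```python
def capacity_factor(net_generation_mwh: float, capacity_mw: float,
                    hours: float = 8760.0):
    """Fraction of potential generation actually produced over a period.

    Returns None when capacity or hours is not positive, since the ratio
    is undefined in that case.
    """
    if capacity_mw <= 0 or hours <= 0:
        return None
    return net_generation_mwh / (capacity_mw * hours)


# Hypothetical 10 MW plant generating 26,280 MWh over a non-leap year.
example_capacity_factor = capacity_factor(26280.0, 10.0)
```

Values well above 1 usually point to a unit or reporting problem in one of the inputs rather than to real plant performance.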
core_rus7__codes_investment_types
Code table containing descriptions of categorical codes for investment types.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution
- Primary key:
code
Columns
Originally reported short code.
Long human-readable description of the meaning of a code/label.
core_rus7__scd_borrowers
Slowly changing dimension (SCD) table describing attributes of active RUS borrowers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Additional Details
This table contains all of the Active Distribution Borrowers as of each report year who were eligible to report to RUS Form 7. Borrowers that reported to RUS have records in the other RUS Form 7 data tables. A small number of eligible Borrowers did not report, for various reasons, and are not represented in any of the other tables.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Two letter US state abbreviation.
core_rus7__yearly_balance_sheet_assets
Annual time series of assets and other debits from the balance sheet.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part C)
- Primary key:
report_date, borrower_id_rus, asset_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Type of asset being reported to the core_rus7__yearly_balance_sheet_assets table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Whether or not this record represents a total.
core_rus7__yearly_balance_sheet_liabilities
Annual time series of liabilities and other credits from the balance sheet.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part C)
- Primary key:
report_date, borrower_id_rus, liability_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Type of liability being reported to the core_rus7__yearly_balance_sheet_liabilities table.
Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.
Whether or not this record represents a total.
core_rus7__yearly_employee_statistics
Annual time series of statistics about employment and payroll.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part H)
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
The number of full time employees.
The number of regular (non-overtime) hours worked by employees.
The number of overtime hours worked by employees.
The amount of payroll spent that was expensed.
The amount of payroll spent that was capitalized.
The amount of payroll spent that was funded by other means - not capitalized or expensed.
core_rus7__yearly_energy_efficiency
Annual time series of energy efficiency program information.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part P)
- Primary key:
report_date, borrower_id_rus, customer_class, observation_period
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
The savings_mmbtu column likely contains values with incorrect units.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
High level categorization of customer type.
The date range that any given record pertains to. Ex: 'december' implies that this record covers the month of December only, while 'avg' implies this record pertains to the average of the reporting period.
Number of customers.
The amount of money invested.
The estimated amount of energy savings from energy efficiency programs. Warning: We found values much larger than expected that we have not yet cleaned - this is likely a reporting unit error.
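Until the suspected unit errors in savings_mmbtu are cleaned, users may want to flag implausibly large values before aggregating. The sketch below is one rough heuristic and not a method used by PUDL: it marks values that exceed a multiple of the median of the positive, non-null values. The function name, the default factor of 1000 and the example values are all hypothetical.

```python
from statistics import median


def flag_suspect_savings(values, factor: float = 1000.0):
    """Flag savings values far above the median of positive, non-null values.

    Heuristic only: the factor is an arbitrary assumption and should be
    tuned against the actual distribution of reported values.
    """
    present = [v for v in values if v is not None and v > 0]
    if not present:
        return [False] * len(values)
    threshold = factor * median(present)
    return [v is not None and v > threshold for v in values]


# Made-up values: one entry is several orders of magnitude above the rest.
example_flags = flag_suspect_savings([1.0, 2.0, 3.0, 5_000_000.0, None])
```

Flagged records are candidates for manual review rather than automatic correction, since the appropriate unit conversion is not known.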
core_rus7__yearly_investments
Annual time series of investments, loan guarantees and loans.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part P - Section I)
- Primary key:
This table has no primary key. It is a list of all investments or loans in each year, and borrowers can have multiple records with the same investment_description.
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Additional Details
Reporting of investments is required by 7 CFR 1717, Subpart N. Investment categories reported on this Part correspond to Balance Sheet items in Part C.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Description of investment. This is a free-form text field and thus contains a wide variety of values.
Investment type code.
Included investment.
Excluded investment.
Income or loss from investment.
Whether or not the investment is Rural Development.
core_rus7__yearly_long_term_debt
Annual time series of long term debt and debt service requirements.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part N)
- Primary key:
This table has no primary key. It is a list of all debts in each year, and borrowers can have multiple records with the same debt description.
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Description of debt or loan. On the original form, there are nine provided descriptions and a section to add other free-form descriptions.
The amount of principal still owed on the debt at the end of the report year.
The interest expense on the debt for the report year.
The principal paid on the debt during the report year.
The total amount of debt.
core_rus7__yearly_meeting_and_board
Annual time series of governance information about RUS borrowers' annual member meetings as well as information about their board.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part M)
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
The date of the last annual meeting.
The total number of members.
The number of members present at the last annual meeting.
Whether or not quorum was met.
The number of members voting by mail or by proxy.
The total number of board members.
The total amount of fees and expenses for board members.
Whether or not the RUS borrower's manager has a written contract.
core_rus7__yearly_patronage_capital
Annual time series of patronage capital distributed and received.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part I)
- Primary key:
report_date, borrower_id_rus, patronage_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Type of patronage capital distributed or received.
Amount of patronage distributed or received within report year.
Amount of patronage distributed or received cumulatively. Received patronage capital is not reported cumulatively and thus will be null.
Whether or not this record represents a total.
core_rus7__yearly_power_requirements
Annual time series of power requirements - revenue and generation summary.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part O)
- Primary key:
report_date, borrower_id_rus
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Additional Details
This table includes totals of electric sales and revenue, which also appear in core_rus7__yearly_power_requirements_electric_sales with a customer_class of total. All other power requirements in this table are not broken out by customer class, so we include these totals here as well.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
Total revenue received from sales of electric energy. Total of lines 1c through 9c on the original form.
Transmission revenue.
Electric revenue other than electric_sales_revenue.
The cost of purchases and generation of electricity.
Total MWh of electricity sold. Total of lines 1b through 9b on the original form.
The electricity in MWh used for the borrower's own internal use.
The total electricity purchased.
The total electricity generated.
The net interchange of electricity. The net amount of electricity exchanged in purchases and sales.
The system peak in MW - the sum of all MW.
Whether peak_mw is a coincident or non-coincident peak.
core_rus7__yearly_power_requirements_electric_customers
Annual time series of power requirements - number of customers served by customer type.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part O)
- Primary key:
report_date, borrower_id_rus, customer_class, observation_period
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
High level categorization of customer type.
The date range that any given record pertains to. Ex: 'december' implies that this record covers the month of December only, while 'avg' implies this record pertains to the average of the reporting period.
Number of customers.
core_rus7__yearly_power_requirements_electric_sales
Annual time series of power requirements - revenue and energy sold by customer type.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part O)
- Primary key:
report_date, borrower_id_rus, customer_class
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
High level categorization of customer type.
Quantity of electricity sold in MWh.
Amount of revenue.
core_rus7__yearly_statement_of_operations
Annual time series of statement of operations broken out by types and a variety of time periods.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
USDA RUS Form 7 -- Financial and Operating Report: Electric Distribution (Part A)
- Primary key:
report_date, borrower_id_rus, opex_group, opex_type
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
Columns
Date reported.
Unique identifier of RUS (Rural Utilities Service) borrower. These IDs are structured as a two-character state abbreviation followed by four digits.
The name of the RUS (Rural Utilities Service) borrower.
High level section from the statement of operations table. Most of these types have subcomponents broken out in the opex_type column.
Type of item from the statement of operations.
Amount of operational expense, cost or income during the report month.
The year-to-date amount of operational expense, cost or income.
The year-to-date budget for amount of operational expense, cost or income.
Whether or not this record represents a total.
core_sec10k__assn_exhibit_21_subsidiaries_and_eia_utilities
Association table providing connections between subsidiaries listed in Exhibit 21 and EIA utilities.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
subsidiary_company_id_sec10k
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
An Exhibit 21 subsidiary is considered matched to an EIA utility if their names are identical. Only subsidiaries that don't file SEC 10-K themselves are included in this table. SEC 10-K filers have much more information available and are matched to EIA utilities using probabilistic record linkage in core_sec10k__assn_sec10k_filers_and_eia_utilities.
Columns
PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.
The EIA Utility Identification number.
core_sec10k__assn_exhibit_21_subsidiaries_and_filers
Association table providing connections between subsidiaries listed in Exhibit 21 and their SEC central index key, if the subsidiary also files Form 10-K.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
subsidiary_company_id_sec10k
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
Exhibit 21 subsidiaries and SEC 10-K filers are considered matched if they have identical names and the same location of incorporation.
Columns
PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.
Identifier of the company in SEC database.
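The matching rule described for this table (identical names and the same location of incorporation) amounts to an exact join on a two-part key. The sketch below illustrates the idea with plain dictionaries. It is not PUDL's implementation: the field names company_name and location are hypothetical, and keeping the first filer when several share a key is an arbitrary simplification.

```python
def match_subsidiaries_to_filers(subsidiaries, filers):
    """Map subsidiary IDs to filer CIKs on exact (name, location) equality.

    Field names `company_name` and `location` are hypothetical stand-ins.
    If several filers share a key, the first one seen is kept (an assumption).
    """
    filer_index = {}
    for filer in filers:
        key = (filer["company_name"], filer["location"])
        filer_index.setdefault(key, filer["central_index_key"])

    matches = {}
    for sub in subsidiaries:
        key = (sub["company_name"], sub["location"])
        if key in filer_index:
            matches[sub["subsidiary_company_id_sec10k"]] = filer_index[key]
    return matches
```

Because the comparison is exact, differences in spelling, punctuation or capitalization prevent a match unless names are normalized beforehand.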
core_sec10k__assn_sec10k_filers_and_eia_utilities
Association table providing connections between SEC 10-K filers and EIA utilities.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
central_index_key, utility_id_eia
Usage Warnings
This data has been matched probabilistically using a machine learning model. It is incomplete in places and contains some errors, so it should not be treated as ground truth data.
PUDL does not currently update its copy of this data.
Additional Details
SEC central index keys are matched to EIA utility IDs using probabilistic record linkage based on associated company information like company name, business and mailing addresses, and state of incorporation. The match between central_index_key and utility_id_eia is one-to-one and is not allowed to change over time. In cases where there were multiple candidate matches, the match with the highest probability is selected.
Columns
Identifier of the company in SEC database.
The EIA Utility Identification number.
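To illustrate how a one-to-one match can be chosen from several scored candidates, the sketch below applies a simple greedy rule: accept candidates from highest to lowest probability, skipping any whose central_index_key or utility_id_eia is already taken. This is an assumed strategy for illustration only, since the source does not describe how PUDL resolves competing candidates beyond choosing the highest probability. The function name and tuple layout are hypothetical.

```python
def select_one_to_one(candidates):
    """Greedily pick a one-to-one set of matches from scored candidates.

    `candidates` is an iterable of (central_index_key, utility_id_eia,
    probability) tuples. Higher-probability candidates are accepted first.
    """
    used_ciks = set()
    used_utilities = set()
    selected = []
    for cik, utility_id, prob in sorted(candidates, key=lambda c: c[2], reverse=True):
        if cik in used_ciks or utility_id in used_utilities:
            continue  # one side of this pair is already matched
        used_ciks.add(cik)
        used_utilities.add(utility_id)
        selected.append((cik, utility_id, prob))
    return selected
```

A greedy pass is easy to reason about but does not necessarily maximize the total probability across all pairs.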
core_sec10k__changelog_company_name
Changelog table tracking changes in the names each SEC 10-K filer has used.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
This table has no primary key.
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
This table is extracted from the same SEC 10-K filing header information as core_sec10k__quarterly_company_information. Each filing reports the full history of name changes associated with a company up to the date of that filing. Because individual companies may appear in multiple filings in the same year, and the same historical name changes will be reported in multiple years, the raw input data contains many duplicate entries, which are deduplicated to create this table. The original name change data only contains the former name and the date of the change.
Roughly 2% of all records describe multiple name changes happening on the same date (they are duplicates on the basis of central_index_key and name_change_date). This may be due to company name reporting inconsistencies or reporting errors in which the old and new company names have been swapped.
Columns
Identifier of the company in SEC database.
Date of last name change of the company.
Name of company prior to name change.
Name of company submitting SEC 10-K filing.
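The records that share a central_index_key and name_change_date (roughly 2% of the table, as noted above) can be isolated by grouping on those two columns. This is a minimal sketch using plain dictionaries as rows. The function name is hypothetical.

```python
from collections import defaultdict


def same_day_name_changes(records):
    """Group changelog rows by (central_index_key, name_change_date).

    Returns only the groups with more than one row, i.e. the cases where
    several name changes are recorded for one company on the same date.
    """
    groups = defaultdict(list)
    for row in records:
        groups[(row["central_index_key"], row["name_change_date"])].append(row)
    return {key: rows for key, rows in groups.items() if len(rows) > 1}
```

Inspecting these groups by hand is a reasonable way to tell swapped old and new names apart from genuine same-day changes.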
core_sec10k__quarterly_company_information
Quarterly time series of company and filing information extracted from SEC 10-K filing headers.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k, central_index_key
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
While each SEC 10-K filing is submitted by a single company, it often contains references to many other related companies. Information about these companies and the filing itself is contained in text headers at the beginning of the filing. This table contains data extracted from those headers. Each record in this table represents a single observation of a company in a single filing. The filer_count indicates which referenced company within a filing header the record corresponds to.
Because the same company may be referenced in a number of different filings submitted in the same reporting period or even on the same day, this table contains apparently duplicative records about many companies, which can be distinguished only by the filename associated with the filing they appeared in and their filer count. Note that references to a particular company may not be perfectly consistent across all filings in which it appears. The various company names, addresses, and other information associated with the company's unique and permanent central_index_key are later used as inputs into the probabilistic record linkage process.
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Identifier of the company in SEC database.
A counter indicating which observation of company data within an SEC 10-K filing header the record pertains to.
Name of company submitting SEC 10-K filing.
The end date of an SEC filing company's fiscal year, in MMDD format.
Taxpayer ID of the company with the IRS.
Two letter state code where company is incorporated.
Text description of the Standard Industrial Classification (SIC) code.
Four-digit Standard Industrial Classification (SIC) code identifying the company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.
Document control number used in the SEC EDGAR database. The first four digits can be used to access scans of the document in the SEC's Virtual Private Reference Room.
Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.
SEC Act through which the form was enacted, e.g. 1934 act.
Filing number used internally by the SEC to track the filing.
Phone number for utility contact 1.
Street address of the company's place of business.
Second line of the street address of the company's place of business.
City where the company's place of business is located.
State where the company's place of business is located.
Zip code of the company's place of business.
Zip code suffix of the company's place of business.
Non-US postal code of the company's place of business.
Street portion of the company's mailing address.
Second line of the street portion of the company's mailing address.
City of the company's mailing address.
State of the company's mailing address.
Zip code of the company's mailing address.
Zip code suffix of the company's mailing address.
Non-US postal code of the company's mailing address.
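Two of the columns above follow simple string conventions that are easy to handle in code: filename_sec10k becomes a full EDGAR URL by prepending the base path and appending .txt, and the fiscal year end is stored in MMDD format. The sketch below shows both. The function names are hypothetical, and the range checks on month and day are an added assumption rather than documented validation.

```python
EDGAR_BASE_URL = "https://www.sec.gov/Archives/edgar/data/"


def filing_url(filename_sec10k: str) -> str:
    """Rebuild the full source URL from the unique filename portion."""
    return f"{EDGAR_BASE_URL}{filename_sec10k}.txt"


def parse_fiscal_year_end(mmdd: str) -> tuple[int, int]:
    """Split an MMDD string into (month, day) integers."""
    if len(mmdd) != 4 or not (mmdd.isascii() and mmdd.isdigit()):
        raise ValueError(f"Expected four digits in MMDD format, got {mmdd!r}")
    month, day = int(mmdd[:2]), int(mmdd[2:])
    # Coarse sanity check only; it does not validate days per month.
    if not (1 <= month <= 12 and 1 <= day <= 31):
        raise ValueError(f"Month or day out of range in {mmdd!r}")
    return month, day
```

Keeping the fiscal year end as a (month, day) pair avoids attaching an arbitrary year to a value that does not carry one.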
core_sec10k__quarterly_exhibit_21_company_ownership
Quarterly time series of subsidiary company ownership data from the SEC 10-K Exhibit 21 attachments.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k, subsidiary_company_id_sec10k
Usage Warnings
This data has been extracted probabilistically using a machine learning model. It is incomplete in places and contains some errors, so it should not be treated as ground truth data.
PUDL does not currently update its copy of this data.
Additional Details
Exhibit 21 is an unstructured text or PDF attachment to the main SEC 10-K filing that is used to describe the subsidiaries owned by the filing company. It may or may not provide the percentage of the subsidiary that is owned by the filing company, or the location of the subsidiary.
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Name of subsidiary company.
Location of subsidiary company. This is the full US state name or country name and occasionally a two digit code that was not mapped to a full name during cleaning.
PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.
Proportion of the subsidiary owned by the filing company, if reported in Exhibit 21.
core_sec10k__quarterly_filings
Quarterly time series of metadata describing SEC 10-K filings.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
U.S. Securities and Exchange Commission (SEC) Form 10-K
- Primary key:
filename_sec10k
Usage Warnings
PUDL does not currently update its copy of this data.
Additional Details
Each SEC 10-K filing is submitted by a single company, but may contain information about numerous other companies. This table indicates the company submitting the filing, as well as some information about the overall filing. Each filing is guaranteed to have a unique filename, but ~1% of all filings are one company submitting the same form multiple times on the same day, so the filename is the only available natural primary key.
Columns
Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
Identifier of the company in SEC database.
Name of company submitting SEC 10-K filing.
Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.
Date on which the filing was submitted.
Version of exhibit 21 submitted (if applicable).
Date reported.
core_ferceqr__contracts
Contains information about contracts between companies selling and buying electricity market products.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 920 -- Electric Quarterly Report (EQR)
- Primary key:
This table has no primary key. The FERC EQR contracts table has no natural primary key.
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
Columns
Year-quarter corresponding to record. Formatted like YYYYq{1-4}.
The Company Identifier (CID) obtained through FERC's Company Registration system corresponding to the selling company.
An identifier beginning with the letter “C” and followed by a number (e.g., 'C1', 'C2') used to designate a record containing contract information. Note that these contract IDs may only be unique within the context of a particular seller, seller-buyer pair, or timeframe. FERC documentation of the field is limited.
The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.
The name of the purchaser of contract products and services. Unlike the seller_company_name, this name is not guaranteed to match a name in the core_ferceqr__quarterly_identity table. In addition, the same customer company may appear with different names in different contracts and transactions, since this field is an unconstrained string chosen by the seller.
If True, this field indicates the customer is an affiliate of the seller. The customer is an affiliate if it controls, is controlled by, or is under common control with the seller. This includes a division that operates as a functional unit. A customer of a seller who is an Exempt Wholesale Generator may be defined as an affiliate under the Public Utility Holding Company Act and the FPA.
The FERC tariff reference cites the document that specifies the terms and conditions under which a Seller is authorized to make transmission sales, power sales or sales of related jurisdictional services at cost-based rates or at market-based rates. If the sales are market-based, the tariff that is specified in the FERC order granting the Seller Market Based Rate Authority must be listed. If a non-public utility does not have a FERC Tariff Reference, it should enter “NPU” for the FERC Tariff Reference.
Unique identifier given to each service agreement that can be used by the Seller to produce the agreement, if requested. The identifier may be the number assigned by FERC for those service agreements that have been filed with and accepted by the Commission, or it may be generated as part of an internal identification system.
The date the contract was signed. If the parties signed on different dates, use the most recent date signed.
The date the terms of the contract reported in fields 18, 23 and 25 through 44 (as defined in the data dictionary) became effective. If those terms became effective on multiple dates (i.e., due to one or more amendments), the date to be reported in this field is the date the most recent amendment became effective. If the contract or the most recent reported amendment does not have an effective date, the date when service began pursuant to the contract or most recent reported amendment may be used. If the terms reported in fields 18, 23 and 25 through 44 have not been amended since January 1, 2009, the initial date the contract became effective (or absent an effective date the initial date when service began) may be used.
The date that the contract expires.
The date the contract actually terminates.
Description of terms that provide for the continuation of the contract.
F - Firm: For transmission sales, a service or product that always has priority over non-firm service. For power sales, a service or product that is not interruptible for economic reasons. NF - Non-firm: For transmission sales, a service that is reserved and/or scheduled on an as-available basis and is subject to curtailment or interruption at a lesser priority compared to Firm service. For an energy sale, a service or product for which delivery or receipt of the energy may be interrupted for any reason or no reason, without liability on the part of either the buyer or seller. UP - Unit Power Sale: Designates a dedicated sale of energy and capacity from one or more than one specified generation unit(s). N/A: To be used only when the other available Class Names do not apply.
Contracts with durations of one year or greater are long-term (LT). Contracts with shorter durations are short-term (ST).
5: Terms of the contract (if specifically noted in the contract) set for more than 0 minutes and less than or equal to 5 minutes (> 0 and ≤ 5 minutes). 15: Terms of the contract (if specifically noted in the contract) set for more than 5 minutes and less than or equal to 15 minutes (> 5 and ≤ 15 minutes). H: Terms of the contract (if specifically noted in the contract) set for more than 15 minutes and less than or equal to 6 hours (> 15 minutes and ≤ 6 hours). D: Terms of the contract (if specifically noted in the contract) set for more than 6 and up to 60 hours (> 6 and ≤ 60 hours). W: Terms of the contract (if specifically noted in the contract) set for over 60 hours and up to 168 hours (> 60 and ≤ 168 hours). M: Terms of the contract (if specifically noted in the contract) set for more than 168 hours up to, but not including, one year (> 168 hours and < 1 year). Y: Terms of the contract (if specifically noted in the contract) set for one year or more (≥ 1 year). N/A: Terms of the contract do not specify an increment.
FP: The product described may be sold during those hours designated as on-peak and off-peak at the point of delivery. OP: The product described may be sold only during those hours designated as off-peak at the point of delivery. P: The product described may be sold only during those hours designated as on-peak at the point of delivery. N/A: To be used only when the increment peaking name is not specified in the contract.
CB: Energy, capacity or ancillary services sold under a FERC-approved cost-based rate tariff. CR: An agreement under which a transmission provider sells, assigns or transfers all or portion of its rights to an eligible customer. CR-AD: Transmission capacity reassignments reported in Atlantic Daylight time. CR-AP: Transmission capacity reassignments reported in Atlantic Prevailing time. CR-AS: Transmission capacity reassignments reported in Atlantic Standard time. CR-CD: Transmission capacity reassignments reported in Central Daylight time. CR-CP: Transmission capacity reassignments reported in Central Prevailing time. CR-CS: Transmission capacity reassignments reported in Central Standard time. CR-ED: Transmission capacity reassignments reported in Eastern Daylight time. CR-EP: Transmission capacity reassignments reported in Eastern Prevailing time. CR-ES: Transmission capacity reassignments reported in Eastern Standard time. CR-MD: Transmission capacity reassignments reported in Mountain Daylight time. CR-MP: Transmission capacity reassignments reported in Mountain Prevailing time. CR-MS: Transmission capacity reassignments reported in Mountain Standard time. CR-PD: Transmission capacity reassignments reported in Pacific Daylight time. CR-PP: Transmission capacity reassignments reported in Pacific Prevailing time. CR-PS: Transmission capacity reassignments reported in Pacific Standard time. MB: Energy, capacity or ancillary services sold under the seller’s FERC-approved market-based rate tariff. T: The product is sold under a FERC-approved transmission tariff. NPU: The product is sold by a non-public utility that is required to file the EQR under section 220 of the Federal Power Act. OTHER: The product cannot be characterized by the other product type names.
Description of product being offered.
Quantity for the contract product identified.
Measure stated in the contract for the product sold.
The charge for the product per unit as stated in the contract.
Minimum rate to be charged per the contract, if a range is specified.
Maximum rate to be charged per the contract, if a range is specified.
Text description of rate. If the rate is currently available on the FERC website, a citation of the FERC Accession Number and the relevant FERC tariff including page number or section may be included instead of providing the entire rate algorithm. If the rate is not available on the FERC website, include the rate algorithm, if rate is calculated. If the algorithm would exceed the 300 character field limit, it may be provided in a descriptive summary (including bases and methods of calculations) with a detailed citation of the relevant FERC tariff including page number and section.
Measure stated in the contract for the product sold. FERC EQR.
The registered Balancing Authority (formerly called NERC Control Area) where service begins for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If receipt occurs at a trading hub, the term 'Hub' should be used.
The specific location at which the product is received if designated in the contract. If receipt occurs at a trading hub, a standardized hub name must be used. If more points of receipt are listed in the contract than can fit into the 50 character space, a description of the collection of points may be used. 'Various' alone, is unacceptable unless the contract itself uses that terminology.
The registered Balancing Authority (formerly called NERC Control Area) where a jurisdictional product is delivered and/or service ends for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If delivery occurs at the interconnection of two control areas, the control area that the product is entering should be used. If delivery occurs at a trading hub, the term 'Hub' should be used.
The specific location at which the product is delivered if designated in the contract. If delivery occurs at a trading hub, a standardized hub name must be used.
First date and time for the sale of the product at the rate specified.
Last date and time for the sale of the product at the rate specified.
core_ferceqr__quarterly_identity
Quarterly time series of individuals who filed FERC EQR for a company in a given quarter.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 920 -- Electric Quarterly Report (EQR)
- Primary key:
This table has no primary key. The primary key ought to be ['year_quarter', 'company_id_ferc', 'filer_unique_id'], where filer_unique_id is an employee-level ID. However, a handful of companies have erroneously reported the same filer_unique_id for multiple employees, resulting in duplicate records. In other cases, there appear to be multiple filings in a given quarter for the same company and filer, resulting in additional duplicates. Thus, there is no reliable natural primary key for the identity table.
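As a minimal sketch of how the duplication described above could be surfaced, the following counts how often each would-be key occurs. The column names come from the description above; the sample rows and the helper name find_duplicate_keys are made up for illustration.

```python
from collections import Counter

# Would-be primary key columns, as named in the description above.
KEY_COLUMNS = ("year_quarter", "company_id_ferc", "filer_unique_id")


def find_duplicate_keys(records):
    """Return each key tuple that appears more than once, with its count."""
    counts = Counter(tuple(rec[col] for col in KEY_COLUMNS) for rec in records)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical rows for illustration only; real values will differ.
sample_rows = [
    {"year_quarter": "2020q1", "company_id_ferc": "C000001", "filer_unique_id": "FS1"},
    {"year_quarter": "2020q1", "company_id_ferc": "C000001", "filer_unique_id": "FS1"},
    {"year_quarter": "2020q1", "company_id_ferc": "C000001", "filer_unique_id": "FA1"},
]

duplicates = find_duplicate_keys(sample_rows)
```

Any key returned here would need a manual decision about which record to keep, since the table gives no rule for choosing between duplicates.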
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings and unlabeled total rows, and often lacks IDs. See Notable Irregularities for details.
Columns
Year-quarter corresponding to record. Formatted like YYYYq{1-4}.
The Company Identifier (CID) obtained through FERC's Company Registration system.
(Seller) An identifier (e.g., “FS1”, “FS2”) used to designate a record containing Seller identification information. One record for each seller company must be included in an EQR for a given quarter. (Agent) – An identifier (i.e., “FA1”) used to designate a record containing Agent identification information. One record with the FA1 identifier must be included in an EQR for a given quarter.
Name of company submitting SEC 10k filing.
(Seller) – The name of the contact for the company authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. (Agent) – Name of the person who prepared the filing
Title of utility contact 1.
Street address for contact identified in contact_name.
City for contact identified in contact_name.
State for contact identified in contact_name.
Zip code for contact identified in contact_name.
Country (USA, Canada, Mexico, or United Kingdom) for contact identified in contact_name.
Phone number for contact identified in contact_name.
Email for contact identified in contact_name.
Sellers should indicate whether they have reported their sales transactions to index price publisher(s). If they have, Sellers should indicate specifically which index publisher(s) in Field Number 73.
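The year-quarter values in this table follow the YYYYq{1-4} pattern noted above. A minimal parsing sketch, assuming the value is stored as a plain string such as "2021q3" (the helper names are hypothetical, not part of PUDL):

```python
import re

# Four-digit year, a literal lowercase "q", then a quarter from 1 to 4.
_YEAR_QUARTER = re.compile(r"(\d{4})q([1-4])")


def parse_year_quarter(value):
    """Split a string like '2021q3' into (year, quarter) integers."""
    match = _YEAR_QUARTER.fullmatch(value)
    if match is None:
        raise ValueError(f"Not a YYYYq{{1-4}} value: {value!r}")
    return int(match.group(1)), int(match.group(2))


def quarter_start_month(quarter):
    """Return the first calendar month (1-12) of a quarter (1-4)."""
    return 3 * (quarter - 1) + 1
```

For example, parse_year_quarter("2021q3") yields (2021, 3), and quarter 3 starts in month 7.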
core_ferceqr__quarterly_index_pub
Quarterly time series of electricity market price indices that individual EQR filers reported transactions to.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 920 -- Electric Quarterly Report (EQR)
- Primary key:
year_quarter, company_id_ferc, filer_unique_id
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings and unlabeled total rows, and often lacks IDs. See Notable Irregularities for details.
Columns
Year-quarter corresponding to record. Formatted like YYYYq{1-4}.
The Company Identifier (CID) obtained through FERC's Company Registration system.
(Seller) An identifier (e.g., “FS1”, “FS2”) used to designate a record containing Seller identification information. One record for each seller company must be included in an EQR for a given quarter. (Agent) – An identifier (i.e., “FA1”) used to designate a record containing Agent identification information. One record with the FA1 identifier must be included in an EQR for a given quarter.
The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.
Name of index price publisher, which can be one of the following: AM - Argus Media, EIG - Energy Intelligence Group, Inc., IP - Intelligence Press, P - Platts, B - Bloomberg, PDX - Powerdex, SNL - SNL Energy
Description of the types of transactions reported to the index publisher identified in this record.
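Because company names occasionally disagree between tables, matching on company_id_ferc is the safer choice, as noted above. The sketch below looks up identity records by year_quarter and company_id_ferc and reports index publisher rows whose company name differs. It assumes both tables carry the company name in a column called seller_company_name, which may not match the actual schema; the function name is hypothetical.

```python
def find_name_mismatches(index_pub_rows, identity_rows):
    """Return index_pub rows whose name differs from every identity name
    recorded for the same (year_quarter, company_id_ferc)."""
    names_by_company = {}
    for row in identity_rows:
        key = (row["year_quarter"], row["company_id_ferc"])
        # Assumed column name; the identity table may label it differently.
        names_by_company.setdefault(key, set()).add(row["seller_company_name"])

    mismatches = []
    for row in index_pub_rows:
        key = (row["year_quarter"], row["company_id_ferc"])
        known_names = names_by_company.get(key)
        # Rows with no identity record are skipped rather than flagged.
        if known_names is not None and row["seller_company_name"] not in known_names:
            mismatches.append(row)
    return mismatches
```

Keying the lookup on the ID pair means a misspelled or abbreviated name cannot silently drop a row from the match.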
core_ferceqr__transactions
Contains information about individual electricity market transactions that took place during a given reporting quarter. Reported by the seller.
- Processing:
Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.
- Source:
FERC Form 920 -- Electric Quarterly Report (EQR)
- Primary key:
year_quarter, seller_company_id_ferc, transaction_unique_id
Usage Warnings
This table is experimental and/or a work in progress and may change in the future.
FERC data is notoriously difficult to extract cleanly, and often contains free-form strings and unlabeled total rows, and often lacks IDs. See Notable Irregularities for details.
Columns
Year-quarter corresponding to record. Formatted like YYYYq{1-4}.
The Company Identifier (CID) obtained through FERC's Company Registration system corresponding to the selling company.
An identifier beginning with the letter “T” and followed by a number (e.g., “T1”, “T2”) used to designate a record containing transaction information. One record for each transaction record must be included in an EQR for a given quarter. A new transaction record must be used every time a price changes in a sale. Note that these IDs are unique only within a single seller_company_id_ferc and year_quarter.
The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.
The name of the purchaser of contract products and services. Unlike the seller_company_name, this name is not guaranteed to match a name in the core_ferceqr__quarterly_identity table. In addition, the same customer company may appear with different names in different contracts and transactions, since this field is an unconstrained string chosen by the seller.
The FERC tariff reference cites the document that specifies the terms and conditions under which a Seller is authorized to make transmission sales, power sales or sales of related jurisdictional services at cost-based rates or at market-based rates. If the sales are market-based, the tariff that is specified in the FERC order granting the Seller Market Based Rate Authority must be listed. If a non-public utility does not have a FERC Tariff Reference, it should enter “NPU” for the FERC Tariff Reference.
Unique identifier given to each service agreement that can be used by the Seller to produce the agreement, if requested. The identifier may be the number assigned by FERC for those service agreements that have been filed with and accepted by the Commission, or it may be generated as part of an internal identification system.
Unique reference number assigned by the Seller for each transaction. May only be unique in the context of the seller's internal record keeping. This is an unrestricted text field.
First date and time the product is sold during the quarter.
Last date and time the product is sold during the quarter.
The date upon which the parties made the legally binding agreement on the price of a transaction.
If a broker service is used to consummate or effectuate a transaction, the term “Broker” shall be provided. If an exchange is used, the specific exchange that is used shall be selected from the Commission-provided list. Allowed values include BROKER, ICE, NODAL, and NYMEX.
FIXED: A fixed charge per unit of consumption. No variables are used to determine this rate. FORMULA: A calculation of a rate based upon a formula that does not contain an electric index component. ELECTRIC INDEX: A calculation of a rate based upon an index or a formula that contains an electric index component. An electric index includes an index published by an index publisher such as those required to be listed in Field Number 73 or a price published by an RTO/ISO (e.g., PJM West or Illinois Hub). RTO/ISO: If the price is the result of an RTO/ISO market or the sale is made to the RTO/ISO.
A code representing the so-called 'time zone' in which the sale was made. However, these codes do not actually correspond to time zones. Rather, they indicate an offset from UTC, which changes according to daylight saving time vs. standard time. The codes are as follows: AD: Atlantic Daylight Time (UTC-3), AP: Atlantic Prevailing Time, AS: Atlantic Standard Time (UTC-4), CD: Central Daylight Time (UTC-5), CP: Central Prevailing Time, CS: Central Standard Time (UTC-6), ED: Eastern Daylight Time (UTC-4), EP: Eastern Prevailing Time, ES: Eastern Standard Time (UTC-5), MD: Mountain Daylight Time (UTC-6), MP: Mountain Prevailing Time, MS: Mountain Standard Time (UTC-7), PD: Pacific Daylight Time (UTC-7), PP: Pacific Prevailing Time, PS: Pacific Standard Time (UTC-8), UT: Coordinated Universal Time (UTC+0)
F - Firm: For transmission sales, a service or product that always has priority over non-firm service. For power sales, a service or product that is not interruptible for economic reasons. NF - Non-firm: For transmission sales, a service that is reserved and/or scheduled on an as-available basis and is subject to curtailment or interruption at a lesser priority compared to Firm service. For an energy sale, a service or product for which delivery or receipt of the energy may be interrupted for any reason or no reason, without liability on the part of either the buyer or seller. UP - Unit Power Sale: Designates a dedicated sale of energy and capacity from one or more than one specified generation unit(s). N/A: To be used only when the other available Class Names do not apply.
Contracts with durations of one year or greater are long-term (LT). Contracts with shorter durations are short-term (ST).
5: Terms of the contract (if specifically noted in the contract) set for more than 0 minutes and less than or equal to 5 minutes (> 0 and ≤ 5 minutes). 15: Terms of the contract (if specifically noted in the contract) set for more than 5 minutes and less than or equal to 15 minutes (> 5 and ≤ 15 minutes). H: Terms of the contract (if specifically noted in the contract) set for more than 15 minutes and less than or equal to 6 hours (> 15 minutes and ≤ 6 hours). D: Terms of the contract (if specifically noted in the contract) set for more than 6 and up to 60 hours (> 6 and ≤ 60 hours). W: Terms of the contract (if specifically noted in the contract) set for over 60 hours and up to 168 hours (> 60 and ≤ 168 hours). M: Terms of the contract (if specifically noted in the contract) set for more than 168 hours up to, but not including, one year (> 168 hours and < 1 year). Y: Terms of the contract (if specifically noted in the contract) set for one year or more (≥ 1 year). N/A: Terms of the contract do not specify an increment.
FP: The product described may be sold during those hours designated as on-peak and off-peak at the point of delivery. OP: The product described may be sold only during those hours designated as off-peak at the point of delivery. P: The product described may be sold only during those hours designated as on-peak at the point of delivery. N/A: To be used only when the increment peaking name is not specified in the contract.
Description of product being offered.
Measure stated in the contract for the product sold. FERC EQR.
The registered Balancing Authority (formerly called NERC Control Area) where a jurisdictional product is delivered and/or service ends for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If delivery occurs at the interconnection of two control areas, the control area that the product is entering should be used. If delivery occurs at a trading hub, the term 'Hub' should be used.
The specific location at which the product is delivered if designated in the contract. If delivery occurs at a trading hub, a standardized hub name must be used.
The quantity of the product in this transaction record.
Actual price charged for the product per unit. The price reported cannot be averaged or otherwise aggregated.
For product names energy, capacity, and booked out power only. Specify the quantity in MWh if the product is energy or booked out power and specify the quantity in MW-month if the product is capacity or booked out power.
For product names energy, capacity, and booked out power only. Specify the price in $/MWh if the product is energy or booked out power and specify the price in $/MW-month if the product is capacity or booked out power.
Payments received for transmission services when explicitly identified.
transaction_quantity * price + total_transmission_charge.
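Two of the column descriptions above lend themselves to a short sketch: the time zone codes, which are really UTC offsets, and the total charge formula. The offsets below are copied from the code list above. The prevailing codes (AP, CP, EP, MP, PP) are given no offset there, so this sketch returns None for them instead of guessing. The function names are hypothetical, and the timestamps are assumed to be naive local datetimes.

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC offsets (hours) for the daylight and standard codes listed above.
FIXED_UTC_OFFSET_HOURS = {
    "AD": -3, "AS": -4,
    "CD": -5, "CS": -6,
    "ED": -4, "ES": -5,
    "MD": -6, "MS": -7,
    "PD": -7, "PS": -8,
    "UT": 0,
}

# Prevailing codes are listed above without a UTC offset.
PREVAILING_CODES = frozenset({"AP", "CP", "EP", "MP", "PP"})


def to_utc(naive_local, code):
    """Convert a naive local datetime to UTC, or return None for prevailing codes."""
    if code in PREVAILING_CODES:
        return None
    if code not in FIXED_UTC_OFFSET_HOURS:
        raise ValueError(f"Unknown time zone code: {code!r}")
    offset = timezone(timedelta(hours=FIXED_UTC_OFFSET_HOURS[code]))
    return naive_local.replace(tzinfo=offset).astimezone(timezone.utc)


def total_transaction_charge(transaction_quantity, price, total_transmission_charge):
    """Apply the formula given above: quantity * price + transmission charge."""
    return transaction_quantity * price + total_transmission_charge
```

Resolving a prevailing code would require knowing whether daylight saving time was in effect on the date in question, which is left out of this sketch.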
_core_eia860__cooling_equipment
Information about cooling equipment at generation facilities.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 6D)
- Primary key:
plant_id_eia, utility_id_eia, cooling_id_eia, report_date
Usage Warnings
This table has not been harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years.
Additional Details
A single plant can have several collections of cooling equipment, so each line in this table represents one cooling system at a given plant. You can determine how to link each set of cooling equipment to specific boilers in a plant via core_eia860__assn_boiler_cooling.
EIA-860 respondents are required to submit information about cooling equipment for plants over 100 MW - including steam, combined cycle, nuclear or solar thermal generators.
This table has been cleaned, but not tidied/normalized with other EIA 923 or 860 data. The same variables present in this table may show up in other tables and/or in other years. Once this table has been harvested, it will be converted into a core table.
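To illustrate the link to boilers mentioned above, here is a minimal sketch that groups association records by cooling system. It assumes core_eia860__assn_boiler_cooling exposes report_date, plant_id_eia, cooling_id_eia and a boiler identifier column called boiler_id. The boiler_id name and the function name are assumptions and should be checked against the association table's own column list.

```python
from collections import defaultdict


def boilers_by_cooling_system(assn_rows):
    """Map (report_date, plant_id_eia, cooling_id_eia) to the set of linked boilers."""
    linked = defaultdict(set)
    for row in assn_rows:
        key = (row["report_date"], row["plant_id_eia"], row["cooling_id_eia"])
        # "boiler_id" is an assumed column name for the boiler identifier.
        linked[key].add(row["boiler_id"])
    return dict(linked)
```

The resulting mapping can be used to look up the boilers served by any row of this table through its report date, plant ID and cooling ID.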
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Plant name.
EIA Identification code for cooling system (if multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant)
The EIA Utility Identification number.
The name of the utility.
County name.
Two letter US state abbreviation.
Actual installed cost for the existing chlorine discharge control system or the anticipated cost to bring the chlorine discharge control system into commercial operation
Actual or projected in-service date for chlorine discharge control structures and equipment
Actual installed cost for the existing system or the anticipated cost to bring the total system into commercial operation
Operating status of cooling system
The actual or projected in-service datetime of this cooling system
Type of cooling system
Type of cooling system
Type of cooling system
Type of cooling system
Name of river, lake, or water source that cooling water is discharged into
Name of river, lake, or water source that provides cooling water
Maximum distance from shore to intake
Average distance below water surface to intake
Design cooling water flow rate at 100 percent load at intake
Maximum distance from shore to outlet
Average distance below water surface to outlet
Percent of cooling load served by dry cooling components
The plant summer capacity associated with the operating generators at the plant
Actual installed cost for the existing cooling ponds or the anticipated cost to bring the cooling ponds into commercial operation
Cooling ponds actual or projected in-service date
Total surface area of cooling pond
Total volume of water in cooling pond
Maximum power requirement for cooling towers at 100 percent load
Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.
Actual installed cost for the existing cooling towers or the anticipated cost to bring the cooling towers into commercial operation
Cooling towers actual or projected in-service date
Types of cooling towers at this plant
Types of cooling towers at this plant
Types of cooling towers at this plant
Types of cooling towers at this plant
Maximum design rate of water flow at 100 percent load for the cooling towers
Type of cooling water source
Name of water source associated with the plant.
Type of cooling water
_core_eia860__fgd_equipment
Information about flue gas desulfurization equipment at generation facilities.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 860 -- Annual Electric Generator Report (Schedule 6E)
- Primary key:
plant_id_eia, so2_control_id_eia, report_date
Usage Warnings
This table has not been harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Sulfur dioxide control identification number. This ID is not a unique identifier.
The EIA Utility Identification number.
The name of the utility.
Two letter US state abbreviation.
Two digit state FIPS code.
County name.
County ID from the Federal Information Processing Standard Publication 6-4.
The actual or projected in-service datetime of this flue gas desulfurization system
Operating status code for flue gas desulfurization equipment.
Indicates whether flue gas can bypass the flue gas desulfurization unit.
Is saleable byproduct recovered by the unit?
Indicates if there is a sludge pond associated with this unit.
Indicates whether the sludge pond is lined.
Annual pond and land fill requirements for flue gas desulfurization equipment.
Actual installed costs for the existing systems or the anticipated costs of structures and equipment to bring a planned flue gas desulfurization system into commercial operation.
Other actual installed costs for installation of a flue gas desulfurization unit or the anticipated other costs pertaining to the installation of a flue gas desulfurization unit.
Actual installed costs for the existing sludge transport and disposal systems or the anticipated costs of sludge transport and disposal systems to bring a planned system into commercial operation.
Total actual installed costs for the existing flue gas desulfurization unit or the anticipated costs to bring a planned flue gas desulfurization unit into commercial operation.
Total number of flue gas desulfurization unit scrubber trains operated at 100 percent load.
Total number of flue gas desulfurization unit scrubber trains.
Ratio of all flue gas that is entering the flue gas desulfurization unit.
Actual flue gas exit rate, in cubic feet per minute.
Flue gas exit temperature, in degrees Fahrenheit.
Sulfur dioxide emission rate when operating at 100 percent load (pounds per hour).
Type of sulfur dioxide control equipment.
Type of sulfur dioxide control equipment.
Type of sulfur dioxide control equipment.
Type of sulfur dioxide control equipment.
Designed removal efficiency for sulfur dioxide when operating at 100 percent load. Reported at the nearest 0.1 percent by weight of gases removed from the flue gas.
Design fuel specifications for ash when burning coal or petroleum coke (nearest 0.1 percent by weight).
Design fuel specifications for sulfur when burning coal or petroleum coke (nearest 0.1 percent by weight).
Type of sorbent used by this sulfur dioxide control equipment.
Type of sorbent used by this sulfur dioxide control equipment.
Type of sorbent used by this sulfur dioxide control equipment.
Type of sorbent used by this sulfur dioxide control equipment.
Name of flue gas desulfurization equipment manufacturer.
Code corresponding to name of flue gas desulfurization equipment manufacturer.
Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.
The plant summer capacity associated with the operating generators at the plant
Name of water source associated with the plant.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
_core_eia923__monthly_cooling_system_information
Monthly time series of cooling system information at generation facilities - mainly water volumes and temperatures.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 8D)
- Primary key:
plant_id_eia, report_date, cooling_id_eia
Usage Warnings
Some years use a slightly different data definition.
Additional Details
In 2008 and 2009, EIA only reports annual averages, but in later years all data is monthly.
Note: This table has been cleaned, but not harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years. Once this table has been harvested, it will be removed from the PUDL database.
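Since 2008 and 2009 records hold annual averages while later records are monthly, it can help to label each record before aggregating. A minimal sketch, assuming report_date is available as a Python date (the function name is hypothetical, and years before 2008 are rejected because the description above does not cover them):

```python
from datetime import date

# Years in which EIA reported only annual averages, per the note above.
ANNUAL_ONLY_YEARS = frozenset({2008, 2009})


def reporting_frequency(report_date):
    """Return 'annual' for 2008-2009 records and 'monthly' for later years."""
    if report_date.year in ANNUAL_ONLY_YEARS:
        return "annual"
    if report_date.year > max(ANNUAL_ONLY_YEARS):
        return "monthly"
    raise ValueError(f"No reporting frequency described for {report_date.year}")
```

Separating the two groups avoids treating an annual average as if it were a single month's value.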
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
EIA Identification code for cooling system (if multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant)
Operating status of cooling system
Type of cooling system
Total hours the system operated during the month
Calculation method for flow rates (actual or method of estimation)
Method for measurement of temperatures
Maximum cooling water temperature at intake during the summer
Maximum cooling water temperature at intake in winter
Average cooling water temperature at intake point
Maximum cooling water temperature at intake
Maximum cooling water temperature at outlet in summer
Maximum cooling water temperature at outlet in winter
Average cooling water temperature at discharge point
Maximum cooling water temperature at discharge
Annual average consumption rate of cooling water
Monthly average consumption rate of cooling water
Monthly volume of water consumed at consumption point (accurate to 0.1 million gal)
Annual average discharge rate of cooling water
Monthly average discharge rate of cooling water
Monthly volume of water discharged at discharge point (accurate to 0.1 million gal)
Monthly average diversion rate of cooling water
Monthly volume of water diverted at diversion point (accurate to 0.1 million gal)
Annual average withdrawal rate of cooling water
Monthly average withdrawal rate of cooling water
Monthly volume of water withdrawn at withdrawal point (accurate to 0.1 million gal)
Amount of elemental chlorine added to cooling water annually. May be just the amount of chlorine-containing compound if schedule 9 is filled out.
Amount of elemental chlorine added to cooling water monthly. May be just the amount of chlorine-containing compound if schedule 9 is filled out.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
_core_eia923__yearly_byproduct_disposition
Annual time series of by-products reported by thermoelectric power plants with total steam turbine capacity of 100 megawatts or greater.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 8A)
- Primary key:
plant_id_eia, report_year, byproduct_description
Additional Details
All by-products other than steam sales are reported in thousand tons and are rounded to whole numbers. Steam sales are reported in MMBtu.
Columns
Four-digit year in which the data was reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Description of combustion by-product.
Reported unit of measure for combustion byproduct. MMBtu for steam, tons for all other byproducts.
Disposed by-products in landfill, to the nearest hundred tons or in MMBtu for steam sales.
Disposed by-products offsite, to the nearest hundred tons or in MMBtu for steam sales.
Disposed by-products in ponds, to the nearest hundred tons or in MMBtu for steam sales.
Sold by-products, in tons (to the nearest 100 tons) or, for Steam, MMBtu.
Stored by-products offsite, to the nearest hundred tons or in MMBtu for steam sales.
Stored by-products onsite, to the nearest hundred tons or in MMBtu for steam sales.
Used offsite by-products, to the nearest hundred tons or in MMBtu for steam sales.
Used onsite by-products, to the nearest hundred tons or in MMBtu for steam sales.
Total by-product disposal, to the nearest hundred tons or in MMBtu for steam sales.
Whether any combustion by-products were produced by a plant. 'Y' indicates no byproducts to report. The 'Y' and 'N' values do not align with expected values of reported byproducts. This column is messy and requires standardization.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
_core_eia923__yearly_byproduct_expenses_and_revenues
Annual time series of financial information related to combustion by-products reported by thermoelectric power plants with total steam turbine capacity of 100 megawatts or greater and that produced combustion by-products during the reporting year.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 8B)
- Primary key:
plant_id_eia, report_year
Additional Details
Cost data must be entered for all entries on Schedule 8A. Financial information includes operational and maintenance expenditures (opex), capital expenditures (capex), and by-products sales revenue.
Columns
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Four-digit year in which the data was reported.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
Cost of new structures and/or equipment purchased to reduce, monitor, or eliminate airborne pollutants.
Other amortizable expenses and purchases of new structures and/or equipment when such purchases are not allocated to a particular unit or item.
Cost of structures or equipment purchased to collect and dispose of objectionable solids or contained liquids.
Cost of new structures and/or equipment purchased to reduce, monitor, or eliminate waterborne pollutants, including chlorine, phosphates, acids, bases, hydrocarbons, sewage, and other pollutants.
Costs of materials and labor associated with the collection of bottom ash from all sources.
Costs of materials and labor associated with the disposal of bottom ash from all sources.
Other costs associated with the collection and disposal of bottom ash.
Costs of materials and labor associated with the collection of sulfur by-product (flue gas desulfurization).
Costs of materials and labor associated with the disposal of sulfur by-product (flue gas desulfurization).
Other costs associated with the collection and disposal of sulfur by-product (flue gas desulfurization).
Costs of materials and labor associated with the collection of fly ash from all sources.
Costs of materials and labor associated with the disposal of fly ash from all sources.
Other costs associated with the collection and disposal of fly ash.
Abatement costs of by-product collection that are not allocated to a particular expenditure, e.g., costs of operating an environmental protection office.
Abatement costs of by-product disposal that are not allocated to a particular expenditure.
Other abatement costs that are not allocated to a particular expenditure.
Sum of abatement costs associated with by-product collection.
Sum of abatement costs associated with by-product disposal.
Sum of other abatement costs associated with the collection and disposal of byproducts.
Costs associated with the collection/abatement of water pollution, e.g., equipment operation and maintenance of pumps, pipes, and settling ponds.
Costs associated with the disposal of water pollutants.
Other abatement costs associated with water pollutants.
Revenue from the sale of bottom ash by-product.
Revenue from the sale of flue gas desulfurization by-product.
Revenue from the sale of fly ash by-product.
Revenue from the sale of intermingled fly and bottom ash by-product.
Revenue from the sale of other by-products.
Total revenue from the sale of by-products.
_core_eia923__yearly_fgd_operation_maintenance
Annual time series of flue gas desulfurization (FGD) operation and maintenance expenses at generation facilities.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
EIA Form 923 -- Power Plant Operations Report (Schedule 8C)
- Primary key:
plant_id_eia, report_date, so2_control_id_eia
Usage Warnings
Some years use a slightly different data definition.
Additional Details
From 2008-2011 this table also reported operational characteristics that are now reported in 923 Air Emissions Control Info.
Note: This table has been cleaned, but not harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years. Once this table has been harvested, it will be removed from the PUDL database.
Columns
Date reported.
The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.
Sulfur dioxide control identification number. This ID is not a unique identifier.
Annual operation and maintenance expenditures for feed materials and chemicals for flue gas desulfurization equipment, excluding electricity.
Annual operation and maintenance expenditures for labor and supervision of flue gas desulfurization equipment, excluding electricity.
Annual operation and maintenance expenditures for land acquisition for flue gas desulfurization equipment, excluding electricity.
Annual operation and maintenance expenditures for maintenance, materials and all other costs of flue gas desulfurization equipment, excluding electricity.
Annual operation and maintenance expenditures for waste disposal, excluding electricity.
Annual total cost of operation and maintenance expenditures on flue gas desulfurization equipment, excluding electricity.
Indicates whether or not a plant has a flue gas desulfurization control unit.
Operating status code for flue gas desulfurization equipment.
Number of hours the flue gas desulfurization equipment was in operation during the year.
Electric power consumed by the flue gas desulfurization unit (in MWh).
Quantity of flue gas desulfurization sorbent used, to the nearest 100 tons.
Removal efficiency for sulfur dioxide (to the nearest 0.1 percent by weight) at tested rate at 100 percent load.
Removal efficiency for sulfur dioxide (to the nearest 0.1 percent by weight) based on designed firing rate and hours in operation (listed as a percentage).
Date of most recent test for sulfur dioxide removal efficiency.
Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
_core_phmsagas__yearly_distribution_by_install_decade
Annual time series of miles of mains and the number of services in operation at the end of the year by install decade.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 4)
- Primary key:
report_id, report_date, operator_id_phmsa, operating_state, install_decade
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Some columns contain subtotals; use caution when choosing columns to aggregate.
Additional Details
Records with an install_decade of total_decade are totals. Exclude them before aggregating across decades to avoid double counting.
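The warning about total_decade records can be illustrated with a minimal sketch. It assumes the rows are available as plain dictionaries. The column name `mains_miles` and the decade labels other than total_decade are hypothetical placeholders, not names confirmed by this table's schema.

```python
# Hypothetical sample rows: `mains_miles` and the decade labels
# "1990_1999" / "2000_2009" are illustrative assumptions.
# Only the "total_decade" value is taken from the table documentation.
rows = [
    {"report_id": 1, "install_decade": "1990_1999", "mains_miles": 10.0},
    {"report_id": 1, "install_decade": "2000_2009", "mains_miles": 5.0},
    {"report_id": 1, "install_decade": "total_decade", "mains_miles": 15.0},
]


def sum_mains_miles(records):
    """Sum miles of mains, skipping total rows and missing values."""
    return sum(
        r["mains_miles"]
        for r in records
        if r["install_decade"] != "total_decade" and r["mains_miles"] is not None
    )


# Summing every row counts the reported total on top of its components.
naive_total = sum(r["mains_miles"] for r in rows)
safe_total = sum_mains_miles(rows)
```

In this toy data the naive sum is twice the filtered sum, because the total row repeats the per-decade values.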
Columns
Report number of the PHMSA Gas utility submission.
Date reported.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
The type of gas delivered by the distribution pipeline.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
The decade the distribution pipeline was installed.
The miles of mains distribution pipeline.
Number of services in system at end of year.
_core_phmsagas__yearly_distribution_by_material
Annual time series of miles of mains and the number of services in operation at the end of the year by material for each gas distribution operator.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 1 - General)
- Primary key:
This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state and material. However, there are nulls in the operating_state across several years of reporting.
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Some columns contain subtotals; use caution when choosing columns to aggregate.
The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).
Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.
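Because this table has no enforced primary key, it can help to check how far the expected key is from being unique before joining or aggregating. The following is a minimal sketch, assuming the rows are plain dictionaries. The helper name `key_diagnostics` and the sample material values are hypothetical.

```python
from collections import Counter

# Expected (but not enforced) key columns, as listed in the table notes.
EXPECTED_KEY = ("report_id", "operator_id_phmsa", "operating_state", "material")


def key_diagnostics(records, key=EXPECTED_KEY):
    """Return (count of rows with a null operating_state, duplicated keys)."""
    null_states = sum(1 for r in records if r.get("operating_state") is None)
    counts = Counter(tuple(r.get(col) for col in key) for r in records)
    duplicated = {k: n for k, n in counts.items() if n > 1}
    return null_states, duplicated


# Hypothetical sample: two rows collide because operating_state is null.
sample = [
    {"report_id": 7, "operator_id_phmsa": 42, "operating_state": None, "material": "steel"},
    {"report_id": 7, "operator_id_phmsa": 42, "operating_state": None, "material": "steel"},
    {"report_id": 8, "operator_id_phmsa": 42, "operating_state": "CO", "material": "steel"},
]
null_states, duplicated = key_diagnostics(sample)
```

Rows that appear in `duplicated` need a decision (drop, sum, or inspect manually) before the expected key can be used as an index.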
Columns
Report number of the PHMSA Gas utility submission.
Date reported.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
The type of gas delivered by the distribution pipeline.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).
The miles of mains distribution pipeline.
Number of services in system at end of year.
_core_phmsagas__yearly_distribution_by_material_and_size
Annual time series of miles of mains and the number of services in operation at the end of the year by material and size of pipe.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 3)
- Primary key:
This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, main_size and material. There are nulls in the operating_state across several years of reporting.
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Some columns contain subtotals; use caution when choosing columns to aggregate.
The size ranges in main_size have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less).
The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).
Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.
Columns
Date reported.
Report number of the PHMSA Gas utility submission.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
The type of gas delivered by the distribution pipeline.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
Size range of mains. The size ranges have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less).
The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).
The miles of mains distribution pipeline.
Number of services in system at end of year.
A free-form text field containing notes about the other material type. This column should only contain values for rows where the material type is other.
_core_phmsagas__yearly_distribution_excavation_damages
Annual time series of excavation damages from various sources.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part D - Excavation Damage)
- Primary key:
report_id, damage_type, damage_sub_type
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Some columns contain subtotals; use caution when choosing columns to aggregate.
Columns
Report number of the PHMSA Gas utility submission.
Date reported.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
The type of gas delivered by the distribution pipeline.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
A high level category of excavation damage causes.
A sub-category of damage_type of excavation damage causes.
Number of instances of excavation damage.
_core_phmsagas__yearly_distribution_filings
Annual time series of filings (aka submissions) from gas distribution system operators.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report
- Primary key:
report_id, report_date, operator_id_phmsa
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Additional Details
This table contains information about the filer and filing type. This includes information about who filed but also whether this was an original filing or a correction.
Columns
Report number of the PHMSA Gas utility submission.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
Date reported.
Date on which the filing was submitted.
Initial date when filing was originally submitted.
Date when a correction filing was submitted.
Type of report submitted, either Initial or Supplemental.
When the data source was last updated.
PHMSA form revision identifier.
Name of representative who filed report.
Title of representative who filed report.
Phone number of representative who filed report.
Fax number of representative who filed report.
Email address of representative who filed report.
_core_phmsagas__yearly_distribution_leaks
Annual time series of total and hazardous leaks eliminated or repaired during the report year.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part C)
- Primary key:
This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, leak_severity and leak_source. There are nulls in the operating_state across several years of reporting.
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Some columns contain subtotals; use caution when choosing columns to aggregate.
Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.
Columns
Report number of the PHMSA Gas utility submission.
Date reported.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
The type of gas delivered by the distribution pipeline.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
Whether the leaks described in this record are all leaks or only hazardous leaks.
The cause of the leaks.
The number of mains distribution pipeline.
Number of services in system at end of year.
_core_phmsagas__yearly_distribution_misc
Annual time series of miscellaneous distribution information.
- Processing:
Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.
- Source:
Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B & C)
- Primary key:
This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, and operating_state. There are nulls in the operating_state across several years of reporting.
Usage Warnings
This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.
Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.
Columns
Date reported.
Report number of the PHMSA Gas utility submission.
PHMSA unique operator ID. A value of zero represents an unknown operator ID.
State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.
The number of known system leaks at the end of the report year scheduled for repair.
The number of known leaks on main at the end of the report year scheduled for repair.
The total number of hazardous leaks caused by a mechanical joint failure.
Total number of leaks repaired, eliminated, or scheduled for repair on federal land during the reporting year.
The average system service length in feet.
Estimated number of services with Excess Flow Valve in the system at end of reported year related to natural gas distribution.
Total number of services with Excess Flow Valve installed during reported year related to natural gas distribution.
Estimated number of services with manual service line shut-off valves installed in the system at end of report year related to natural gas distribution.
Total number of manual service line shut-off valves installed during reported year related to natural gas distribution.
Unaccounted-for gas as a fraction of total consumption for the 12 months ending June 30 of the reporting year. Calculated as follows: take (purchased gas + produced gas) minus (customer use + company use + appropriate adjustments), then divide by (customer use + company use + appropriate adjustments). Prior to 2017, this field was calculated with a different denominator (purchased gas + produced gas). Using this different calculation method during 2010-2017 ensured that no records had a negative fraction. For all other reporting years there are known and expected negative values in this column.
Number of Excavation Tickets received by the operator during the year, (i.e., receipt of information by the operator from the notification center).
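The unaccounted-for gas fraction described in the column list above can be sketched as a small function. This is an illustration of the stated formula only, not PUDL or PHMSA code. The function name, argument names, and the `use_supply_denominator` flag are hypothetical, and the flag simply selects the earlier denominator (purchased gas + produced gas) rather than inferring it from the report year.

```python
def unaccounted_for_gas_fraction(
    purchased,
    produced,
    customer_use,
    company_use,
    adjustments,
    use_supply_denominator=False,
):
    """Fraction of gas unaccounted for, per the column description.

    use_supply_denominator=True applies the earlier method, which divides
    by (purchased + produced) instead of by total use.
    """
    supplied = purchased + produced
    used = customer_use + company_use + adjustments
    denominator = supplied if use_supply_denominator else used
    if denominator == 0:
        return None  # the fraction is undefined without a denominator
    return (supplied - used) / denominator


# Illustrative numbers only: 100 units supplied, 95 units used.
current_method = unaccounted_for_gas_fraction(100, 0, 90, 5, 0)
earlier_method = unaccounted_for_gas_fraction(
    100, 0, 90, 5, 0, use_supply_denominator=True
)
```

With these inputs the two methods give slightly different fractions (5/95 versus 5/100), which is worth keeping in mind when comparing values across reporting years.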