Pippa, the DCSI mascotDCSI /National view
๐Ÿ–ฅ๏ธ

Best viewed on a larger screen

The DCSI dashboard includes interactive maps, charts, and data tables designed for desktop or tablet. You're welcome to explore on mobile, but the full experience is best on a wider screen.

Data Center Stress Index · National

The U.S. ledger

Every new data center consumes water, draws power from the grid, and occupies land. But the communities hosting them rarely have the data to evaluate whether they are getting a fair deal. This tool changes that.

Share
Find your county
See operators, dominant ownership, foreign-flag share, and stress score for any U.S. county hosting a data center
What’s at stake 2026 OUTLOOK
$725 billion. One year. Four hyperscalers.
Microsoft ($190B), Alphabet/Google ($180–190B), Meta ($125–145B), and Amazon (~$200B) collectively plan to spend up to $725 billion on data center infrastructure in 2026 alone — a 77% jump year-over-year. That spending lands as construction permits, water withdrawals, transmission upgrades, tax abatements, and zoning fights in specific counties. This dashboard tracks where it lands and what those communities get in return.
$190B
Microsoft 2026
$185B
Alphabet 2026
$135B
Meta 2026
$200B
Amazon 2026
Source: Q1 2026 earnings (Apr 29–30); Bloomberg, CNBC, Fortune. The Oracle/Related/Blackstone/PIMCO Saline Township MI data center alone closed $16B in financing on Apr 24.

What questions does this dashboard answer?

1. Which counties bear the heaviest resource burden from data centers — in water, energy, and land?
2. Are communities getting fair economic returns — jobs and wages — relative to the resources consumed?
3. How transparent are states about permitting, environmental review, and tax incentives for data centers?
4. Who actually owns these facilities — and how much of the industry is controlled by foreign entities?
5. What would happen to county stress scores if proposed federal or state legislation were enacted?
6. Where are the geographic hot spots — regions where high-burden data center counties cluster together?

What we calculate

Every U.S. county hosting a data center receives four scores, each normalized on a 0–1 scale:

Resource Burden Score (RBS)

Measures the combined strain on local water, energy, and land. Higher = more burden on community resources. Weighted: 40% water, 35% energy, 25% land.

Regulatory Opacity Score (ROS)

Measures how transparent the state regulatory environment is. Higher = less accountability in permitting, tax incentives, and environmental review.

Economic Return Score (ERS)

Measures what the community gets back in local jobs and wages per megawatt of installed capacity. Higher = better return. Uses Monte Carlo simulation for confidence intervals.

Composite Stress Index (CSI)

Blends all three scores: 50% burden, 30% opacity, 20% inverted return. Counties receive letter grades A through F. This is the single number that ranks communities by overall data center stress.

How we got the data

The entire index is built from 22 public data sources, all free. No proprietary datasets. No gated APIs. Total cost: under $500.

Facilities
PNNL DC Atlas, FracTracker Alliance, Epoch AI
Water Stress
WRI Aqueduct 4.0, spatially joined to counties
Energy
EIA-923 generation + company sustainability reports
Land Use
PNNL sqft, Census TIGER ALAND, MW-to-acreage estimates
Employment
BLS QCEW (NAICS 518210), imputed where suppressed
Regulatory
State Regulatory Index (42 jurisdictions), EPA FLIGHT
Boundaries
Census TIGER shapefiles, congressional districts
Ownership
Current pipeline: 327 foreign-flagged facilities across 13 countries, plus 17 records mis-tagged USA pending reclassification (V8 baseline was 297 across 14 — likely still undercount, see Foreign Ownership tab)
Policy
12 legislative proposals decomposed into formula parameters

What we know we don’t know

Transparency requires honesty about gaps. Here is what we cannot yet measure:

Energy is estimated, not metered. No public dataset tracks facility-level electricity consumption. V9 estimates consumption using LBNL’s 50% utilization methodology (down from V8’s 100% assumption). Operator-reported data from Meta (18 facilities) and Google/AWS PUE is used where available. For most facilities, consumption is modeled from nameplate MW.
247 counties lack acreage data. Not all facility sources report square footage or lot size. We estimate from MW capacity using industry heuristics (50 acres per 100 MW).
~65% of employment data is suppressed. BLS withholds county-level QCEW data to protect employer confidentiality. We impute from state totals proportional to MW capacity.
Behind-the-meter gas is invisible. An estimated 56 GW of on-site natural gas generation is not captured by any federal energy dataset.
Water consumption is now estimated from cooling type. V9 uses WUE benchmarks (Siddik et al. 2021/2022) to estimate gallons per MWh by cooling method. Meta’s reported per-facility water data validates the approach for 15 US facilities. However, cooling method is unknown for most facilities, and the national weighted average (0.3 L/kWh per Siddik et al. 2021) is used as default.
419 of 670 counties lack precise facility coordinates. Most facilities plot at county centroids. Individual addresses are not in public datasets.
641 facility duplicates resolved (ERR-035). FracTracker and V8 overlap caught and merged: 658 records merged, 111 net-new added. Pipeline now 4,984 facilities deduplicated.
The State Regulatory Index covers 42 of 50 states. Eight states lack sufficient data for all six transparency variables.

Methodological limitations

Peer review identified the following structural limitations. We disclose them here because transparency is a core value of this project.

Composite weights are policy judgments. The weights that combine water, energy, and land into the Resource Burden Score (0.4/0.35/0.25) and the weights that combine RBS, ROS, and ERS into the composite (0.5/0.3/0.2) reflect the analyst’s judgment about relative importance. They are not derived from the data. PCA validation shows the resulting rankings are robust (ρ > 0.96 vs. data-driven weights), but users should explore the entropy-weighted toggle for a purely data-driven alternative.
Water stress is not water consumption. WRI Aqueduct measures basin-level water stress — the ratio of withdrawals to available supply — not facility-specific water use. A data center with closed-loop cooling in a stressed basin scores the same as an evaporative facility. Facility-level gallons-per-MWh estimates by cooling type are planned for V9.
Economic data is heavily imputed. BLS suppresses ~65% of county-level employment data to protect employer confidentiality. The Economic Return Score for most counties is modeled from state totals, not observed. Counties with high imputation are flagged as Tier C data quality — interpret their ERS with caution.
Regulatory scoring has a single rater. The State Regulatory Index was scored by one analyst-AI team. An inter-rater reliability pilot (3 raters, 10 states, Cohen’s Weighted Kappa target ≥ 0.60) is planned but not yet complete. Until validated, SRI scores should be treated as preliminary.
Letter grades are relative rankings. The A-through-F grades use percentile thresholds, which means exactly 20% of counties fall into each tier by definition. An “A” means better than 80% of DC-hosting counties, not that a county has no data center impact. There are no absolute “safe” thresholds.
No external validation yet. The DCSI has not been compared against established environmental justice indices (EJScreen, SVI). This comparison is planned for V9 to demonstrate whether the DCSI captures impacts that existing tools miss.

Explore the dashboard

Each tab focuses on a different dimension of the data center landscape:

DCSI National
Interactive national map with county-level stress scores, filtering by ownership, classification, and capacity type. Includes Sankey ownership flows, burden breakdowns, and scatter analysis.
DCSI State
A one-stop-shop for any state: governor, research summary, county-level map, facility table, congressional districts, and ownership breakdown. Toggle between plain language, technical, and policy brief.
Policy Impact
Simulate how 12 real legislative proposals would change county stress scores. See proposed data centers on the map and explore what each would mean for local communities.
Methodology
Every formula, every weight, every assumption — fully disclosed. Includes PCA validation, entropy weighting sensitivity analysis, and Monte Carlo error propagation methodology.
Data Sources
Complete inventory of all 22 public data sources with file details, coverage statistics, and version information.
Community Resistance
119 data center projects cancelled, suspended, or withdrawn. Moratoriums enacted or proposed in 62 jurisdictions across 18 states. Communities are fighting back — and winning.
AI Accountability
Every AI-introduced error caught by the analyst during development. Severity, category, fix status, and the lesson: the analyst never abdicated to the machine.
Built in public. Under $500. All data free.
A project by Anna R. Dudley
DCSI National Overview

32 counties show high resource burden with below-average economic return

The Data Center Stress Index (DCSI) is a county-level composite ranking of U.S. counties hosting data centers. It measures three dimensions: the resource burden each facility places on local water, energy, and land; the regulatory opacity surrounding permitting and operations; and the economic return the community receives in jobs and wages per megawatt of installed capacity. Use this dashboard to explore which communities bear the heaviest costs — and which receive the least in return.

670
Counties with facilities
--
Facilities tracked
--
Est. TWh/yr consumed
--
Est. billion gal/yr water
--
Proposed / Under Review
--
Foreign HQ flagged
--
Cancelled / Suspended
Priority
View
Filter
Resource Burden ranks counties by the combined strain data centers place on water supply (WRI Aqueduct), energy grid (EIA-923), and land use (PNNL/USGS). Economic Return flips the lens to rank by local jobs and wages generated per MW of capacity (BLS QCEW).
Scores shows the raw composite index for each county. Hot-Spot Clusters highlights geographic concentrations of high-stress counties using spatial autocorrelation (Moran's I). Entropy Weights lets the data determine component weights instead of fixed formulas.
2026
Facilities: --Avg CSI: --Proposed: --
Filtered by:
Clear all filters
Composite Stress Index
Low concernHigh concern
Layers
Commercial Government Academic
Zoom in to see individual data centers

Interactive choropleth showing the Composite Stress Index for U.S. counties with data center facilities. Green = low stress, red = high stress. Sourced from PNNL, FracTracker, Epoch AI, WRI Aqueduct, EIA-923, and BLS. Scroll to zoom.

We have provided the most accurate information we had access to. See the Data Sources tab.
Facility Ownership
Click a segment to filter

Market share of U.S. data center capacity by parent company headquarters country or corporate operator. Foreign-owned facilities may face additional CFIUS scrutiny. Data sourced from PNNL IM3 Atlas and Epoch AI ownership records.

Ownership attribution reflects the best available public records. Some facilities are owned through subsidiaries or holding companies that may obscure the ultimate parent. See the Data Sources tab.
Resource Burden Breakdown
Click a state to see the math and filter

Each bar decomposes a state's aggregate resource burden into water stress, grid load, and land use components, the three pillars of the RBS formula (40/35/25 weighting). Water stress comes from WRI Aqueduct 4.0, grid load from EIA-923 plant-level generation, and land use from PNNL facility footprints over USGS NLCD developed area. Click any state to see the raw data and calculation.

Scores are normalized to a 0-to-1 scale. Underlying data quality varies by county and source. See the Data Sources tab for provider details.
Stress by Congressional District
Ranked by composite stress score

Districts ranked by composite stress index. This view connects resource burden to political accountability. Every district bar maps to a specific representative who can champion or block data center policy. District boundaries from Census TIGER/Line shapefiles, legislators from the @unitedstates project and Open States.

District-to-county mapping uses spatial overlay with a 1% area threshold to filter boundary slivers. See the Data Sources tab.
Economic Return Leaders
Counties with highest Jobs + Wages per MW

Counties delivering the most local economic value per megawatt of data center capacity. High ERS counties demonstrate that data centers can generate meaningful employment. Employment and wage data sourced from BLS Quarterly Census of Employment and Wages (NAICS 518210). MW capacity from PNNL, Epoch AI, FracTracker, and hand-researched overrides.

BLS suppresses employment data in counties with fewer than three reporting establishments to protect confidentiality. See the Data Sources tab.
Burden vs. Return: Which Counties Are Getting a Raw Deal?
Counties in the top-left quadrant bear the highest resource burden with the lowest economic return
Scroll to zoom / drag to pan / double-click to reset

Reading this chart: Each bubble is a county hosting data center facilities. Bubble size reflects total MW capacity.

The top-left quadrant is the danger zone: high resource burden (water, energy, land) paired with low economic return (few jobs, low wages per MW). These counties subsidize the digital economy without proportional benefit.

The bottom-right quadrant is the sweet spot: modest resource consumption alongside meaningful local employment and wages.

Click any county bubble to see its full scorecard, elected officials, and facility-level breakdown.

Scatter plot of Resource Burden Score (x-axis) vs. Economic Return Score (y-axis) for all counties with data center facilities. RBS is derived from WRI Aqueduct water stress, EIA-923 energy data, and USGS land cover. ERS is derived from BLS QCEW employment and wage data. Bubble color reflects the Composite Stress Index. Use your scroll wheel to zoom into dense clusters.

Scores reflect the best available public data. Some values are estimated where direct measurements are unavailable. See the Data Sources tab for complete sourcing.
DCSI State

Explore data center stress at the state level

Select a state to see its counties, facilities, ownership breakdown, resource burden, and economic return. All charts below filter to the selected state.

--
Counties
--
Facilities
--
Total MW
--
Proposed / Planned
--
Foreign HQ
--
Avg Stress Score
--
Governor
State Research Summary
Select a state to see a research summary.
Reporters Covering This
Select a state to see reporters covering data center issues.
Audience
Composite Stress Index
Low concernHigh concern
Facility Ownership
Within selected state
County Resource Burden
Water / Grid / Land breakdown per county
County Economic Return
Jobs + Wages per MW
Congressional Districts
Stress by district within state
Burden vs. Return
County-level scatter for selected state
Scroll to zoom / drag to pan / double-click to reset

Reading this chart: Each bubble is a county. Size reflects MW capacity.

The top-left quadrant = high burden, low return. The bottom-right = low burden, high return.

All Facilities
Select a state to see facilities

Policy Impact Lens

Select a policy proposal to see how it would change county stress scores. The map adjusts to show before/after impact. Proposed data centers are shown with dashed outlines. Use your scroll wheel to zoom into any region of the map.

Select a policy to see impact
Composite Stress Index
Low concernHigh concern
Existing Proposed
Policy impact scores are modeled estimates based on bill text and regulatory intent. They are not predictions. See the Data Sources tab for underlying data.
Score Impact Summary
How the selected policy changes county scores
Select a policy above to see a before/after analysis of how it would affect county stress scores, which counties move between tiers, and which proposed data centers would be blocked or modified.
Counties Most Affected
Top counties where scores change the most
Regulatory Timeline
Key state and federal regulatory actions affecting data centers, 2024–2026

This timeline tracks moratoriums, legislation, and executive actions that shape where and how data centers can be built. Events are color-coded by type: green for enacted restrictions, amber for proposed bills, red for vetoed or failed measures, and blue for federal actions.

Data Gap
Grid Interconnection Queue
200+ GW
Data center capacity waiting in U.S. interconnection queues (LBNL estimate). Fragmented across 7 RTOs with no unified public dataset. FERC large-load rule may standardize reporting.
Data Gap
Behind-the-Meter Generation
~56 GW
On-site gas turbines invisible to federal energy data (CEBA estimate). Bypasses utility metering, EIA-923 reporting, and state PUCs. Counties hosting BTM facilities have energy burden underestimated.
State Regulatory Transparency (ROS)
How transparent is each state's data center permitting, environmental review, and tax incentive process?

The Regulatory Opacity Score (ROS) measures how much information states disclose about data center permitting, environmental review, utility rates, tax incentives, and ownership. Higher opacity means less public transparency. Scores are derived from a 6-variable State Regulatory Index across 42 DC-hosting jurisdictions. Click any bar to filter the dashboard to that state.

Proposed Data Centers
Facilities announced or under construction, not yet operational
Cancelled & Withdrawn Projects
Facilities that were cancelled, withdrawn, denied, or rejected, evidence that democratic process and community voice can shape outcomes
Follow the Money

Public Subsidies to Data Center Operators

$16.2 billion in disclosed tax incentives, abatements, and megadeals tracked by Good Jobs First’s Subsidy Tracker (through Nov 2025). Five companies — Amazon, Apple, Meta, Google, and Microsoft — account for 86% of disclosed value. An additional 299 records have undisclosed amounts. A May 2026 audit identified at least $9B+ in additional megadeals approved Dec 2025–Apr 2026 that have not yet been added to the GJF tracker — making the true total significantly higher.

$16.2B
GJF disclosed (Nov 2025)
706
Subsidy records
86%
To top 5 firms
299
Undisclosed amounts
+$9B
Q1–Q2 2026 (not in tracker)
Top Recipients by Disclosed Value
Top States by Disclosed Value
Subsidy Awards by Year
Per-Job Subsidy Watch · May 2026 RECORD-SETTING
$77M for one job. $16M for 100. $11.7M for 125.
A May 2026 audit identified four data center deals where the public subsidy per permanent job exceeds $10 million. The top four together represent more than $10 billion in foregone public revenue against fewer than 275 permanent jobs. Reinvent Albany has called the JPMorgan deal “by far the largest government subsidy ever recorded…possibly the world.
Project · Location
Subsidy
Jobs
Per-job
JPMorgan Chase · Orangetown, Rockland Co., NY
$77M
1
$77.0M
Crusoe · Warrenton, Warren Co., MO NEW APR 2026
$1.6B
100
$16.0M
Stream / Apollo · STAMP, Genesee Co., NY PENDING
$1.46B
125
$11.7M
Nebius · Independence, Jackson Co., MO LARGEST DC ABATEMENT BY $
$6.26B
undisclosed
Sources: Reinvent Albany, First Alert 4 (Crusoe), Investigative Post (STAMP), KCUR (Nebius). Sen. Rachel May’s Stop Subsidizing Data Centers Act (NY S6453, Syracuse) would cap mid-size deals at $500K/job and ban subsidies for >100 MW facilities. (Sen. Liz Krueger sponsors the separate S9144 statewide moratorium bill.)
Recent mega-deals · not yet in GJF tracker
Q1–Q2 2026 approved subsidies
Combined: $10.4B+ · 5 deals
Approved Mar 2 2026
Nebius AI Factory
Independence, Jackson Co., MO
$6.26B abatement on $150B property + 98% real / 90% personal tax breaks for 20yr. Reportedly the largest data-center-specific tax abatement ever approved by a U.S. municipality (per Reinvent Albany, KCUR, Mar 2026); supporting council members unseated by voters in April.
Approved Apr 15 2026
Crusoe Warrenton
Warren Co., MO
$1.6B+ over 20yr at 75% personal property abatement. 100 permanent tech jobs @ $100K avg (per Fox2Now / Warren County Record). 4–2 board vote.
Pending · Hochul noncommittal
Stream / Apollo STAMP
Genesee Co., NY
$1.46B GCEDC PILOT + sales/mortgage exemptions. 125 jobs. Will use 500 MW (>80% of STAMP power).
Approved Jan 22 2026
Google Franklin Furnace
Scioto Co., OH
75% property abatement + 15-yr PILOT @ $500K/yr. $1B project, up to 1.7M sqft / 10 phases. NDA controversy prompted statewide ban-NDAs bill.
Pending · Implementation
WV HB 2014 framework
West Virginia · statewide
Preempts local control for any DC ≥90 MW. Diverts only 30% of property tax to host county; 50% to state PIT-reduction fund. Rules through 2026.
Clawbacks & community wins
When the subsidy gets stopped

A new pattern in 2026: voters, county commissions, and local ordinances are reversing or rejecting data center tax breaks. Five examples since February alone.

Mar 13 · Rejected unanimously
Brazoria County, TX
Nightpeak Energy “Old Ocean” $3.5B project
Commissioners unanimously rejected the abatement package. Cited noise, water demand, and grid load.
Apr 2026 · Voter ouster
Independence, MO
Nebius $6.26B (already approved)
Two council members who supported the deal were defeated in the April 2026 election — a democratic-accountability route to subsidy reversal even after approval.
Apr 2026 · Ordinance pending
Brevard County, FL
All future deals
Commissioners directed staff to ADD data centers to the list of businesses ineligible for ad valorem economic-development exemptions.
Apr 21 · No incentives offered
Pittsylvania County, VA
Stack Infrastructure / Berry Hill
Local government opted to charge full tax rate, no abatement. Created a new tax classification for DC equipment without accelerated depreciation.
Apr 24 · Postponed
Ferguson, MO
Mixed-use development + DC
Tax-break vote postponed amid public opposition. Public hearing process continues.
Context: Indiana’s $8.5B total is dominated by a single Amazon megadeal ($8.3B in 2024). Washington’s 258 records reflect the state’s long-running data center tax incentive program. Virginia, despite hosting more data centers than any other state, shows only $141M in disclosed subsidies — likely reflecting the use of NDAs and nondisclosure agreements that prevent public reporting. The 299 undisclosed-value records suggest significant subsidy activity that is invisible to the public.
Source: Good Jobs First Subsidy Tracker. Full-text search for “data center” across 722,000 records. Some records may include non-DC companies with data center mentions in program notes. State-by-state comparisons are limited by uneven disclosure practices. As noted by Good Jobs First: “Due to uneven disclosure, it is NOT appropriate to make state-by-state comparisons.” Data current through November 2025.
Sovereign Capital & Foreign Ownership

The audit you didn’t see coming

The DCSI flags 297 foreign-owned facilities in its current dataset. After a May 2026 sovereign-capital audit, we estimate the true count is closer to 400–450. The gap comes from acquisitions that have closed since the underlying source data was compiled, sovereign LPs that don’t show up in operator HQ fields, and crypto-to-AI conversions that fall outside the colocation taxonomy. This page documents what’s in the dataset today, what’s missing, and how to read the difference.

297
Currently flagged
~80–150
Estimated missing
12+
Sovereign LPs tracked
$40B
Largest pending acquisition
Editorial finding
“Abu Dhabi state capital (Mubadala and MGX) holds disclosed equity positions in three U.S. AI cloud operators — Crusoe, Lambda, and Stargate — for a combined $7B+ in equity exposure. None are currently flagged as foreign-owned in the DCSI dataset.”
May 2026 sovereign capital audit. Sources: Crusoe Series E disclosures (Oct 2025), Lambda Series E (TechCrunch Nov 2025), Stargate JV cap table (OpenAI announcements).
Stargate Program Audit · May 2026 7 U.S. SITES
~7,700 MW. 7 U.S. counties. ~$80B+ disclosed capex. UAE sovereign exposure on every site.
The Stargate JV (OpenAI 40% / SoftBank 40% / Oracle / MGX 7%) has now named seven U.S. data center sites. Every one carries indirect MGX/UAE sovereign equity exposure via the JV structure. Two sites (Lordstown OH, Milam TX) have direct SoftBank Group build/operate roles. Five of seven sites face active community pushback, lawsuits, or political opposition.
Site · County
MW
Capex
Status
Pushback
Abilene · Taylor, TX BTM
1,200
$11.6B+
Phase 1 live (Sept 2025)
Mod
Saline Township · Washtenaw, MI
1,400
$16B
Construction (Apr 24 2026 close)
Shackelford / Frontier · Shackelford, TX BTM
1,400
n/d
Construction
Low
Project Jupiter · Doña Ana, NM
~1,000
$165B IRB ceiling
Permitting
⚠ Suit
Lordstown · Trumbull, OH
600–1,000
$375M land
Site prep / litigation
⚠ Suit
Milam / Freebird · Milam, TX
1,200
~$5–7B est.
Phase 1 construction
Mod
Lighthouse · Ozaukee (Port Washington), WI
~902
$15B+
Construction
⚠ Recall
BTMBehind-the-meter — site bypasses utility interconnect via on-site (typically gas) microgrid generation. BTM sites avoid ERCOT / RTO queue waits but also escape state PUC oversight, ratepayer cost-allocation rules, and most public IRP transparency. Crusoe’s Abilene microgrid + the Vantage / Stargate Frontier campus next door form a ~2.6 GW BTM gas cluster in West Texas. Crusoe Permian sites (Pecos, Midland) and Pacifico’s 5 GW GW Ranch off-grid project extend the same model.
Sources: OpenAI Five New Stargate Sites announcement (Sept 2025), Related Digital Saline financing close (Apr 2026), DCD Lordstown ban, Truthout Doña Ana lawsuit coverage. Stargate UAE (1 GW Abu Dhabi) is separate; not counted in the U.S. 7 GW figure.
Counter-program context: Anthropic’s Project Rainier (New Carlisle, IN) commits ~2.2 GW with Amazon Trainium2 chips at full buildout (~$26B+). Phase 1 already operating with 500K Trainium-2 chips. Anthropic-Google-Broadcom adds another 3.5 GW pipeline. Anthropic’s near-term operating MW likely exceeds Stargate’s through 2026.

1 · Sovereign Capital Network

Force-directed graph of how foreign sovereign wealth funds anchor U.S. AI infrastructure operators. Click any node for expanded context. Built directly from May 2026 audit findings.

2 · The 12 Sovereign LPs

Each tile = a foreign sovereign or sovereign-fund-of-fund LP and the U.S. operators it touches. Click any tile for the full bio, asset list, and source citations.

3 · By country

Total facilities and capacity by HQ country, aggregated from current dataset (does not yet include audit-identified gaps).

4 · The Aligned acquisition — H1 2026

Single largest pending control change

In October 2025, the AI Infrastructure Partnership (AIP) consortium — co-founded by BlackRock GIP, Microsoft, Abu Dhabi’s MGX, and Nvidia, with xAI joining the acquiring group and Kuwait Investment Authority and Singapore’s Temasek as anchor LPs — agreed to acquire 100% of Aligned Data Centers from Macquarie at a $40B enterprise value. Some reporting now puts close in late 2026 rather than H1.

Aligned operates 50 campuses with >5 GW of operational and planned capacity (78 data centers per Aligned disclosures). Major U.S. sites: Plano TX (HQ), Phoenix, Ashburn VA, Salt Lake City, Atlanta, Dallas, Hillsboro OR, Columbus OH, Chicago, and Northern Virginia.

On close (H1 2026), Aligned’s controlling ownership flips from Australian (Macquarie) to a UAE / Kuwait / Singapore / U.S. mixed sovereign-controlled posture. The DCSI should consider every Aligned U.S. campus as foreign-controlled post-close. This single transaction may add 40–50 facilities to the foreign-owned count.

Timeline
2018 — Macquarie acquires Aligned (2 sites, 85 MW)
2025 expansion — Aligned grows to 50 sites / 5 GW
Oct 2025 — AIP $40B acquisition announced
H1 2026 — Expected close

5 · CFIUS-scrutinized transactions

First-ever CFIUS judicial enforcement
Suirui — Jupiter Systems
Hayward CA. Hong Kong / Suirui Group acquisition forced into divestiture by July 2025 presidential order. DOJ enforcement suit filed February 2026 — the first-ever judicial enforcement of a CFIUS divestment order.
FCC NPRM · Apr 9 2026
Chinese carriers + Huawei / ZTE equipment ban
Building on the 2025 vote, the FCC issued a formal Notice of Proposed Rulemaking on Apr 9 2026 to (1) bar China Mobile / Telecom / Unicom from any U.S. data-center interconnect, and (2) extend “covered list” equipment restrictions (Huawei, ZTE, Hikvision, Dahua, Hytera) to commercial colocation and hyperscale campuses. Comment period closes Jun 9 2026; final order expected Q3.
Trump “Fast-Track” lane
February 2025 EO — America First Investment Policy
Created a CFIUS Fast-Track lane for “ally” sovereign wealth (UAE, Saudi, Singapore) into U.S. AI infrastructure. Triggered the wave of UAE/Saudi capital documented above.

6 · What we don’t see

Methodology gap

The current DCSI flag is binary: a facility’s operator HQ is either US or non-US. This misses three categories of foreign control:

  1. Sovereign-fund-of-fund LP exposure. When AustralianSuper holds a $1.5B minority + board seat in DataBank, every DataBank U.S. site has Australian sovereign exposure even though the operator is US-domiciled.
  2. Pending acquisitions. Aligned’s 50 campuses are still flagged as Macquarie/Australian today, but flip to AIP/UAE-Kuwait-Singapore on H1 2026 close. Same for SoftBank’s $4B DigitalBridge buy (Vantage, DataBank historical, Switch).
  3. Crypto-to-AI conversions. Australian-listed IREN’s Childress, TX 750 MW campus, Bitmain’s first U.S. factory, Genesis Digital Assets’ 600 MW Texas footprint — all foreign operators outside traditional colocation taxonomies.

Proposed methodology change: Add a control-weighted flag. Any facility whose operator has >25% foreign sovereign or sovereign-fund-of-fund LP triggers a “sovereign-exposed” tag, even if the operator HQ remains US.

7 · What you can do

For Congress

CFIUS reform: extend mandatory review thresholds to sovereign-fund-of-fund LP positions >25% in critical infrastructure including AI data centers.

For state legislators

Mandate foreign-LP disclosure as a condition of any data center tax incentive. Florida HB 1007 (2026) attempted this but the NDA-prohibition provision was stripped before passage.

For reporters

Track AIP, Mubadala, MGX, GIC, CPP cap-table announcements. Bloomberg’s Josh Saul, WSJ’s Sebastian Herrera, Inside Climate’s Kristoffer Tigue are among the reporters most actively covering this beat.

Sources: Bloomberg, Financial Times, CNBC, Data Center Dynamics, Reuters, AP, OpenAI announcements, Aligned Data Centers blog, Macquarie Asset Management, GIC newsroom, CPP Investments, BlackRock IR, AGBI, TechCrunch, US DOJ. Full citation list available on request. Audit conducted May 2-3, 2026 by independent agent research; findings reviewed by Anna R. Dudley.

Methodology

How the DCSI is calculated, what data it uses, and where human judgment enters.

The Analytical Question

For every U.S. county hosting a data center: what resource burden is that facility placing on the community, and what economic return is the community receiving?

Score Architecture

Resource Burden Score (RBS)

RBS = 0.4 x Water + 0.35 x Energy + 0.25 x Land

Percentile-rank normalization across all 670 DC-hosting counties. Water draws on WRI Aqueduct 4.0 baseline water stress with seasonal indicators. Energy is estimated facility-level consumption (MW × utilization × 8,760 hours) following LBNL methodology, supplemented with EIA-923 generation where local power plants exist. Land is facility footprint as a share of county land area, with MW-derived estimates for counties missing observed acreage.

Regulatory Opacity Score (ROS)

County ROS = State Regulatory Index x (1 + Pushback Modifier + COI Modifier)

6-variable State Regulatory Index across 42 DC-hosting jurisdictions: environmental review, resource disclosure, tax incentive accountability, permitting openness, utility rate transparency, and ownership disclosure. Each variable scored 0–3 against statute citations. Community pushback applies a −10% modifier; the Company Opacity Index applies up to a −5% modifier where local operators are demonstrably more transparent.

Economic Return Score (ERS)

ERS = 0.5 x Jobs_per_MW + 0.5 x Wage_Premium

Wage Premium is data center industry average pay divided by county all-industry average pay. Monte Carlo error propagation (1,000 iterations) produces 90% credible intervals. Counties are tagged Tier A (high confidence), B (moderate), or C (imputed inputs). Mean wage premium across DC counties: ~1.9x.

Composites

Burden Mode: 0.5 x RBS + 0.3 x ROS + 0.2 x (1 - ERS)
Return Mode: 0.5 x (1 - ERS) + 0.3 x RBS + 0.2 x ROS

Limitations & Caveats May 2026 red-team review

The DCSI is an advocacy-grade composite built by a single analyst working in public, not a peer-reviewed academic index. Following an independent May 2026 methodology red-team, these caveats are now foregrounded:

  • Weights are author-selected, not empirically derived. The 0.40 / 0.35 / 0.25 (Water/Energy/Land) and 0.50 / 0.30 / 0.20 (RBS/ROS/inverted-ERS) splits reflect editorial policy priorities. PCA and entropy-weighting cross-checks confirm rank stability across the bulk of the distribution but DO show divergence at the F-grade boundary. Plausible weight perturbations (e.g., 0.30 / 0.30 / 0.40) flip the letter grade for an estimated 10–15% of counties.
  • Percentile-rank normalization creates a closed system. A county’s grade is defined relative to the other 669 DC hosts. Adding facilities to the dataset can mechanically shift other counties’ grades without their own metrics changing. The grading system is structurally relative, not absolute.
  • 62% of facilities have unknown cooling type. Their water sub-score uses the Siddik et al. 2021 national weighted average (0.3 L/kWh). For these facilities, the “Water” signal is effectively a deterministic function of estimated kWh, not directly observed water.
  • 247 counties (37%) have MW-derived land acreage. Energy and Land sub-scores in those counties share an underlying MW input, introducing multicollinearity. The three-dimensional RBS framing functionally collapses to ~2 dimensions for ~37% of counties.
  • ROS is single-rater. The 6-variable State Regulatory Index across 42 jurisdictions was scored by one analyst. Inter-rater reliability pilot (Cohen’s ฮบ) deferred. ROS should be read as “preliminary” until a second-rater audit is complete.
  • 65% of QCEW employment data is suppressed by BLS and imputed from state totals. The wage-premium calculation (1.9x mean) is heavily right-skewed; median ratio is likely lower.
  • Tier C data quality counties get the same letter grade as Tier A counties. The grade card does not visually disclose tier. A “Tier C F” should not be read as equivalent to a “Tier A F.”
  • Universe = DC-hosting counties only. Spillover effects on watershed-adjacent and transmission-corridor counties are not captured. The index is more honestly described as a “Host County Stress Index.”
  • Granger causality results on the 11-year window are not multiple-comparisons corrected. Treat any county-level “DC announcements Granger-cause water stress” finding as exploratory, not confirmatory.
  • “Built for under $500” reflects hardware/hosting/data costs only. It does not include researcher labor (which is the dominant cost of the project).

These caveats are deliberately listed in the methodology section, not buried in a footnote. Readers should weigh letter grades alongside their data quality tier, and policy advocates citing the dashboard should disclose that the weights are policy-weighted rather than empirically derived.

Company Opacity Index (COI)

12-indicator binary scoring of corporate transparency across 142 data center companies. Indicators span environmental reporting, energy disclosure, water disclosure, PUE reporting, renewable targets, cooling technology, community engagement, tax incentive disclosure, beneficial ownership, supply chain transparency, third-party audits, and incident reporting.

COI = 1.0 − (indicators_disclosed / 12)

Scale: 0 = fully transparent, 1 = fully opaque. Tiers: Transparent (≤0.25), Open (0.26–0.50), Partial (0.51–0.65), Opaque (0.66–0.80), Dark (>0.80). County-level COI is capacity-weighted (larger facilities contribute more) and feeds the ROS calculation as a transparency credit of up to −5%.

Energy Efficiency Score (EES)

Facility-level composite derived from PUE, grid dependency, cooling impact, backup emissions, and a transparency penalty. Quality flags: measured (real data for 3+ components), partial (1–2 real data points), insufficient (fully imputed — default 0.593).

EES = f(PUE_efficiency, grid_dependency, cooling_impact, backup_emissions) × (1 − transparency_penalty)

Higher EES = more efficient. EES is currently displayed as an informational overlay and is not factored into the composite CSI; integration into RBS Energy is under evaluation.

Data Quality Tiers

Every county receives a data quality tier reflecting how much of its scoring input comes from observed source data versus estimates or imputations.

Tier A: ≥3 of 4 dimensions observed  |  Tier B: 2 of 4 observed  |  Tier C: ≤1 observed

The four dimensions assessed are: (1) water stress (observed via Aqueduct vs. state-median imputed), (2) energy (EIA-923 supply data vs. MW demand proxy), (3) land (source acreage vs. MW-estimated), and (4) employment (BLS QCEW observed vs. state-residual imputed). Counties with Tier C data have the majority of their score driven by estimates and should be interpreted with caution. The tier is displayed in the county tooltip, scorecard, and sidebar.

Spatial Autocorrelation (Moran's I)

Measures whether data center stress clusters geographically or distributes randomly. A positive Moran's I indicates clustering: counties near high-stress counties tend to also be high-stress, suggesting regional infrastructure strain rather than isolated incidents. Computed on the CSI values using queen contiguity weights from the Census TIGER county shapefile.

I = (N / W) x (Sum_ij w_ij(x_i - x_bar)(x_j - x_bar)) / (Sum_i(x_i - x_bar)^2)

Local Moran's I (LISA) identifies specific hot-spot and cold-spot clusters. Counties flagged as hot spots (High-High) are surrounded by other high-stress counties. These are the regional pressure zones where infrastructure strain compounds.

V10 Methodology Supplement May 2026 audit response

Following the May 2026 red-team review, six methodological gaps were addressed. Each is documented below.

1. ROS Inter-Rater Reliability Pilot

An independent second rater scored 10 randomly-selected states blind to the original DCSI ROS values. The rater used the same 6-variable rubric (permit transparency, environmental review, energy disclosure, water disclosure, tax incentive accountability, ownership disclosure) on a 0–3 scale.

Blind second-rater results (10 states)
State
Total/18
ROS
Tier
New Jersey
12
0.333
Transparent
Connecticut
11
0.389
Relatively Transparent
Oregon
11
0.389
Relatively Transparent
Virginia
9
0.500
Mixed
Wisconsin
8
0.556
Mixed-Opaque
Texas
4
0.778
Opaque
Iowa
4
0.778
Opaque
Tennessee
3
0.833
Highly Opaque
South Dakota
3
0.833
Highly Opaque
Mississippi
3
0.833
Highly Opaque

Pilot finding: Reliable agreement on the extreme tiers (NJ Transparent, TN/SD/MS Highly Opaque). Rater identified 8 specific variable-state cells with genuine scoring ambiguity — primarily on environmental_review (Oregon, NJ, Wisconsin) where partial regimes (EFSC, EJ Law, WEPA) sit between “none” and “full NEPA-equivalent.” Cohen’s weighted κ computation against original DCSI scores is pending; preliminary indication is moderate agreement (ฮบ โ‰ˆ 0.55–0.70) — sufficient to call ROS reliable at the tier level but not at the variable level. Future work: third-rater calibration on the 8 ambiguous cells.

2. Greenfield vs. Peering-Hub Disaggregation

The DCSI’s 670 host counties are not homogeneous. ~15 fiber-mature peering hubs (Loudoun VA, Santa Clara CA, Cook IL, Manhattan NY, Maricopa AZ, Dallas TX, etc.) capture network-effect economic returns that the other ~655 counties cannot replicate. Loudoun’s $700M+ in DC tax revenue is the ceiling, not the median.

Peering hubs (n≈15) collect a median of ~$400M+/year in DC-related local revenue. Greenfield counties (n≈655) collect a median of under $4M/year. When the dashboard cites “low ERS” outcomes for the median host county, the comparison set should be greenfield, not the peering-hub anchored mean. The list of 15 peering-hub FIPS is hardcoded in the dashboard JS as PEERING_HUB_FIPS and excluded from greenfield aggregations.

3. Weight Sensitivity Sweep

The composite’s default weights (RBS = 0.40 Water + 0.35 Energy + 0.25 Land; CSI = 0.50 RBS + 0.30 ROS + 0.20 inverted ERS) are author-selected. Plausible alternatives:

Scheme
RBS weights
Est. grade flips vs. default
Direction
Default (DCSI)
0.40 / 0.35 / 0.25
— baseline
Land-heavy
0.30 / 0.30 / 0.40
~80–100 counties (~12–15%)
Western counties harden up
Energy-led
0.30 / 0.50 / 0.20
~60–80 counties (~9–12%)
VA/NC clusters worsen
Equal weights
0.33 / 0.33 / 0.33
~40–60 counties (~6–9%)
Distributed shifts, no regional bias
PCA-derived
0.42 / 0.41 / 0.17
~30–50 counties (~5–7%)
Land-data-poor counties soften

Most counties are weight-robust (grade does not flip across plausible perturbations). The ~10–15% that DO flip cluster at the D/F boundary — meaning marginal-call counties should not be cited as definitively “F-graded” without acknowledging weight sensitivity.

4. Next-Best-Use Opportunity Cost

The DCSI “burden” framing implies data center development is net-negative. A fairer comparison is opportunity cost: what would the same 250 MW of grid capacity, 200 acres of industrial-zoned land, and $5B of capex produce if deployed elsewhere?

Alternative use of 250 MW + 200 acres
Permanent jobs
Annual tax base
Note
Hyperscale data center (250 MW)
~50–150
~$50–700M
Loudoun is the ceiling
Advanced manufacturing (semiconductor fab)
~1,500–3,000
~$30–80M
TSMC AZ benchmark
Battery / EV plant
~1,000–2,500
~$25–60M
Ford F-150 Lightning TN benchmark
Pharma fill-finish facility
~800–1,500
~$15–40M
Lilly NC benchmark
Light industrial / warehouse
~300–700
~$5–15M
Lower MW / land productivity
Conservation / brownfield retire
~0
~$0
Ecological baseline

Data centers are a high-tax-base, low-employment use. The right comparison frame is not “$77M for 1 job is bad” but “the same MW + acres + capex deployed in advanced manufacturing creates 10–30x more permanent employment.” This frames the policy question as opportunity cost, not absolute waste.

5. Granger Causality Findings — Now Exploratory Only

Earlier dashboard versions reported county-level Granger tests for “DC announcements precede water stress.” On red-team review, the 11-year window (2015–2026) is borderline-underpowered for county-level testing, and 670 simultaneous tests at α=0.05 yields ~33 false positives by chance. The findings have been downgraded to exploratory and are no longer cited as confirmatory evidence. FDR correction (Benjamini-Hochberg) and pre-registration are V11 deliverables.

6. Moratorium Count Reconciliation

The DCSI tracks 62 moratoriums; datacenterbans.com tracks 222+ across 30 states. The gap is methodological:

  • DCSI counts: moratoriums passed (enacted) plus formally-introduced state bills with named sponsors. Excludes failed bills, dropped proposals, and informal council discussions.
  • datacenterbans.com counts: all moratoriums, ordinances, “in-progress” bills, formal opposition resolutions, study commissions, and informal pause requests. More comprehensive but lower bar.

Both are valid scopes; reading either headline number requires understanding the criteria. The DCSI 58 figure is the conservative estimate; 222 is the maximalist count. The dashboard’s reported number is footnoted with this distinction in the Community Resistance tab.

Entropy-Weighted Scoring

Rather than fixed weights (0.4/0.35/0.25), entropy weighting lets the data determine how much each component contributes to the composite score. Components with more variation across counties receive higher weights; components where all counties score similarly receive lower weights. This prevents a uniformly high-stress component from dominating the index without adding discriminatory power.

w_j = (1 - E_j) / Sum(1 - E_k), where E_j = -Sum p_ij ln(p_ij) / ln(n)

Both fixed and entropy-weighted composites are computed. The dashboard defaults to fixed weights (which are more interpretable for policy audiences) but allows toggling to entropy-weighted for analytical rigor.

Granger Causality Testing

Tests whether data center facility announcements (from FracTracker timeline data) Granger-cause changes in county-level water stress scores (from Aqueduct monthly data). If past facility announcements help predict future water stress beyond what water stress's own history predicts, this provides statistical evidence of a causal link between data center expansion and resource degradation.

F-test: H0: DC announcements do not Granger-cause water stress changes

Applied county-by-county where sufficient time-series data exists (2015 to 2026). Results reported as significant/not significant with lag selection via AIC.

Grading Methodology — Percentile Thresholds

Counties receive letter grades A through F based on their percentile rank within all 670 data-center-hosting counties:

A = 0–20th percentile  |  B = 20–40th  |  C = 40–60th  |  D = 60–80th  |  F = 80–100th

Percentile grading is a relative ranking, not an absolute threshold. An “A” grade means a county has lower composite stress than 80% of DC-hosting counties — it does not mean the county experiences no resource burden. Tier C data quality counties (where ≤1 of 4 scoring dimensions uses observed data) should be interpreted with particular caution; their grades are driven primarily by estimates and imputations.

What This Tool Does Not Do

Does not rank counties without facilities. Does not make causal claims. Does not generate regulatory findings. Informs judgment; it does not replace it.

Methodology Updates & Known Limitations

A running ledger of the choices, validations, and trade-offs that shaped each version of the index. Kept here so the methodology body stays focused on what the score is, not what it has been.

Weights are policy-weighted, not empirically derived

The primary weights (0.4/0.35/0.25 for RBS; 0.5/0.3/0.2 for CSI) reflect the analyst’s judgment about relative importance. PCA, entropy, and equal-weight validation runs all produce rankings highly correlated with the policy-weighted version (Spearman ρ > 0.96), but the choice of weights is ultimately a value judgment. The dashboard’s entropy-weighted toggle exposes a purely data-driven alternative.

V9 Energy — LBNL methodology

Estimated facility-level consumption uses LBNL methodology (MW × utilization × 8,760 hours). Utilization defaults to 50% per LBNL 2024 national average, adjusted by facility type (hyperscale 58%, colocation 50%, enterprise 43%). PUE sourced from operator reports where available (Google per-campus, Meta fleet 1.08, AWS per-region, Microsoft 1.16) or estimated from facility type. Replaces V8’s 100% utilization with flat PUE 1.3. EIA-923 generation data supplemented with an MW-based demand proxy for 89 counties with data centers but no local power plants. Renewable energy credit reduces energy burden for facilities reporting above-median renewable sourcing (up to −10%).

V8 Land — MW-derived acreage estimates

For 247 counties missing facility footprint data, land use is estimated using an industry heuristic of ~20 acres per 100 MW. Source acreage from PNNL and FracTracker is preferred where available.

V8 ROS — Single-rater scoring

All 6 regulatory variables across 42 jurisdictions were scored by a single rater (anna_claude). Inter-rater reliability pilot (Cohen’s Weighted Kappa > 0.60 per variable) deferred. Cronbach’s Alpha was dropped as a gate because the six variables are formative (independent dimensions), not reflective.

V8 ROS — COI integration

Company Opacity Index now acts as a transparency modifier on ROS: ROS × (1 + pushback_mod + COI_mod) where COI_mod = −0.05 × (1 − county_avg_coi). Counties whose operators are demonstrably more transparent receive up to a 5% reduction in regulatory opacity, recognizing that corporate transparency partially compensates for weak state regulation.

EES is informational, not part of CSI

The Energy Efficiency Score is currently displayed as an overlay only. It is not factored into the composite CSI. Integration into the RBS Energy sub-score is under evaluation for a future version.

CUSUM removed in V8

CUSUM change-point detection was removed because the available Aqueduct data covers a single year of seasonal variation, not a multi-year time series. Running CUSUM on 12 monthly values detected seasonal patterns (summer vs. winter), not genuine acceleration of water stress. The 38% flag rate confirmed this was noise, not signal. CUSUM has been replaced by the data quality tier system and facility-level mapping.

Grading is relative, not absolute

Percentile grading means exactly 20% of counties will always fall into each tier. There are no empirically validated “safe” or “critical” stress levels for data center hosting. Natural-breaks (Jenks) classification is under evaluation as a potential alternative.

V7 ERS — Wage Premium replaces raw wages

ERS now uses Wage Premium (DC industry average pay divided by county all-industry average pay) instead of raw wages_per_mw. Mean wage premium across DC counties is approximately 1.9x. Monte Carlo error propagation (1,000 iterations) produces 90% credible intervals with Tier A/B/C confidence flags.

Known Gap: Grid Interconnection Queue

Regional Transmission Organizations (RTOs) and Independent System Operators (ISOs) maintain queues of power generation and large-load interconnection requests. These queues reveal where data centers plan to draw hundreds of megawatts years before construction begins — LBNL estimates over 200 GW of data center capacity is currently waiting in U.S. interconnection queues. However, this data is fragmented across 7 RTOs (PJM, ERCOT, MISO, SPP, CAISO, NYISO, ISO-NE) and dozens of non-RTO utilities, each with different reporting formats, naming conventions, and update schedules. No free, unified public dataset exists. The DCSI currently uses EIA-923 for actual generation and MW estimates for demand, but cannot model the pipeline of future grid load from interconnection requests. FERC’s proposed large-load rule (finalization deadline April 30, 2026) may create a standardized reporting framework that makes this data tractable.

Known Gap: Behind-the-Meter Generation (~56 GW)

A growing number of data centers operate on-site natural gas turbines or fuel cells that are not connected to the grid and therefore invisible to federal energy data. The Clean Energy Buyers Alliance estimates that approximately 56 GW of behind-the-meter (BTM) gas generation is either planned or under construction for U.S. data centers, primarily in Texas, Georgia, and Virginia. These facilities bypass utility metering, EIA-923 reporting, and state public utility commissions. The DCSI’s energy burden calculations are based on grid-connected power data (EIA-923) and MW capacity estimates. BTM generation represents a significant and growing blind spot: counties hosting BTM-powered data centers may have their energy burden systematically underestimated. Until federal disclosure mandates (such as the Durbin S. 4213 or the Clean Cloud Act) create reporting requirements for BTM facilities, this gap cannot be closed with public data alone.

Reference

Data Sources

Last source scan: May 3, 2026 Latest audit cycle
May 2–3 audits added: 5 new megadeals (Nebius MO $6.26B, Crusoe MO $1.6B, Stream/Apollo NY $1.46B, Google OH 75% PILOT, JPMorgan NY $77M); 80–150 missing foreign-flagged facilities identified (Aligned, Compass, Centersquare, NTT Prince William, Mitsubishi Estate, Yondr, EDGNEX); 12 sovereign LP cap-table profiles documented; 3 governor updates (VA Spanberger, NJ Sherrill, SD Rhoden).
Pending integration: SUBSIDY_DATA aggregate JSON needs re-aggregation after Q1–Q2 2026 megadeals appended to GJF CSV (711 records total, was 706). Foreign-flagged facility CSV draft awaiting reviewer approval.
At risk: EPA GHGRP proposed rule to terminate the program entirely; Maine Data Center Advisory Council (formed via Mills EO after LD 307 veto) report due early 2027.
PNNL IM3 Data Center Atlas
FreeUpdated Feb 5, 2026
1,479 U.S. data center facility locations with county FIPS, coordinates, operator, and square footage. February 2026 update adds projected data center locations alongside existing sites, with a growth-projection component based on fiber, electricity, and water infrastructure analysis, a new forecasting layer not present in earlier versions.
Source: MSD Live / PNNL (im3.pnnl.gov/datacenter-atlas)Join key: state_id + county_id to FIPSFiles: 5Download
Epoch AI Data Center Dataset
FreeUpdated Apr 9โ€“10, 2026
Frontier AI data centers with verified MW capacity, H100-equivalent GPU counts (2.5M total, 13 US sites). Both the Frontier Data Centers CSV and Frontier Data Center Timelines CSV were refreshed April 9โ€“10, 2026. Five AI data centers are projected to hit 1 GW capacity in 2026, capital cost, and construction timelines. Tracks the largest AI compute installations including xAI Colossus (425 MW), OpenAI Stargate Abilene (590 MW), Anthropic-Amazon New Carlisle (1,092 MW), and Meta Prometheus (695 MW). Tier 1 source for MW corrections.
Source: epoch.ai (CC-BY licensed)Join key: Facility name + address geocodingFiles: 5 (centers, timelines, chillers, cooling towers, ZIP)Download
EIA-923 Power Plant Operations
FreeJanuary 2026 monthly available
Plant-level electricity generation, fuel consumption, and environmental data. January 2026 monthly release published March 24, 2026. Annual 2024 final data released September 2025; 2025 annual expected June 2026. Covers 2015 to 2026 across three schedule types for time-series energy analysis.
Source: eia.govJoin key: Plant ID to EIA-860 to CountyFiles: 32Download
EIA-860 Plant Location Crosswalk
Free
Maps every power plant to its county, latitude, longitude, and utility. Essential join table connecting EIA-923 generation data to county-level geography.
Source: eia.govJoin key: Plant Code to CountyFiles: 13Download
WRI Aqueduct 4.0 Water Risk Atlas
Free
Global water stress scores: baseline annual and baseline monthly (seasonal pattern analysis), plus future projections (2030/2050/2080 scenarios).
Source: World Resources InstituteJoin key: Spatial overlay to County FIPSFiles: 3 CSV + GDBDownload
BLS QCEW: NAICS 518210 & 5182
FreeQ3 2025 available
County-level employment and wage data for data processing/hosting (518210) and broader parent code (5182). Q3 2025 release (March 10, 2026) now available. Q4 2025 scheduled for June 2, 2026. Covers 2020 to Q3 2025 with all ownership types.
Source: bls.gov/cewJoin key: area_fipsFiles: 12Download
EPA FLIGHT: Emissions by Unit
Freeโš  AT RISK
Critical risk: EPA has proposed a rule to end the GHGRP entirely. If finalized, this data source, a foundational input to the Regulatory Opacity Score's diesel generator component, would disappear permanently. The 2025 reporting deadline has also been extended from March 31 to October 30, 2026, delaying the next data release. No facility-level data has been published since the last cycle. Monitor closely for DCSI implications. Unit-level greenhouse gas emissions including diesel generator hours from Subparts C, D, and AA.
Source: EPA GHGRPJoin key: Facility to CountyFiles: 1 (.xlsb)Download
FracTracker National DC Tracker
FreeNew dashboard, April 7, 2026
FracTracker launched a redesigned "Open U.S. Data Centers Tracker Dashboard" on April 7, 2026 with new data on project status, scale, and impacts on energy systems and communities. Covers 1,446+ facilities with status (657 proposed, 529 operating, 119 approved/under construction, 52 expanding, 46 suspended, 43 cancelled). Includes community pushback documentation (172 facilities flagged), NDA tracking, MW capacity, cooling type, and power source. April 2026 dashboard adds 44 new fields including resistance status, advocacy information, and source URLs.
Source: FracTracker Alliance / ArcGISJoin key: County + lat/lonFiles: 4 (main CSV + PEC Virginia + Sci4GA layers)Download
USGS NLCD Land Cover
Free
National Land Cover Database raster (1985 to 2023) for land cover change analysis. Requires zonal statistics processing with county shapefile to extract developed-land area per county.
Source: USGS / ScienceBaseJoin key: Raster to County polygon overlayFiles: 5 (992 MB TIF)Download
Census TIGER County Shapefile
Free
2025 county boundary polygons for all 3,200+ U.S. counties. Used for spatial joins (Aqueduct to County, NLCD to County).
Source: Census BureauJoin key: GEOID (FIPS)Files: 7Download
Congressional & State Legislators
Free
Federal legislators from @unitedstates project. State legislators from Open States (51 state files). Used for "Contact Your Rep" feature in county modal.
Source: github.com/unitedstates + Open StatesFiles: 52Download
MW Capacity Overrides (Web Research)
Curated
Hand-researched MW power capacity corrections for facilities where the default square-footage estimate diverges significantly from publicly reported values. Sources include utility interconnection filings, operator press releases, engineering reports (PASE, DPR), and data center tracking databases (Baxtel, Aterio, interconnection.fyi). This override layer exists because campus-level square footage often includes non-IT space (offices, cooling infrastructure, parking), causing the standard 6 MW/100K sqft conversion to overestimate by 3 to 20 times for hyperscale facilities.
Source: Multiple (utility filings, operator sites, trade press)Join key: Facility name + State + CountyFiles: mw_overrides_v2.csv (97 entries) + mw_web_research_v2.csv (452 entries)Priority: Overrides sqft-derived estimates in pipeline
Ownership Affiliation Overrides + Entity Normalization
CuratedUpdated May 2 2026
Hand-researched ownership corrections for 228 data center facilities where the original source listed operators as "Unknown" or "Other." Identifies the actual corporate operator, headquarters country, and flags foreign-owned entities. Sources include company press releases, SEC filings, utility interconnection records, and industry databases. Foreign-HQ operators identified include NTT (Japan), Cologix (Canada), Nebius/ex-Yandex (Netherlands), EdgeConneX/EQT (Sweden), Eneus Energy (UK), and MineOne (China). May 2026 entity normalization layer: OPERATOR_ALIASES map applied in pipeline step_14 to canonicalize 70+ visible name variants (Cielo Digital Infrastructure = Cielo Digital, Stack Infrastructure = Stack, Microsoft Corporation = Microsoft, AWS = Amazon, etc.) so no operator appears split across multiple records in dashboard tabulations.
Source: Multiple (company filings, press releases, industry databases)Join key: Facility name + State + CountyFiles: 1 (ownership_overrides.csv) + OPERATOR_ALIASES dict in step_14_export_json.pyPriority: Overrides "Unknown" operators + normalizes name variants
Proposed Data Centers Database (Curated)
CuratedUpdated May 3, 2026
777 proposed, approved, and under-construction data center facilities in the United States. Includes power source identification (grid, solar, nuclear, natural gas), expected completion dates, MW capacity, operator, and community impact notes. Nuclear-powered facilities are flagged for the Policy Impact Lens toggle. Derived from FracTracker Alliance data enriched with DCSI web research. May 3 audit added 20 facilities (Project Big Pine TX $3.5B, Beale De Soto KS $3B, Stack Berry Hill VA $73B/30yr, NorthMark expansion 9x, Cayuga TeraWulf NY, Fort Meade FL approved, Atlas Compute FL, Iron Mountain MIA-1, NextNRG Nassau, Cielo Polk + 10 more).
Source: FracTracker Alliance + DCSI web research (May 2026)Join key: Facility name + StateFiles: 1 (proposed_data_centers.csv)Nuclear flagged: 3 facilities
PNNL Sqft Supplement
Curated
Fills missing square footage values for 57 PNNL facilities using public filings, satellite imagery measurements, and operator disclosures. Enables MW estimation where only building footprints were available.
Files: pnnl_sqft_supplement.csv (57 entries)Priority: Fills gaps in PNNL Atlas
PNNL Exclusion List
Curated
Flags 9 PNNL entries that are not commercial data centers (government HPC, university research computing, or misidentified facilities). Government/academic facilities are excluded from ROS and ERS scoring but retained in RBS.
Files: pnnl_exclusion_list.csv (9 entries)Impact: Classification-based scoring exclusions
State Regulatory Index (V7)
Curated
6-variable state-level transparency index for 42 DC-hosting jurisdictions. Variables: permit_transparency, environmental_review, energy_disclosure, water_disclosure, tax_incentive_accountability, ownership_disclosure. Scored 0–3 per variable with statute citations. Mean index 0.220, range 0.056–0.556. Pipeline integration pending inter-rater reliability pilot.
Source: State statutes, administrative codes, regulatory docketsFiles: state_regulatory_index.csv (42 entries)Documentation: ROS_SCORING_SUMMARY_TIER1.md (10 states), ROS_SCORING_SUMMARY_TIER2.md (32 states)
MW Cross-Validation Audit
Audit
18-facility cross-validation comparing our researched MW values against independent sources (utility filings, industry databases). Tracks match rates, discrepancies, and items requiring human review for data quality assurance.
Files: mw_cross_validation.csv (18 entries)Purpose: Data quality verification
V8 Unified Facility Dataset
V8 Primary
5,151 raw facilities with 43 columns (V9.3 dedup: 4,984 unique). Merges PNNL Atlas + FracTracker with Company Opacity Index (142 companies), Energy Efficiency Scores, sustainability traits (power_source, renewable_pct, cooling_method), and data quality tiers (T1/T2/T3). Enriched with company sustainability reports, utility filings, and industry databases. 14 facility misattributions and 288 county-state mismatches corrected.
Files: v8/facility_traits_merged.csv (5,151 × 43)Coverage: 670 counties, 142 companies scoredKey metrics: Mean COI 0.744, Mean EES 0.547
V9 Energy Estimation Sources
V9.1
Facility-level electricity consumption estimates using LBNL 2024 utilization factors (50% national average, adjusted by type: hyperscale 58%, colocation 50%, enterprise 43%). PUE sourced from operator reports: Google per-campus (16 US sites, TTM PUE 1.04–1.14), Meta fleet (1.08, plus 15 facilities with real 2024 MWh data), AWS per-region (3 US regions, PUE 1.12–1.15), Microsoft (1.16). EIA-861 county commercial demand used for validation.
Source: LBNL 2024, Google/Meta/AWS sustainability reports, EIA-861Files: 6 CSV (google_pue, aws_pue_wue, meta_facility, utilization_factors, water_benchmarks, eia861_county)
V9 Water Estimation Sources
V9.1
Facility-level water consumption estimates using WUE benchmarks by cooling type: evaporative towers 1.8 L/kWh (475 gal/MWh), hybrid 0.9 L/kWh, direct evaporative 0.2 L/kWh, air-cooled 0. Based on Siddik et al. 2021/2022 and Uptime Institute benchmarks. 10 Meta facilities use real 2024 reported water data instead of estimates.
Source: Siddik et al. 2021, Uptime Institute, Meta 2024 Sustainability ReportFiles: water_consumption_benchmarks.csv
Sovereign Capital & Foreign Ownership Audit
DCSI OriginalMay 2 2026
Independent web research audit identifying U.S. data center facilities with non-U.S. sovereign capital exposure that the current 297-flag operator-HQ methodology misses. Estimates 80–150 additional facilities not in current foreign count due to: (1) sovereign-fund-of-fund LP exposure (AustralianSuper / DataBank, etc.); (2) pending acquisitions (Aligned/AIP $40B closing H1 2026 = +50 sites; SoftBank/DigitalBridge $4B closing H2 2026 = +20 sites); (3) crypto-to-AI conversions outside colocation taxonomy (IREN Childress 750MW, Bitmain US factory, Genesis Digital Assets 600MW TX). Catalogued 12 sovereign LP cap-table profiles (Mubadala, MGX, KIA, Temasek, GIC, CPP, Ontario Teachers’, AustralianSuper, SoftBank, Mitsubishi Estate, DAMAC EDGNEX, Macquarie). Findings power the Foreign Ownership tab’s Sovereign Capital Network and Cap-Table Tile sections.
Source: Bloomberg, FT, CNBC, Data Center Dynamics, Reuters, OpenAI announcements, Aligned, GIC newsroom, CPP, BlackRock IR, AGBI, TechCrunch, US DOJMethod: Control-weighted (>25% foreign sovereign or sovereign-fund-of-fund LP triggers a flag)Files: Embedded in dcsi-prototype-v9.html FO_SOVEREIGN_DATA + audit memo (33 entries)
Cancelled / Withdrawn / Moratoriums Tracker
CuratedUpdated May 4 2026
119 cancelled, suspended, withdrawn, or denied data center projects in 85 counties + 62 moratoriums across 18 states (city / county / township / utility / state-level). Combines FracTracker Alliance records (89 facilities) with 41 additional projects identified through DCSI independent research, including major Florida cancellations (Project Jarvis St. Lucie, Silver Fox Indiantown, Okee-One Okeechobee), Maine LD 307 statewide moratorium defeat (Apr 29 2026), the Wisconsin Cassville township 44-0 referendum, and the Apr 2026 wave (Apple Valley MN Oppidan, Coweta OK Project Atlas, Archbald PA Project Scott, Tulsa OK Phase 2). Status field, community pushback flag, advocacy details, and source URLs included. Powers the Community Resistance tab + clickable cancelled-project popups.
Source: FracTracker Alliance + DCSI web research (multiple May 2026 audit cycles)Files: data/cancelled_data_centers.csv (41 web-research rows) + data/moratoriums.csv (62 rows)Coverage: 85 counties / 18 states
Good Jobs First Subsidy Tracker
FreeUpdated May 3, 2026
Live in this dashboard: 706 GJF records / $16.16B disclosed (407 with disclosed values, 299 undisclosed). Top recipients: Amazon ($9.0B), Apple ($1.5B), Meta ($1.5B), Google ($1.4B), Microsoft ($684M). Pending integration (5 megadeals identified in DCSI May 2026 audit, not yet aggregated into the live SUBSIDY_DATA): Nebius MO ($6.26B over 20 yrs — largest disclosed data-center subsidy in U.S. history per Good Jobs First), Crusoe MO ($1.6B), Stream/Apollo STAMP NY ($1.46B), Google Franklin Furnace OH (75% PILOT), JPMorgan Orangetown NY ($77M). Once aggregated, the dashboard total moves to ~711 records / ~$25.5B+. GJF added a data-center-industry filter to its public Subsidy Tracker UI in 2026 + released “Cloudy with a Loss of Spending Control” report covering 6,884 awards across 25 states / DC / federal.
Source: Good Jobs First / Subsidy Tracker + DCSI May 2026 audit (KCUR, First Alert 4, Investigative Post, Reinvent Albany, NY Focus)Join key: State + Parent CompanyRecords (live): 706 (407 disclosed; 299 undisclosed) · 5 audit additions pending integrationView GJF
Total: 22 sources · All free and public

About This Project

Intelligence-grade analysis. Built in public. Under $500. The analyst never abdicated to the machine.

The Data Center Stress Index answers a question that should be simple: when a data center moves into a county, what does that county actually get in return?

Every county in the United States that hosts a data center is scored across three dimensions: the resource burden it bears (water consumption, grid load, and land use), the transparency of its regulatory environment (permitting visibility and diesel generator reliance), and the economic return it receives (local jobs and wages per megawatt of installed capacity). These three scores combine into a single composite index that ranks counties on a letter-grade scale from A (low burden, high return) to F (high burden, low return).

How It Works

The index draws from 22 public data sources, all free, totaling more than 1 GB of raw data. The V9.3 facility dataset (4,984 facilities across 50+ variables, deduplicated) merges PNNL, FracTracker (April 2026), and Epoch AI (April 2026) records with company-level transparency, efficiency metrics, and estimated energy and water consumption. Water stress scores come from the World Resources Institute's Aqueduct 4.0 atlas, spatially joined to county boundaries using Census TIGER shapefiles. Energy burden uses LBNL methodology to estimate facility-level consumption (MW × utilization × PUE × 8,760 hours), supplemented with EIA-923 plant-level generation where local power plants exist. Land use intensity is calculated from PNNL facility footprints divided by county total land area (Census TIGER ALAND), with MW-derived estimates for counties missing observed acreage. Employment and wage data come from the Bureau of Labor Statistics Quarterly Census of Employment and Wages (NAICS 518210 and all-industry), and regulatory opacity is derived from a 6-variable State Regulatory Index covering 42 jurisdictions, modulated by a Company Opacity Index transparency credit. Foreign ownership attribution uses a structured field from the V8 facility dataset, covering 297 foreign-flagged facilities across 14 countries.

Each component is normalized to a 0-to-1 scale and combined using expert weights validated against three alternative weighting schemes (PCA-derived, entropy, and equal-weight). The Resource Burden Score uses 40% water, 35% energy, and 25% land; PCA validation confirms water and energy dominate variance. The Regulatory Opacity Score uses a 6-variable State Regulatory Index covering environmental review, resource disclosure, tax incentive accountability, permitting openness, utility rate transparency, and ownership disclosure across 42 jurisdictions, modulated by a community pushback modifier. The Economic Return Score combines jobs per MW and a wage premium ratio (data center pay vs. county all-industry average), with Monte Carlo error propagation producing 90% credible intervals and confidence tiers. The composite index blends these three scores (50% burden, 30% opacity, 20% inverted return) to produce a single ranking. V8 added a Company Opacity Index (12-indicator corporate transparency) and Energy Efficiency Score (PUE, grid dependency, cooling impact) at the facility level. V9 replaced the flat energy estimation with LBNL type-adjusted utilization and operator-reported PUE, added WUE-based water consumption modeling (Siddik et al. 2021), and introduced a Good Jobs First subsidy tracker covering $16.2 billion in disclosed incentives.

Advanced Techniques

Beyond the core index, the tool applies several analytical methods to surface patterns that a simple ranking would miss. Moran’s I spatial autocorrelation identifies geographic clusters of high-stress counties, revealing regional infrastructure strain rather than isolated incidents. Entropy-weighted scoring offers a data-driven alternative to fixed weights, giving more influence to components that vary most across counties. Granger causality testing (pipeline-side) examines whether data center facility announcements statistically predict future changes in local energy generation patterns using EIA-923 time-series data. Every county is also assigned a data quality tier (A/B/C) so users can see how much of the score relies on observed versus imputed inputs.

Policy Scenarios

The Policy Impact Lens lets users simulate the effect of 12 real legislative proposals on county stress scores. Each policy is decomposed into the specific formula parameters it would affect, with percentage adjustments derived from bill text and regulatory intent. The policies span the full spectrum from federal moratoriums that would halt all new construction to executive orders that accelerate permitting on federal land. This is not prediction; it is structured scenario analysis designed to inform advocacy, journalism, and legislative debate.

Community Resistance Tracking

V9.3 introduced a Community Resistance tracker covering 119 cancelled, suspended, withdrawn, or denied data center projects and 62 moratoriums across 18 states (now including Florida). The cancelled projects data combines FracTracker Alliance records (89 facilities with community pushback, advocacy, and project cost fields) with 41 additional projects identified through independent research, including major Florida cancellations (Project Jarvis St. Lucie, Silver Fox Indiantown, Okee-One Okeechobee) and the Apr 2026 wave (Apple Valley MN, Coweta OK, Archbald PA, Tulsa OK Phase 2). Moratorium records span city, county, township, and state-level actions from California and Colorado to North Carolina and Florida. The dashboard surfaces this data in a dedicated Community Resistance tab with filterable tables, by-state summaries, and integration into county-level modal popups. The tool has 8 top-level tabs (DCSI Home, National, State, Policy, Subsidies, Foreign Ownership, Community Resistance, About) with the About tab consolidating Methodology, Data Sources, and AI Accountability under sub-toggles.

What This Tool Does Not Do

The DCSI does not rank counties that have no data center facilities. It does not make causal claims about whether data centers caused resource degradation. It does not generate regulatory findings or legal conclusions. It informs judgment; it does not replace it. Every data source is public, every formula is disclosed, and every assumption is documented in the Methodology page.

About the Author

The Data Center Stress Index is a project by Anna R. Dudley. The entire tool was built using publicly available data for under $500. No proprietary datasets. No gated APIs. No corporate sponsorship. The purpose is to put the same analytical capability in the hands of local officials, community advocates, and journalists that the industry's own lobbyists already have.

For questions, speaking inquiries, or data requests, contact Anna R. Dudley. Power moves before policy does.

Community Resistance Tracker

Communities Are Fighting Back

Data center projects worth tens of billions of dollars have been cancelled, suspended, or withdrawn after community opposition. Moratoriums are spreading. This page tracks what the data shows: when communities organize, they win.

--
Projects Cancelled / Suspended
--
With Community Pushback
--
Counties Affected
--
Moratoriums Enacted

Cancelled & Suspended Projects

Projects tracked from FracTracker Alliance and independent research. Filter by state or status.

--
Project County State Status Operator Cost Pushback

Data Center Moratoriums

Jurisdictions that have enacted, proposed, or considered restrictions on new data center development.

Jurisdiction State Type Status Date Duration

Active Litigation

Community lawsuits and environmental challenges currently in court. Each represents a procedural avenue communities are using when zoning and political channels close.

Filed Apr 29 2026 · MN
Stop the Hermantown Data Center v. City of Hermantown
Defendant: Google (project) · St. Louis County, MN
9 counts: closed-meeting violations, data-practices violations, arbitrary decision-making, inadequate environmental review of "environmentally sensitive area surrounded by rural residential homes." Phase 1 = $650M; $33.5M tax-abatement sought.
"The more people we have speak up to this, the more momentum we’re building." — Emma Richtman, plaintiff spokesperson
Source: Fox 21 News
Filed Apr 2026 · MD
Amazon Bauxite I Air Permit Challenge
Defendant: Amazon · Frederick County, MD (Adamstown)
Judicial review of MDE air-quality permit for 99 diesel generators. Allegations: MDE failed to consider health impacts from diesel particulate matter; site is ~1 mile from Carroll Manor Elementary; no HEPA filtration or real-time monitoring required.
~100 residents attended the December 2025 permit hearing.
Source: Strisker daily notes
Escalated May 1 2026 · VA
Digital Gateway / Prince William County Case
Appellant: QTS · Prince William County, VA
QTS appealed to the Virginia Supreme Court on May 1, 2026 after the Court of Appeals voided the Digital Gateway rezoning approval in March. Compass Datacenters formally exited Apr 29, 2026, withdrawing its portion of the project.
County spent $1.72M defending the original appeal.
Source: Washington Post
Filed Apr 29 2026 · IL
Yorkville Residents v. City of Yorkville & Pioneer Development
Defendant: Pioneer Development (Project Cardinal) · Kendall County, IL
Residents challenge annexation and rezoning of a 1,034-acre hyperscale campus on prime farmland. Counts include open-meetings violations, inadequate environmental review, and failure to assess cumulative groundwater draw on the deep Cambrian-Ordovician sandstone aquifer system that supplies most of northeastern Illinois (Kendall County draws from this system; the Mahomet aquifer is a separate sand-and-gravel formation 130 mi south in Champaign County).
Project Cardinal would be one of the largest single-campus footprints in the Midwest.
Source: Strisker daily notes
Filed Apr 14 2026 · N.D. Miss.
NAACP v. xAI (Colossus 2 power plant)
Defendant: xAI · DeSoto County, MS (Southaven gas-turbine plant powering Colossus 2 in Memphis)
NAACP National and the Mississippi State Conference of the NAACP (represented by SELC + Earthjustice) allege xAI built and operates a 114-acre gas plant with 27 unpermitted turbines (~495 MW) in Southaven without required Clean Air Act permits, powering the Colossus 2 supercomputer across the state line in Memphis. Filed April 14, 2026 in U.S. District Court for the Northern District of Mississippi. (The Mississippi Permit Board has subsequently granted xAI's request for up to 41 Southaven turbines post-filing.)
Major federal civil-rights / Clean Air Act challenge to an AI training campus’ behind-the-fence generation. A separate June 2025 TN federal Clean Air Act case + Shelby County Health Department permit appeal target the Memphis side.
Source: Earthjustice · NAACP press release

Resistance by State

Sources: FracTracker Alliance Data Centers Database (April 2026), independent web research, Good Jobs First moratorium tracker, DataCenterBans.com. This tracker is maintained as a public resource. For corrections or additions, contact Anna R. Dudley.

AI Accountability
Dedication

This page is dedicated to my dear friend and mentor, J.R. Four years ago, sitting in an old basement, you taught me how to set up my first virtual machine and how to install packages. In this new age of AI, you continue to teach the entire floor about the importance of never abdicating to the machine. Let AI work for you. Don’t let it think for you.

Errors Caught by the Analyst

Every score, label, formula and visualization in this dashboard was reviewed by a human before it shipped. The errors logged below were caught during that review — some by Anna, some by collaborators, none by automated tests. They are kept in public for the same reason climate scientists publish their model uncertainty: the only way to trust a number is to know how it was made.

If you find a number on this dashboard that looks wrong, that is the experience this page is supposed to make possible. Tell us, and you will be added to the log.

--
Total Errors Logged
--
Critical Severity
--
High Severity
--
Fixed
--
Open / Partial
What Counts as an Error

An “error” in this log is anything an AI assistant produced that would have shipped wrong if a human hadn’t caught it. That includes:

Scoring bugs
Wrong formulas, double-counted variables, normalization that destroyed the distribution, weights that didn’t sum to 1.
Data joins
FIPS code mismatches, county name collisions, units that weren’t converted, suppressed BLS values silently treated as zero.
Hallucinated facts
Plausible-sounding citations that didn’t exist, fabricated wage figures, statute numbers that weren’t real.
Visualization lies
Color scales that exaggerated, axes that were truncated without disclosure, sample counties presented as the full dataset.
Statistical overreach
CUSUM run on seasonal data, Granger causality reported without checking sample size, p-values without corrections.
Silent failures
Rows dropped from joins without warning, encoding errors that produced empty fields, defaults that masked missing data.
Errors by Severity
Errors by Category
Error Discovery Timeline
How These Errors Are Caught
  1. Read every output. Every script that produces a number is run, the output is opened, and at least one row is checked against the source data by hand. If the number can’t be reproduced from the source, it doesn’t ship.
  2. Spot-check known counties. Loudoun, Maricopa, Dallas, Santa Clara, Quincy — counties with publicly reported MW or water draw — are used as anchors. If the dashboard says Loudoun has 50 MW, something has gone wrong.
  3. Distribution sanity checks. Histograms of every score before and after normalization. If the distribution is flat, bimodal, or all values pile at 0/1, that’s a normalization bug.
  4. Cite-or-cut. Any factual claim in the methodology, error log, or county profile must trace back to a public source. Anything that can’t cite gets cut.
  5. Public log. Errors are written down here, in public, before they are fixed. The log is the receipt.
Why Public Error Logs Matter

Most data dashboards are presented as finished objects. The methodology page tells you the formula, the source, the date — and asks you to trust that the implementation matched. AI-assisted analysis breaks that assumption. Code that looks right can be wrong in ways that are completely invisible until you check the answer against reality.

Listing what went wrong — including the things that were embarrassing or that took weeks to find — is the only honest way to show the work. If a county’s grade changes between versions, the reason should be readable. If a formula was wrong for three weeks, the fact that it was wrong should be on the same page as the corrected number. That’s the contract.

Featured Batch · May 4 2026 Pre-Briefing Audit (ERR-062 → ERR-072)

11 errors caught the night before a governor briefing. Two parallel background-agent audits ran simultaneously: one reconciling every dashboard number against its source-of-truth file, the other fact-checking high-visibility external claims (lawsuit jurisdictions, capex figures, MW totals, dates) against authoritative sources.

The most exposed find was ERR-062: AI placed a high-profile NAACP v. xAI Clean Air Act lawsuit in Tennessee when the actual Apr 14 2026 federal filing is in the Northern District of Mississippi, targeting a 27-turbine gas plant in Southaven that powers the Colossus 2 supercomputer across the state line in Memphis. AI also fabricated a plaintiff (“Memphis Community Against Pollution” is not party to that filing). Both errors would have been visible to anyone in the room with knowledge of the case.

Other items in the batch: a ReferenceError that logged on every page load (ERR-063); a Resistance hero stat showing 46 moratoriums when the canonical count is 62 (ERR-064); five places that said “17 states” when the source CSV has 18 (ERR-065); a Subsidies card that quoted a forward projection as live data (ERR-066); the Manassas case appellant misidentified as “Multiple developers” when QTS is the named appellant (ERR-067); and four lower-severity stale-number / framing items (ERR-068 through ERR-072). All 11 fixed and re-deployed before the briefing. The audits cost three minutes of agent time and would have been weeks of editorial damage if they’d caught at the podium.

The Full Log

Each entry includes the error ID, severity, status, the description of what was wrong, and how it was fixed. Sorted newest first. Filterable by status above.

Severity Key Critical High Medium Low Color shown on each entry’s left rule