OPEN SIGNAL
Briefings ·

Signal Briefing: May 24, 2026

Owned-capacity buildout is reshaping hyperscaler cost structures as the industry crosses from lease-heavy to campus-heavy infrastructure posture.

Hyperscalers Are Shifting From Lease-Heavy to Owned-Campus Infrastructure

The multi-year trajectory of hyperscaler capital allocation has moved from co-location and leased shell agreements toward ground-up owned campuses. This transition is visible in the capex composition disclosures across the Big Four: Alphabet, Microsoft, Meta, and Amazon have each reported step-change increases in gross property, plant, and equipment relative to operating lease liabilities over FY24–FY25. Owning the land, building shell, and power interconnect delivers lower 10-year total cost of occupancy — at the cost of longer lead times and upfront capital intensity.

Why this matters. The shift to owned capacity locks in geographic commitments years in advance, making hyperscaler infrastructure choices more path-dependent than in the co-lo era. Regions that secured fiber, water, and substation access early — northern Virginia, central Iowa, the Pacific Northwest — benefit from incumbency effects, while secondary markets must offer aggressive incentives to compete for the next wave.

Confidence: high — multi-year trend visible in 10-K property and lease disclosures across all four major hyperscalers.


HBM Supply Concentration Is the Binding Constraint on H100/H200-Class Deployment Timelines

High-bandwidth memory production for AI-class GPUs is concentrated among three vendors — SK Hynix, Micron, and Samsung — with SK Hynix holding the largest share of HBM3 and HBM3e capacity allocated to NVIDIA, per industry analyst coverage through 2024. Yield challenges at leading-edge DRAM node transitions mean allocation pressure cascades directly into GPU delivery schedules; a data center operator cannot swap in a substitute memory stack.

Why this matters. Because HBM is a co-packaged component, GPU supply is effectively HBM-supply-limited at peak demand. This creates a structural bottleneck that NVIDIA’s fabless model cannot resolve unilaterally — it depends on TSMC CoWoS advanced packaging capacity and DRAM vendor ramp rates simultaneously. Operators planning large-scale H-series deployments should model HBM allocation risk as a primary supply chain variable, not a secondary one.

Confidence: high — supply concentration figures and CoWoS dependency are well-documented in NVIDIA FY25 10-K supply chain disclosures and semiconductor analyst coverage through mid-2025.


Inference Token Pricing Has Compressed Into a Structurally Deflationary Regime

Spot pricing for inference on frontier-class models declined sharply across 2024 and into 2025 as both first-party (OpenAI, Anthropic, Google) and third-party inference providers (Together AI, Fireworks, Groq) expanded capacity. Public API pricing for GPT-4-class output tokens declined by roughly an order of magnitude between early 2023 and late 2024, per publicly posted rate cards. The deflationary vector is structural: model efficiency improvements (quantization, speculative decoding, MoE architectures) are compounding with hardware scale economies.

Why this matters. Falling inference prices are a demand accelerant — they expand the universe of economically viable AI-powered products — but they compress gross margins for inference-as-a-service providers who built cost structures around higher ASPs. The squeeze creates pressure to differentiate on latency, reliability, and fine-tuning rather than raw token throughput, and it accelerates the shift toward proprietary model weights as a moat.

Confidence: high — based on publicly posted API pricing pages and rate-card archives across major providers, 2023–2025.


Data Center Power Demand Is Outpacing Utility Grid Investment Cycles in Key U.S. Markets

The IEA’s 2024 electricity report estimated that data centers globally consumed approximately 200–250 TWh annually, with AI workload growth representing the fastest-expanding segment. In the U.S., utility interconnection queues — particularly in PJM and MISO territories — have extended to multi-year wait times, driven by the combination of renewable generation additions and large load interconnection requests from hyperscalers. The mismatch between the 18-to-36-month hyperscaler deployment cycle and the 5-to-10-year transmission build cycle is not resolvable in the near term.

Why this matters. Power availability — not land, permits, or fiber — is increasingly the first-order constraint on U.S. data center siting. This is pushing operators toward markets with surplus generation (natural gas-rich regions, stranded nuclear capacity), toward behind-the-meter generation agreements, and toward co-location with existing industrial power infrastructure. Utilities that can offer expedited large-load service agreements hold meaningful pricing power through the late 2020s.

Confidence: high — IEA 2024 Electricity Report figures; PJM interconnection queue data is public record.


Open-Weight Model Ecosystem Is Fragmenting the Enterprise Model Procurement Market

The release cadence of competitive open-weight models — Llama 3 family, Mistral variants, and a range of fine-tunable derivatives — has established a structural alternative to proprietary API procurement for enterprises with inference infrastructure. Crucially, open-weight deployment economics favor organizations with owned or reserved GPU capacity: the marginal cost of inference is power and hardware depreciation, not per-token API fees. This creates a bifurcated market where API-first pricing applies to SMB and developer segments while large enterprises increasingly evaluate build-vs-buy at the model layer.

Why this matters. Model commoditization at the open-weight tier increases switching costs for inference infrastructure itself (hardware, orchestration, serving stacks) while reducing them at the model layer. The competitive moat migrates up the stack toward proprietary fine-tuning data, RLHF pipelines, and evaluation infrastructure. For AI infrastructure investors, this dynamic favors picks-and-shovels plays — GPU clusters, networking, storage — over bets on proprietary model API revenue at the commodity end of the capability range.

Confidence: medium — structural analysis based on observed 2024 open-weight release patterns and public enterprise procurement discussions; specific adoption rates in 2025–2026 are extrapolated.

Get the signal in your inbox

Free. Sourced. AI-written. The AI buildout, daily.

No spam. Unsubscribe anytime.