Factlen ExplainerAI TransparencyExplainerJun 29, 2026, 1:05 AM· 8 min read

Stanford AI Index: Frontier Model Transparency Score Plummets 30% as Top Labs Disclose Less

The 2026 Stanford AI Index Report reveals a sharp decline in industry transparency, with major developers increasingly hiding the training data and compute power behind their most capable models.

By Factlen Editorial Team

Share this story

Independent AI Researchers 40%Enterprise AI Adopters 35%Industry Analysts 25%

Independent AI Researchers: Advocates for rigorous auditing and public accountability in AI development.
Enterprise AI Adopters: Focuses on the operational and legal risks of deploying opaque systems.
Industry Analysts: Analyzes the competitive pressures and financial incentives driving corporate secrecy.

What's not represented

· Open-source developers who rely on published research to build community-driven alternatives.
· Artists and content creators whose copyrighted work may be hidden within undisclosed training datasets.

Why this matters

As artificial intelligence is rapidly integrated into healthcare, law, and enterprise workflows, the systems making these critical decisions are becoming impenetrable black boxes. Without transparency into how these models are trained and what data they consume, society is forced to blindly trust corporate safety claims, making independent auditing for bias, copyright, and security flaws virtually impossible.

Key points

The Foundation Model Transparency Index average score dropped from 58 in 2024 to 40 in 2025.
Major AI developers like Meta, Mistral, and OpenAI saw significant declines in their transparency scores.
Of the 95 notable AI models released over the past year, 80 were deployed without any accompanying training code.
IBM remains the sole outlier, achieving an unprecedented transparency score of 95 out of 100.
The shift toward secrecy is driven by a hyper-competitive capability race where margins between top models are razor-thin.
Independent researchers warn that this opacity makes third-party auditing for safety and bias virtually impossible.

40/100

2025 Average Transparency Score

95/100

IBM's Industry-Leading Score

14/100

Lowest Score (xAI & Midjourney)

Top Models Released Without Training Code

The artificial intelligence industry is currently navigating a profound and consequential paradox: as foundation models become exponentially more capable and deeply integrated into daily life, the internal mechanisms powering them are vanishing behind corporate walls. For the better part of a decade, the rapid advancement of machine learning was fueled by a radically open academic ethos. Researchers at top institutions and corporate labs freely shared their datasets, published their model architectures, and open-sourced their training methodologies, allowing the entire global community to iterate on each breakthrough. Today, that era of radical transparency is officially over. The industry has aggressively pivoted toward secrecy, prioritizing proprietary commercial advantage and security over public disclosure, fundamentally altering how artificial intelligence is developed and deployed.[6]

The starkest empirical evidence of this shift comes from the newly released 2026 Stanford AI Index Report, an exhaustive annual audit of the industry widely considered the most authoritative barometer of artificial intelligence trends. Embedded within its findings is the latest Foundation Model Transparency Index (FMTI), a comprehensive metric developed by a consortium of researchers from Stanford, MIT, and Princeton. The index evaluates major AI developers across 100 distinct indicators, measuring how openly they disclose details about their training data, compute power, risk mitigation strategies, and post-deployment usage. The results reveal a precipitous decline: in a single year, the industry's average transparency score plummeted from 58 out of 100 down to just 40. This 30 percent contraction is not a statistical anomaly; it represents a deliberate, structural shift by the world's most prominent AI laboratories.[1][2][3]

The retreat from openness is both broad and steep, affecting nearly every major player in the generative AI space. The Stanford researchers found that the most capable models on the market are now reliably the least transparent. Meta, which has historically championed open-science approaches and open-weight model releases, saw its transparency score drop dramatically from 60 to 31. Mistral, the European startup that built its reputation as the darling of the open-source AI community, plummeted from a score of 55 down to 18. OpenAI, the creator of ChatGPT, saw its score fall by 14 points as it continued to withhold the architectural details of its latest frontier models. The data confirms that as models become more commercially valuable and strategically sensitive, the laboratories building them are actively choosing to reveal less about how they actually function.[2][3][5]

The industry's average transparency score plummeted by 30 percent in a single year.

At the very bottom of the transparency index, the opacity is nearly total. Companies like xAI and the image-generation lab Midjourney tied for the lowest score in the assessment, each earning a mere 14 out of 100. Even newly assessed Chinese developers like DeepSeek and Alibaba, despite releasing highly capable open-weight models that rival American systems, scored poorly on transparency at 32 and 26, respectively. The sole, glaring outlier in this industry-wide race to the bottom is IBM. The legacy technology giant achieved an unprecedented score of 95 out of 100, setting a lonely precedent in an increasingly opaque field. IBM remains the only major player providing sufficient granular detail for external researchers to fully replicate its training processes, proving that high transparency is still technically feasible, even if it is commercially unpopular.[2][7]

Understanding the transparency deficit requires looking at exactly what information is being withheld. The opacity centers on the most critical, foundational ingredients of artificial intelligence development: the provenance of training data, the sheer volume of compute power utilized, the exact sizes of the datasets, and the total parameter counts of the neural networks. According to the Stanford report, of the 95 notable AI models released over the past year, 80 were deployed without any accompanying training code. The "recipe" for frontier intelligence is no longer treated as a shared scientific discovery; it is now guarded as a highly sensitive trade secret. While companies are still willing to publish benchmark scores showcasing their models' capabilities, the underlying data that dictates how those models reason, what biases they might harbor, and whose copyrighted material they may have ingested remains entirely hidden from public view.[4][5][7]

IBM remains the sole outlier in an industry-wide race toward opacity.

Understanding the transparency deficit requires looking at exactly what information is being withheld.

This pervasive secrecy is a direct byproduct of the hyper-competitive capability race currently consuming the technology sector. The performance gap between top-tier models has narrowed to razor-thin margins. As of early 2026, Anthropic's leading model edges out its closest global competitor by a mere 2.7 percent on standard evaluations. When billions of dollars in enterprise contracts, consumer subscriptions, and venture capital funding hinge on single-digit performance advantages, companies view the disclosure of their training methodologies as a catastrophic competitive risk. Revealing the specific data mixtures or architectural tweaks that yield a smarter model essentially hands a roadmap to well-funded rivals. Consequently, the financial incentives heavily favor silence, overriding the academic pressure to publish and share that defined the industry just a few years ago.[3][8]

However, this strategic opacity creates a massive, systemic vacuum for enterprise buyers, independent researchers, and policymakers. Organizations across the globe are rapidly integrating these models into critical, high-stakes workflows—from legal contract analysis and financial forecasting to medical diagnostics and autonomous customer service. They are doing so while flying largely blind, purchasing access to black-box systems without knowing what data the models ingested or what hidden vulnerabilities they might contain. Independent researchers warn that the line between "we cannot explain how the model works" and "we will not explain how it works" is becoming dangerously blurred. Without access to training data or parameter counts, third-party auditing for safety, copyright infringement, or fairness becomes virtually impossible, forcing society to rely entirely on the self-reported safety metrics of the corporations selling the technology.[4][5][8]

The authors of the Stanford AI Index characterize this dynamic not merely as a shift in corporate strategy, but as a fundamental governance infrastructure failure. In the early days of AI development, shared capability benchmarks became industry standards because the field exerted immense peer pressure on researchers to report them. However, no similar social or economic mechanism has materialized to force companies to maintain transparency. As artificial intelligence transitions from a niche research discipline into the foundational infrastructure of the global economy, the transparency deficit poses a critical challenge. It highlights a widening gap between the raw power of the technology and the ability of independent institutions to evaluate, govern, and understand it, setting the stage for inevitable clashes between AI developers and the regulators tasked with keeping them accountable.[1][5][6]

Beyond the mechanics of model training, the transparency deficit extends into the physical world, specifically regarding the environmental and societal impacts of artificial intelligence. The Stanford researchers noted that the thirteen major companies assessed in the index share little to no concrete information about the energy consumption, carbon footprint, or water usage required to build and deploy their flagship models. Training a frontier model in 2026 requires tens of thousands of specialized GPUs running continuously for months, drawing massive amounts of electricity from local grids. Yet, the exact environmental toll is routinely omitted from technical reports. This lack of disclosure prevents policymakers and climate scientists from accurately measuring the technology's true cost, leaving the public to guess at the physical resources required to sustain the cloud infrastructure powering their digital assistants.[1][2][5]

The 'recipe' for frontier intelligence is now treated as a closely guarded trade secret.

The collapse in voluntary transparency is actively reshaping the global regulatory landscape. Because the industry has demonstrated that it will not self-police disclosure when billions of dollars are at stake, governments are increasingly stepping in to mandate it. The European Union’s AI Act and various state-level frameworks in the United States are beginning to demand the exact types of documentation that AI labs have recently stopped publishing. Policymakers are realizing that they cannot regulate what they cannot measure. The Foundation Model Transparency Index serves as a crucial beacon in this regulatory push, identifying the specific areas—like training data provenance and post-deployment monitoring—that are most resistant to improvement without legal intervention. The era of trusting AI developers to grade their own homework is rapidly closing.[2][4][6]

The transparency crisis also exposes the growing semantic confusion around what it means for an AI model to be "open." Many companies generate positive public relations by releasing "open-weight" models, allowing developers to download and run the systems locally. However, the Stanford index reveals that providing model weights is not synonymous with true transparency. A company can release a highly capable open-weight model while simultaneously refusing to disclose the data it was trained on, the hardware used to build it, or the specific alignment techniques applied to make it safe. This creates a facade of openness; developers can use the tool, but they remain entirely ignorant of its internal construction. True open-source AI, which includes the training code and the datasets, is becoming exceedingly rare at the frontier of capability.[2][6][7]

The exact environmental toll of training frontier models is routinely omitted from technical reports.

Ultimately, the findings of the 2026 Stanford AI Index Report serve as a stark warning about the trajectory of the artificial intelligence industry. The technology is scaling faster than the institutions built to govern it, and the deliberate reduction in transparency only accelerates that divergence. If the current trend holds, the most consequential technological systems of the 21st century will operate as impenetrable black boxes, understood only by a shrinking circle of corporate engineers. Reversing this trend will require more than just academic pressure; it will demand a concerted effort from enterprise buyers demanding visibility, regulators enforcing disclosure, and independent evaluators continuing to shine a light on the industry's practices. Until then, the transparency deficit remains the defining vulnerability of the AI revolution.[1][5][8]

How we got here

October 2023
Stanford researchers launch the first Foundation Model Transparency Index, establishing a baseline for industry disclosure.
May 2024
The second edition of the index shows a brief improvement, with the average transparency score rising to 58 out of 100.
December 2025
The 2025 index reveals a sharp reversal, with the average score plummeting to 40 as top labs withhold training details.
April 2026
The 2026 Stanford AI Index Report officially characterizes the transparency deficit as a critical governance infrastructure failure.

Viewpoints in depth

Independent AI Researchers

Advocates for rigorous auditing and public accountability in AI development.

Academic institutions and independent evaluators argue that the lack of transparency is a critical vulnerability for society. They contend that without access to training data, parameter counts, and compute metrics, it is impossible to independently verify the safety claims made by AI developers. This camp views the shift toward secrecy not as a necessary business practice, but as a governance failure that prevents third-party auditing for bias, copyright infringement, and systemic risks.

Enterprise AI Adopters

Focuses on the operational and legal risks of deploying opaque systems.

For organizations integrating AI into critical workflows—such as healthcare, law, and finance—the transparency deficit represents a massive liability. Enterprise leaders and governance professionals argue that purchasing access to 'black-box' models exposes them to unknown legal and operational risks. They demand greater visibility into what data these models ingested and how they make decisions, warning that deploying systems they do not fully understand shifts the relationship from informed adoption to a blind alliance.

Industry Analysts

Analyzes the competitive pressures and financial incentives driving corporate secrecy.

Market analysts and industry observers point out that the retreat from transparency is a rational, if concerning, response to the hyper-competitive AI landscape. With the performance gap between top-tier models narrowing to razor-thin margins, revealing the 'recipe' for a frontier model is seen as a catastrophic competitive risk. This camp highlights that when billions of dollars in enterprise contracts and venture capital are at stake, the financial incentives heavily favor silence over the academic pressure to publish.

What we don't know

It remains unclear whether impending regulations like the EU AI Act will successfully force companies to reverse their stance on secrecy.
The exact environmental impact—including water and energy usage—of training the latest frontier models is still largely hidden from the public.
We do not know the full extent of copyrighted material that has been ingested into the training datasets of the most capable closed models.

Key terms

Foundation Model: A large-scale artificial intelligence system trained on vast amounts of data that can be adapted to a wide range of downstream tasks.
Parameter Count: The number of internal variables a neural network uses to make decisions; a key indicator of a model's size and complexity.
Open-Weight Model: An AI model where the final, trained architecture is available for download, but the underlying training data and code are kept secret.
Compute: The total amount of processing power and hardware resources required to train an artificial intelligence system.

Frequently asked

Why are AI companies becoming less transparent?

As the performance gap between top models narrows, companies view the disclosure of their training data and architectures as a competitive risk that could help rivals copy their work.

Which company scored the highest for transparency?

IBM scored an industry-leading 95 out of 100, making it the only major player to provide enough detail for external researchers to replicate its training processes.

Are open-weight models the same as open-source?

No. While open-weight models allow developers to download and use the system, they often do not include the training data or code required to truly understand how the model was built.

What information are AI labs hiding?

Major developers are increasingly withholding details about their training data sources, the amount of compute power used, the size of their datasets, and the environmental impact of their models.

Sources

[1]Stanford University HAIIndependent AI Researchers
The 2026 AI Index Report
Read on Stanford University HAI →
[2]Stanford CRFMIndependent AI Researchers
The 2025 Foundation Model Transparency Index
Read on Stanford CRFM →
[3]The State of AIIndustry Analysts
More Power, Less Transparency
Read on The State of AI →
[4]ComplexDiscoveryEnterprise AI Adopters
AI Now Scales Faster Than the Institutions Built to Govern It
Read on ComplexDiscovery →
[5]TechLetterIndustry Analysts
Stanford AI Index 2026: Capability Is Outrunning Every System Around It
Read on TechLetter →
[6]Factlen Editorial TeamIndependent AI Researchers
Synthesis by Factlen editorial team
Read on Factlen Editorial Team →
[7]HaystackIDEnterprise AI Adopters
Declining Transparency and Governance Challenges
Read on HaystackID →
[8]AIFODEnterprise AI Adopters
The Transparency Crisis in AI
Read on AIFOD →

Stay informed

Every angle. Every day.

Get ai stories with full source coverage and perspective breakdowns delivered to your inbox.

Get the briefing →Browse ai