Skip to content
FonteumThe Graph

The capability layer

APIREST + bulk accessMCP serverCallable by AI agentsFHIR R4 APIBulk exportAttestation & audit packReconciliationSource-vs-source diffsEntity graphSnapshotsPoint-in-time, bitemporal

By use case

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligence

By buyer

Compliance & riskDevelopers & AI teams

The differentiator

Coverage & sourcesThe catalogFreshnessMethodologyCare CompareFacility qualityBrowse all datasets →
Research

The dev on-ramp

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →

Platform

APIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots

Solutions

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams

Data

Coverage & sourcesFreshnessMethodologyCare CompareBrowse all datasets →
Research

Developers

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →
  1. All studies
  2. /Who runs the clinical trials: academia, not pharma, 2026
CARE QUALITY · ISSUE 085
clinicaltrials-govOriginal Research

Who runs the clinical trials: academia, not pharma, 2026

Of the 589,453 studies registered on ClinicalTrials.gov, 71.4% are run by academic and hospital sponsors and only 22.1% by industry — the registry of medical research is led by universities, not pharma. Yet industry runs 41.2% of phased drug-development trials, and 53.8% of all sponsors registered just one study.

BY FONTEUM RESEARCH BUREAU · JUNE 17, 2026 · 9 MIN READ · ASSERTED VIA SLSA L2REVIEWED BY DR. JENNIFER MONTECILLO, MDSNAPSHOT 2026-06-14 · DOI 10.5072/fonteum/clinical-trial-sponsor-concentration-2026 · LAST UPDATED JUNE 17, 2026
ClinicalTrials.gov · 2026-06-14
Reviewed by Dr. Jennifer Montecillo, MD, non-practicing medical reviewer. Gullas College of Medicine, 2019. Non-practicing medical reviewer focused on source interpretation, terminology, and limitations language. About our reviewers →
Reproduce this study →
Who sponsors clinical trials, by lead-sponsor class, 2026clinicaltrials-gov · 2026-06-14
Academic / hospital (Other)
71.4
Industry
22.1
Other government
2.6
NIH
2
Trial network
0.8
U.S. federal
0.8
Built on ClinicalTrials.gov · snapshot 2026-06-14 · reproducible · re-derive the figures yourself
Key findings
71.4%
of the 589,453 studies on ClinicalTrials.gov are run by academic, hospital, and foundation sponsors — the registry's 'Other' class — against 22.1% led by industry and 5.4% by government (NIH, other U.S. federal, and other government combined). The public ledger of medical research is led by universities, not pharma
clinicaltrials-gov · CMS
41.2%
of phased interventional trials — the regulated drug-development pipeline, Early Phase 1 through Phase 4 — are run by industry, nearly double its 22.1% share of all studies. The academic share falls from 71.4% to 51.7%. Industry concentrates where the regulatory stakes are highest
clinicaltrials-gov · CMS
26.2%
of the registry is run by just 100 sponsors — 0.2% of the 50,756 distinct lead sponsors. The top 1,000 sponsors run 62.7%; the top 10 alone run 6.0%. The registry is steeply concentrated at the head
clinicaltrials-gov · CMS
53.8%
of all 50,756 lead sponsors registered exactly one study, and 82.4% registered five or fewer — the median sponsor is a one-time registrant. Of the 100 most prolific sponsors, 79 are academic or hospital institutions, running 73.3% of all top-100 studies
clinicaltrials-gov · CMS
589,453
registered studies across 50,756 distinct lead sponsors make up the snapshot, dated 2026-06-14 and pulled from the ClinicalTrials.gov Data API v2; 76.3% interventional, 23.3% observational. Every figure is a count over published records — no investigator, sponsor, or trial is named, ranked, or scored
clinicaltrials-gov · CMS
On this page
Academia runs the registry, not pharmaThe inversion: who runs the drug-development pipelineConcentrated at the top, fragmented at the bottomThe most prolific sponsors are universitiesWhat one row actually isMethodologyLimitationsSources

ClinicalTrials.gov is the public ledger of medical research: every interventional and observational study a sponsor registers, with its phase, its conditions, its status — and the institution that runs it. Read for what is starting or stopping, the registry tells one story. Read for who is doing the registering and it tells another, one that cuts against the popular picture of clinical research as something pharmaceutical companies do. The sponsors on the registry are mostly not companies at all, and the few institutions that dominate the top of the list are mostly not pharma.

Academia runs the registry, not pharma

Of the 589,453 studies registered on ClinicalTrials.gov, 420,647 — 71.4% — are run by sponsors in the registry's "Other" class: universities, hospitals, and foundations. Industry runs 130,207, or 22.1%. Government sponsors of every kind combined run 5.4%.

Lead sponsor classStudiesShare of registry
Academic, hospital, foundation (Other)420,64771.4%
Industry130,20722.1%
Other government15,5642.6%
NIH11,5362.0%
Trial network4,9660.8%
U.S. federal (non-NIH)4,8900.8%

Source: ClinicalTrials.gov, registered studies by lead-sponsor class, snapshot 2026-06-14. The remaining ~1,600 studies are filed by individual sponsors or carry no coded class.

The registry codes each study's lead sponsor into a class, and the "Other" bucket — academic medical centers, universities, hospital systems, and research foundations — is more than three times the size of industry. The three government classes (the NIH, other U.S. federal agencies, and other governments) together account for 31,990 studies, 5.4%. By the count of who registers studies, clinical research is an academic and hospital activity, and has been throughout the registry's history.

A sponsor count is a record of who registers research, not a measure of its quality, size, or importance. A university running many small studies and a company running a few large trials are different shapes of the same enterprise — the count describes the shape, nothing more.

The inversion: who runs the drug-development pipeline

Narrow the count to phased interventional trials — the regulated pipeline that tests a drug or biologic in humans — and the picture flips toward industry. Of the 220,186 phased trials, industry runs 41.2%, nearly double its 22.1% share of all studies; the academic share falls from 71.4% to 51.7%.

Lead sponsor classPhased trialsShare of phased
Academic, hospital, foundation (Other)113,92651.7%
Industry90,72341.2%
NIH7,0493.2%
Other government3,9831.8%
Trial network2,7221.2%
U.S. federal (non-NIH)1,4120.6%

Source: ClinicalTrials.gov, phased interventional trials (Early Phase 1 through Phase 4) by lead-sponsor class, snapshot 2026-06-14.

This is the resolution of the apparent paradox. Academic sponsors run the most studies, but a large share of that volume is observational research and non-phased interventional work — behavioral, device, surgical, and procedural studies that never carry a drug-development phase label. Industry's footprint is the opposite shape: comparatively few studies overall, but heavily weighted toward the phased trials that move a candidate toward approval. Where the cost, the regulatory exposure, and the commercial stakes are highest, industry's share roughly doubles. "Pharma runs clinical trials" is true of the drug-development pipeline and false of the registry as a whole — and the two facts coexist in the same table.

Concentrated at the top, fragmented at the bottom

The registry is steeply concentrated among its most prolific sponsors. The top 100 sponsors — 0.2% of the 50,756 distinct lead sponsors — run 26.2% of all registered studies. The top 1,000 run 62.7%.

Sponsor cohortStudiesShare of registry
Top 10 sponsors35,5236.0%
Top 50 sponsors104,56917.7%
Top 100 sponsors154,54226.2%
Top 1,000 sponsors369,56462.7%
All 50,756 sponsors589,453100%

Source: ClinicalTrials.gov, share of registered studies run by the N most prolific lead sponsors, snapshot 2026-06-14.

That concentration sits on top of an enormous thin tail. 53.8% of all lead sponsors registered exactly one study, and 82.4% registered five or fewer — the median sponsor appears on the registry once and never again.

Sponsor sizeSponsorsShare of sponsors
Exactly one study27,32853.8%
Five or fewer studies41,79982.4%
All sponsors50,756100%

Source: ClinicalTrials.gov, distribution of lead sponsors by number of registered studies, snapshot 2026-06-14.

These are two faces of one distribution. A short head of standing research institutions — academic medical centers, large hospital systems, and the bigger pharmaceutical companies — registers studies year after year and accumulates thousands each. Behind them stretches a tail of more than 27,000 sponsors that registered a single study: a one-off investigator-initiated trial, a small clinic's pilot, a one-time collaboration. The registry is not a list of equals; it is a power law.

The most prolific sponsors are universities

Among the 100 most prolific lead sponsors, 79 are academic or hospital institutions, and they account for 73.3% of all studies run by the top 100. Only 16 of the top 100 are industry sponsors.

Class of the 100 most prolific sponsorsSponsorsTheir studiesShare of top-100 studies
Academic, hospital, foundation79113,21473.3%
Industry1631,51720.4%
NIH37,1114.6%
U.S. federal (non-NIH)11,7251.1%
Uncoded class19750.6%

Source: ClinicalTrials.gov, lead-sponsor class of the 100 most prolific sponsors, snapshot 2026-06-14.

The concentration at the head of the registry is academic, not industrial. The handful of industry sponsors that crack the top 100 are large and familiar pharmaceutical companies, but they are outnumbered nearly five to one by universities, national cancer centers, and hospital systems that each run thousands of studies. The institutions that register the most clinical research are the same kind of institutions that run the bulk of it — and they are overwhelmingly academic. Industry's weight shows up not at the very top of the sponsor list but in the type of trial it concentrates on, as the phased-pipeline table shows.

What one row actually is

Each row in clinical_trials is one registered study: its NCT identifier, brief and official titles, overall status, study type and phase, lead sponsor and sponsor class, enrollment count, conditions, start and completion dates, and the listed overall officials. The published registry is the current roster of registered studies as of the snapshot, and counting and grouping these rows by sponsor and sponsor class is the entire method here. A sponsor is identified by lead_sponsor_name; sponsor "class" is the registry's own lead_sponsor_class coding. Investigator and official names are present where ClinicalTrials.gov published them, but the name-to-entity-graph link is deferred and held, so no trial, sponsor, or investigator renders on any individual provider profile. Every figure in this study is a count or percentage at the sponsor-class or sponsor-cohort level. No investigator, sponsor, or trial is named, ranked, or scored.

Methodology

All figures are direct aggregations over the clinical_trials table, populated from the ClinicalTrials.gov Data API v2 — the U.S. National Library of Medicine's registry of interventional and observational clinical studies. The table holds 589,453 registered studies across 50,756 distinct lead sponsors; snapshot 2026-06-14; public, read-only; license per the ClinicalTrials.gov Terms & Conditions for NLM public data. Fonteum re-pulls the registry on snapshot publication, so figures advance with each refresh.

Sponsor-class shares are computed over the whole registry — every registered study, regardless of status — because the question is who registers research, not how it ends. The "academic / hospital" group is the registry's OTHER class; "industry" is INDUSTRY; "government" combines NIH, FED (other U.S. federal), and OTHER_GOV. The phased-pipeline cut restricts to interventional trials whose phase is one of the seven phased buckets (Early Phase 1 through Phase 4), the set of trials that test a drug or biologic in humans. Concentration figures rank lead sponsors by their number of registered studies and sum the top N; the one-study and five-or-fewer figures count sponsors by their study total. Because these are counts and ratios over a published registry, every figure is exact as of the snapshot rather than estimated. Methodology version: clinicaltrials/v1. The source-provenance contract is documented in the provenance methodology.

Limitations

  • A record of registration, not of quality or importance. A sponsor count measures how many studies an institution has registered. It is unrelated to the quality, size, cost, or scientific importance of that research, and unrelated to exclusion, sanction, or any assessment of an investigator's or provider's conduct. This study draws no inference about any party from a sponsor count.
  • Aggregate and category-level only. Every figure is a count or percentage at the sponsor-class or sponsor-cohort level. No investigator, sponsor, or individual trial is named, ranked, or scored, and the name-to-entity-graph link is deferred, so no trial renders on any provider profile.
  • The registry is global, not U.S.-only. ClinicalTrials.gov is the U.S. NLM registry mandated for federally funded and FDA-regulated research, but it lists studies sponsored worldwide. "Who runs the trials" here is a count over all registered sponsors, including non-U.S. institutions, not a United States–only tally.
  • Sponsor class is the registry's own coding. The academic / industry / government split follows the lead_sponsor_class field as ClinicalTrials.gov records it. A small number of studies carry no coded class or an individual sponsor; these are reported as their own rows and excluded from the three named groups.
  • Lead sponsor only; collaborators are not counted. Each study is attributed to its single lead sponsor. Industry-funded studies run under an academic lead sponsor, and collaborations, are counted under the lead sponsor — so industry's true financial footprint is wider than its lead-sponsor share.
  • Snapshot, not a trend model. Figures reflect the single registry snapshot dated 2026-06-14. The registry is refreshed on publication and grows continuously, so shares shift between releases; this study does not model change over time.

Sources

  • ClinicalTrials.gov — the U.S. National Library of Medicine registry behind every figure in this study.
  • ClinicalTrials.gov Data API v2 — the public API Fonteum pulls the registry snapshot from.
  • ClinicalTrials.gov — sponsor and collaborator definitions — the official definitions of lead sponsor and the sponsor classes used to group studies here.

The companion dataset page for ClinicalTrials.gov Studies lists the full schema and refresh cadence, and the clinical-trials data explorer exposes the underlying records. For the other half of the registry's story — where these trials stop — see the Phase 2 valley where clinical trials discontinue; for the drug-and-device lifecycle the industry pipeline feeds, the 2026 drug and device recall landscape; and for who funds the clinical economy these sponsors sit inside, the manufacturers behind industry payments to clinicians, the pharma payments tied to GLP-1 prescribing, the changing shape of Medicare enrollment, and who opts out of Medicare entirely.

Frequently asked questions

Who runs the most clinical trials — pharma or universities?
Universities and hospitals. Of the 589,453 studies registered on ClinicalTrials.gov, 71.4% are led by the registry's 'Other' class — academic institutions, hospitals, and foundations — against 22.1% led by industry and 5.4% by government. The popular image of clinical research as a pharma activity is the opposite of what the registry of record shows: by study count it is overwhelmingly an academic and hospital enterprise.
If academia runs most trials, why is industry associated with drug development?
Because the two run different kinds of trials. Restrict the count to phased interventional trials — the regulated pipeline that tests a drug or biologic in humans, Early Phase 1 through Phase 4 — and industry's share jumps from 22.1% of all studies to 41.2%, while the academic share falls from 71.4% to 51.7%. Industry concentrates in the high-stakes, expensive, regulator-facing pipeline; academic and hospital sponsors run a far larger share of observational and non-phased interventional work.
How concentrated is the clinical-trials registry?
Steeply, at the top. The 100 most prolific lead sponsors — 0.2% of the 50,756 distinct sponsors — run 26.2% of all registered studies; the top 1,000 run 62.7%, and the top 10 alone run 6.0%. A small number of large research institutions and companies account for a disproportionate share of the registered record.
Is the registry dominated by a few large sponsors, then?
It is concentrated and fragmented at the same time. While the top 100 sponsors run 26.2% of studies, 53.8% of all 50,756 lead sponsors registered exactly one study, and 82.4% registered five or fewer. The median sponsor is a one-time registrant. The registry is a short head of prolific institutions sitting on top of a very long tail of sponsors that appear once.
Does running many trials mean a sponsor is better or more important?
No. A sponsor count records who registers research — how many studies an institution has filed — not the quality, size, cost, or scientific importance of that research. A university with hundreds of small investigator-initiated studies and a company with a handful of large multi-site trials are different shapes of the same enterprise. This study counts and groups sponsors; it does not name, rank, or score any one of them, and draws no inference about any investigator or provider.
Are these all American trials?
No. ClinicalTrials.gov is run by the U.S. National Library of Medicine and is the registry U.S. law requires for federally funded and FDA-regulated research, but its coverage is global — it includes studies sponsored from outside the United States. So 'who runs the trials' here means who runs the studies on this registry worldwide, not a U.S.-only count. The sponsor-class and concentration patterns are computed over the registry as a whole.
Can I reproduce these figures?
Yes. Every number is a direct count over the public clinical_trials table — the ClinicalTrials.gov Data API v2 registry, snapshot 2026-06-14 — with no modeling. The exact SQL for the sponsor-class mix, the phased-pipeline inversion, the top-N concentration, the one-study tail, and the class composition of the top 100 sponsors is published in the reproducibility block below.

Who uses this data

The source data behind this study is public

Compliance teams, journalists, and researchers work from the same federal source families cited above — queried by NPI or facility identifier through Fonteum’s open dataset pages and API. Every figure traces to a frozen, downloadable snapshot you can reproduce yourself.

Browse ClinicalTrials.gov→Query the API →How we built this →

Datasets used

ClinicalTrials.gov→

Reproducibility

Every claim, reproducible

The SQL+
clinical-trial-sponsor-concentration-2026.sql
-- WHO RUNS the clinical-trials registry — the composition and concentration of
-- the sponsors behind every study on ClinicalTrials.gov. Fully reproducible.
--
-- Question: of the studies registered on ClinicalTrials.gov, who sponsors them?
-- What share is run by industry vs academic/hospital vs government sponsors,
-- how concentrated is the registry among the most prolific sponsors, and how
-- long is the tail of one-study sponsors? The lead figure: 71.4% of the
-- 589,453 registered studies are led by academic, hospital, and foundation
-- sponsors (the "Other" class), against 22.1% led by industry. A sponsor count
-- is a record of who registers research — NOT a quality, fraud, or wrongdoing
-- signal of any kind, and says nothing about any investigator, sponsor, or
-- provider.
--
-- Source:
--   public.clinical_trials — ClinicalTrials.gov Data API v2 (the U.S. National
--     Library of Medicine registry of interventional and observational
--     studies). 589,453 registered studies across 50,756 distinct lead
--     sponsors; snapshot 2026-06-14. Public, read-only. License:
--     ClinicalTrials.gov Terms & Conditions (NLM public data).
--     methodology_version = 'clinicaltrials/v1'.
--
-- Universe: this study reads the published registry AS A WHOLE — every row is a
--   study NLM lists on ClinicalTrials.gov as of the 2026-06-14 snapshot. The
--   registry is global in coverage: it includes trials sponsored from outside
--   the United States as well as U.S.-based research.
--
-- Counting note: 589,453 distinct NCT IDs, one row each; no row is duplicated.
--   "Sponsor" = lead_sponsor_name; sponsor "class" = lead_sponsor_class as the
--   registry codes it (OTHER = academic / hospital / foundation; INDUSTRY;
--   NIH; FED = other U.S. federal; OTHER_GOV; NETWORK; INDIV; etc.). No
--   individual investigator, sponsor, or provider is named, ranked, or scored
--   in the published study.

-- ============================================================================
-- (1) Universe reconciliation — the registry at a glance.
-- ============================================================================
SELECT
  count(*)                                                          AS studies,
  count(DISTINCT nct_id)                                            AS distinct_nct,
  count(DISTINCT lead_sponsor_name)                                 AS sponsors,
  count(*) FILTER (WHERE study_type = 'INTERVENTIONAL')             AS interventional,
  count(*) FILTER (WHERE study_type = 'OBSERVATIONAL')              AS observational,
  max(ingested_at)::date                                            AS snapshot
FROM public.clinical_trials;
--  studies 589,453 · distinct_nct 589,453 · sponsors 50,756
--  interventional 449,800 (76.3%) · observational 137,624 (23.3%) · snapshot 2026-06-14

-- ============================================================================
-- (2) HEADLINE: who runs the registry — share of all 589,453 studies by lead
--     sponsor class. Academic / hospital / foundation sponsors (OTHER) run
--     71.4%; industry runs 22.1%; the three government classes combined
--     (NIH + FED + OTHER_GOV) run 5.4%. The registry is led by universities and
--     hospitals, not pharma.
-- ============================================================================
SELECT
  coalesce(lead_sponsor_class, '(uncoded)')                         AS sponsor_class,
  count(*)                                                          AS studies,
  round(100.0 * count(*) / 589453, 1)                               AS pct_of_registry
FROM public.clinical_trials
GROUP BY sponsor_class
ORDER BY studies DESC;
--  OTHER     420,647 · 71.4%      INDUSTRY  130,207 · 22.1%
--  OTHER_GOV  15,564 ·  2.6%      NIH        11,536 ·  2.0%
--  NETWORK     4,966 ·  0.8%      FED         4,890 ·  0.8%
--  (uncoded)     975 ·  0.2%      INDIV         566 ·  0.1%
--  UNKNOWN        99 · ~0%        AMBIG           3 · ~0%
--  government combined (NIH + FED + OTHER_GOV) = 31,990 = 5.4%

-- ============================================================================
-- (3) THE INVERSION: who runs the regulated drug-development pipeline. Restrict
--     to PHASED interventional trials (Early Phase 1 .. Phase 4) — the trials
--     that test a drug or biologic in humans. Industry's share JUMPS from 22.1%
--     of all studies to 41.2% of phased trials; the OTHER class falls from
--     71.4% to 51.7%. Industry concentrates where the regulatory stakes are.
-- ============================================================================
SELECT
  coalesce(lead_sponsor_class, '(uncoded)')                         AS sponsor_class,
  count(*)                                                          AS phased_trials,
  round(100.0 * count(*) / sum(count(*)) OVER (), 1)                AS pct_of_phased
FROM public.clinical_trials
WHERE study_type = 'INTERVENTIONAL'
  AND phase IN ('EARLY_PHASE1','PHASE1','PHASE1/PHASE2','PHASE2',
                'PHASE2/PHASE3','PHASE3','PHASE4')
GROUP BY sponsor_class
ORDER BY phased_trials DESC;
--  OTHER     113,926 · 51.7%      INDUSTRY  90,723 · 41.2%
--  NIH         7,049 ·  3.2%      OTHER_GOV  3,983 ·  1.8%
--  NETWORK     2,722 ·  1.2%      FED        1,412 ·  0.6%
--  total phased interventional = 220,186

-- ============================================================================
-- (4) CONCENTRATION at the top — share of the registry run by the N most
--     prolific lead sponsors. The top 100 sponsors (0.2% of the 50,756) run
--     26.2% of all studies; the top 1,000 run 62.7%. "Top N" is a concentration
--     metric over the sponsor count distribution; no individual sponsor is
--     named.
-- ============================================================================
WITH s AS (
  SELECT count(*) AS c FROM public.clinical_trials GROUP BY lead_sponsor_name
),
r AS (SELECT c, row_number() OVER (ORDER BY c DESC) AS rn FROM s)
SELECT
  count(*)                                          AS sponsors,
  sum(c)                                            AS studies,
  sum(c) FILTER (WHERE rn <= 10)                    AS top10_studies,
  round(100.0 * sum(c) FILTER (WHERE rn <= 10)   / sum(c), 1)  AS top10_pct,
  sum(c) FILTER (WHERE rn <= 50)                    AS top50_studies,
  round(100.0 * sum(c) FILTER (WHERE rn <= 50)   / sum(c), 1)  AS top50_pct,
  sum(c) FILTER (WHERE rn <= 100)                   AS top100_studies,
  round(100.0 * sum(c) FILTER (WHERE rn <= 100)  / sum(c), 1)  AS top100_pct,
  sum(c) FILTER (WHERE rn <= 1000)                  AS top1000_studies,
  round(100.0 * sum(c) FILTER (WHERE rn <= 1000) / sum(c), 1)  AS top1000_pct
FROM r;
--  sponsors 50,756 · studies 589,453
--  top10 35,523 (6.0%) · top50 104,569 (17.7%)
--  top100 154,542 (26.2%) · top1000 369,564 (62.7%)

-- ============================================================================
-- (5) FRAGMENTATION at the bottom — the long tail. 53.8% of all lead sponsors
--     registered exactly one study; 82.4% registered five or fewer. The median
--     sponsor is a one-time registrant. Concentration at the top and a vast
--     thin tail are two faces of the same distribution.
-- ============================================================================
WITH s AS (
  SELECT count(*) AS c FROM public.clinical_trials GROUP BY lead_sponsor_name
)
SELECT
  count(*)                                          AS sponsors,
  count(*) FILTER (WHERE c = 1)                      AS one_study,
  round(100.0 * count(*) FILTER (WHERE c = 1) / count(*), 1)   AS one_study_pct,
  count(*) FILTER (WHERE c <= 5)                     AS five_or_fewer,
  round(100.0 * count(*) FILTER (WHERE c <= 5) / count(*), 1)  AS five_or_fewer_pct
FROM s;
--  sponsors 50,756 · one_study 27,328 (53.8%) · five_or_fewer 41,799 (82.4%)

-- ============================================================================
-- (6) WHO is at the top — the CLASS composition of the 100 most prolific
--     sponsors (aggregate; no sponsor named). 79 of the 100 are academic /
--     hospital / foundation sponsors, and they account for 73.3% of all
--     top-100 studies. The concentration at the head of the registry is
--     academic, not industrial.
-- ============================================================================
WITH s AS (
  SELECT lead_sponsor_class, count(*) AS c
  FROM public.clinical_trials GROUP BY lead_sponsor_name, lead_sponsor_class
),
r AS (SELECT lead_sponsor_class, c, row_number() OVER (ORDER BY c DESC) AS rn FROM s)
SELECT
  coalesce(lead_sponsor_class, '(uncoded)')         AS sponsor_class,
  count(*)                                          AS sponsors_in_top100,
  sum(c)                                            AS their_studies,
  round(100.0 * sum(c) / 154542, 1)                 AS pct_of_top100
FROM r
WHERE rn <= 100
GROUP BY sponsor_class
ORDER BY their_studies DESC;
--  OTHER 79 · 113,214 · 73.3%      INDUSTRY 16 · 31,517 · 20.4%
--  NIH    3 ·   7,111 ·  4.6%      FED       1 ·  1,725 ·  1.1%
--  (uncoded) 1 · 975 · 0.6%
The snapshot+
dataset_idclinicaltrials-gov
snapshot_date2026-06-14
sha256
doi10.5072/fonteum/clinical-trial-sponsor-concentration-2026
slsa_provenance_url
The JOINs+
universe: the registry as a whole                                                 -- 589,453 registered studies, 50,756 distinct lead sponsors, snapshot 2026-06-14
sponsor-class mix: GROUP BY lead_sponsor_class                                     -- OTHER (academic/hospital) 71.4%, INDUSTRY 22.1%, gov (NIH+FED+OTHER_GOV) 5.4%
drug-development pipeline = INTERVENTIONAL and phase in the 7 phased buckets        -- 220,186 trials; INDUSTRY 41.2% (vs 22.1% of all), OTHER 51.7%
concentration: GROUP BY lead_sponsor_name, rank by study count                      -- top 10 6.0%, top 100 26.2%, top 1,000 62.7%
fragmentation: sponsors with exactly 1 study / with <= 5                            -- 27,328 (53.8%) one-study; 41,799 (82.4%) five or fewer
top-100 composition: lead_sponsor_class of the 100 most prolific sponsors           -- 79 academic/hospital (113,214 studies, 73.3% of top-100), 16 industry
The pipeline version+
git_sha
slsa_provenance
methodology_versionclinicaltrials/v1

Reproduce this

Run the exact query against the frozen 2026-06-14.

-- WHO RUNS the clinical-trials registry — the composition and concentration of -- the sponsors behind every study on ClinicalTrials.gov. Fully reproducible. -- -- Question: of the studies registered on ClinicalTrials.gov, who sponsors them? -- What share is run by industry vs academic/hospital vs government sponsors, -- how concentrated is the registry among the most prolific sponsors, and how -- long is the tail of one-study sponsors? The lead figure: 71.4% of the -- 589,453 registered studies are led by academic, hospital, and foundation -- sponsors (the "Other" class), against 22.1% led by industry. A sponsor count -- is a record of who registers research — NOT a quality, fraud, or wrongdoing -- signal of any kind, and says nothing about any investigator, sponsor, or -- provider. -- -- Source: -- public.clinical_trials — ClinicalTrials.gov Data API v2 (the U.S. National -- Library of Medicine registry of interventional and observational -- studies). 589,453 registered studies across 50,756 distinct lead -- sponsors; snapshot 2026-06-14. Public, read-only. License: -- ClinicalTrials.gov Terms & Conditions (NLM public data). -- methodology_version = 'clinicaltrials/v1'. -- -- Universe: this study reads the published registry AS A WHOLE — every row is a -- study NLM lists on ClinicalTrials.gov as of the 2026-06-14 snapshot. The -- registry is global in coverage: it includes trials sponsored from outside -- the United States as well as U.S.-based research. -- -- Counting note: 589,453 distinct NCT IDs, one row each; no row is duplicated. -- "Sponsor" = lead_sponsor_name; sponsor "class" = lead_sponsor_class as the -- registry codes it (OTHER = academic / hospital / foundation; INDUSTRY; -- NIH; FED = other U.S. federal; OTHER_GOV; NETWORK; INDIV; etc.). No -- individual investigator, sponsor, or provider is named, ranked, or scored -- in the published study. -- ============================================================================ -- (1) Universe reconciliation — the registry at a glance. -- ============================================================================ SELECT count(*) AS studies, count(DISTINCT nct_id) AS distinct_nct, count(DISTINCT lead_sponsor_name) AS sponsors, count(*) FILTER (WHERE study_type = 'INTERVENTIONAL') AS interventional, count(*) FILTER (WHERE study_type = 'OBSERVATIONAL') AS observational, max(ingested_at)::date AS snapshot FROM public.clinical_trials; -- studies 589,453 · distinct_nct 589,453 · sponsors 50,756 -- interventional 449,800 (76.3%) · observational 137,624 (23.3%) · snapshot 2026-06-14 -- ============================================================================ -- (2) HEADLINE: who runs the registry — share of all 589,453 studies by lead -- sponsor class. Academic / hospital / foundation sponsors (OTHER) run -- 71.4%; industry runs 22.1%; the three government classes combined -- (NIH + FED + OTHER_GOV) run 5.4%. The registry is led by universities and -- hospitals, not pharma. -- ============================================================================ SELECT coalesce(lead_sponsor_class, '(uncoded)') AS sponsor_class, count(*) AS studies, round(100.0 * count(*) / 589453, 1) AS pct_of_registry FROM public.clinical_trials GROUP BY sponsor_class ORDER BY studies DESC; -- OTHER 420,647 · 71.4% INDUSTRY 130,207 · 22.1% -- OTHER_GOV 15,564 · 2.6% NIH 11,536 · 2.0% -- NETWORK 4,966 · 0.8% FED 4,890 · 0.8% -- (uncoded) 975 · 0.2% INDIV 566 · 0.1% -- UNKNOWN 99 · ~0% AMBIG 3 · ~0% -- government combined (NIH + FED + OTHER_GOV) = 31,990 = 5.4% -- ============================================================================ -- (3) THE INVERSION: who runs the regulated drug-development pipeline. Restrict -- to PHASED interventional trials (Early Phase 1 .. Phase 4) — the trials -- that test a drug or biologic in humans. Industry's share JUMPS from 22.1% -- of all studies to 41.2% of phased trials; the OTHER class falls from -- 71.4% to 51.7%. Industry concentrates where the regulatory stakes are. -- ============================================================================ SELECT coalesce(lead_sponsor_class, '(uncoded)') AS sponsor_class, count(*) AS phased_trials, round(100.0 * count(*) / sum(count(*)) OVER (), 1) AS pct_of_phased FROM public.clinical_trials WHERE study_type = 'INTERVENTIONAL' AND phase IN ('EARLY_PHASE1','PHASE1','PHASE1/PHASE2','PHASE2', 'PHASE2/PHASE3','PHASE3','PHASE4') GROUP BY sponsor_class ORDER BY phased_trials DESC; -- OTHER 113,926 · 51.7% INDUSTRY 90,723 · 41.2% -- NIH 7,049 · 3.2% OTHER_GOV 3,983 · 1.8% -- NETWORK 2,722 · 1.2% FED 1,412 · 0.6% -- total phased interventional = 220,186 -- ============================================================================ -- (4) CONCENTRATION at the top — share of the registry run by the N most -- prolific lead sponsors. The top 100 sponsors (0.2% of the 50,756) run -- 26.2% of all studies; the top 1,000 run 62.7%. "Top N" is a concentration -- metric over the sponsor count distribution; no individual sponsor is -- named. -- ============================================================================ WITH s AS ( SELECT count(*) AS c FROM public.clinical_trials GROUP BY lead_sponsor_name ), r AS (SELECT c, row_number() OVER (ORDER BY c DESC) AS rn FROM s) SELECT count(*) AS sponsors, sum(c) AS studies, sum(c) FILTER (WHERE rn <= 10) AS top10_studies, round(100.0 * sum(c) FILTER (WHERE rn <= 10) / sum(c), 1) AS top10_pct, sum(c) FILTER (WHERE rn <= 50) AS top50_studies, round(100.0 * sum(c) FILTER (WHERE rn <= 50) / sum(c), 1) AS top50_pct, sum(c) FILTER (WHERE rn <= 100) AS top100_studies, round(100.0 * sum(c) FILTER (WHERE rn <= 100) / sum(c), 1) AS top100_pct, sum(c) FILTER (WHERE rn <= 1000) AS top1000_studies, round(100.0 * sum(c) FILTER (WHERE rn <= 1000) / sum(c), 1) AS top1000_pct FROM r; -- sponsors 50,756 · studies 589,453 -- top10 35,523 (6.0%) · top50 104,569 (17.7%) -- top100 154,542 (26.2%) · top1000 369,564 (62.7%) -- ============================================================================ -- (5) FRAGMENTATION at the bottom — the long tail. 53.8% of all lead sponsors -- registered exactly one study; 82.4% registered five or fewer. The median -- sponsor is a one-time registrant. Concentration at the top and a vast -- thin tail are two faces of the same distribution. -- ============================================================================ WITH s AS ( SELECT count(*) AS c FROM public.clinical_trials GROUP BY lead_sponsor_name ) SELECT count(*) AS sponsors, count(*) FILTER (WHERE c = 1) AS one_study, round(100.0 * count(*) FILTER (WHERE c = 1) / count(*), 1) AS one_study_pct, count(*) FILTER (WHERE c <= 5) AS five_or_fewer, round(100.0 * count(*) FILTER (WHERE c <= 5) / count(*), 1) AS five_or_fewer_pct FROM s; -- sponsors 50,756 · one_study 27,328 (53.8%) · five_or_fewer 41,799 (82.4%) -- ============================================================================ -- (6) WHO is at the top — the CLASS composition of the 100 most prolific -- sponsors (aggregate; no sponsor named). 79 of the 100 are academic / -- hospital / foundation sponsors, and they account for 73.3% of all -- top-100 studies. The concentration at the head of the registry is -- academic, not industrial. -- ============================================================================ WITH s AS ( SELECT lead_sponsor_class, count(*) AS c FROM public.clinical_trials GROUP BY lead_sponsor_name, lead_sponsor_class ), r AS (SELECT lead_sponsor_class, c, row_number() OVER (ORDER BY c DESC) AS rn FROM s) SELECT coalesce(lead_sponsor_class, '(uncoded)') AS sponsor_class, count(*) AS sponsors_in_top100, sum(c) AS their_studies, round(100.0 * sum(c) / 154542, 1) AS pct_of_top100 FROM r WHERE rn <= 100 GROUP BY sponsor_class ORDER BY their_studies DESC; -- OTHER 79 · 113,214 · 73.3% INDUSTRY 16 · 31,517 · 20.4% -- NIH 3 · 7,111 · 4.6% FED 1 · 1,725 · 1.1% -- (uncoded) 1 · 975 · 0.6%

Cite this study

Citation-ready for researchers and AI.

Fonteum Research Bureau (2026). Who runs the clinical trials: academia, not pharma, 2026. ClinicalTrials.gov, snapshot 2026-06-14. https://fonteum.com/research/clinical-trial-sponsor-concentration-2026

Check the chain

Each figure is snapshot-attested — re-derive the hash from the federal file.

1
Snapshot
clinicaltrials-gov · 2026-06-14
2
Field hash
SHA-256 a3f1c9…7e6b
3
Signed
Ed25519 · verifiable
✓ Chain signed · check it in Attest →

Related studies

  • CARE QUALITY · JUN 2026Where clinical trials stop: the Phase 2 valley, 2026Of the 373,998 clinical trials on ClinicalTrials.gov that have reached a final outcome, 51,959 — about 1 in 7 — ended in discontinuation rather than completion. Attrition is uneven: 23.0% of settled Phase 2 trials stop early, against 14.7% at Phase 1, the signature of the efficacy-stage valley where a candidate must first prove it works.
  • CARE QUALITY · JUN 2026How serious is the average FDA recall? About one in eleven is the most serious tierOf 56,777 FDA drug and device recalls on file, 5,237 — 9.2%, about one in eleven — carry Class I, the agency's most serious tier, signaling a reasonable probability of serious injury or death. Devices drive 68.9% of recalls; 99.4% are voluntary, company-initiated withdrawals, not FDA-ordered.
  • FINANCIAL DISTRESS · JUN 2026Which companies pay U.S. doctors the most? Device makers, not pharmaIn 2024, drug and device companies disclosed $3.31 billion in general payments to U.S. physicians under the Sunshine Act — and the largest payers are device makers, not pharma. BioNTech led at $180.6 million from just 164 royalty payments; the top 25 of 1,763 reporting companies account for 52% of every general-payment dollar.
  • FINANCIAL DISTRESS · JUN 2026GLP-1 makers paid 120,237 Medicare prescribers $32.8 million in 2024 — and the prescribers they paid wrote far moreIn 2024, the makers of Ozempic and Mounjaro paid $32.8 million to 120,237 Medicare prescribers for meals, talks, and travel tied to GLP-1 drugs. Prescribers who accepted a payment wrote 78% more GLP-1 prescriptions than those who did not — a correlation this study reports at the group level, without inferring cause.
  • WORKFORCE · JUN 2026Who is enrolled in Medicare? The nurse practitioner is now the most common clinician413,539 nurse practitioner enrollments make NPs the single most common clinician type in Medicare's provider-enrollment file — 13.9% of all 2.98 million PECOS records, nearly triple the largest physician specialty. Together, NPs and physician assistants are one in five enrollments. Advanced-practice providers now anchor the Medicare workforce.

Federal source citations

  1. [1]ClinicalTrials.gov · snapshot 2026-06-14 · federal source family · US-Government-Works
Dataset catalog →Source registry →Methodology →Chain integrity →All research →Provider lookup →

Fonteum Research · June 17, 2026 · All figures trace to the frozen federal-data snapshot cited above.

The substrate, by the numbers

44federal source familiesDistinct CMS, OIG, HRSA, FDA and peer datasets
35dataset pagesCitable, downloadable /data catalog pages
65reproducible studiesEach shipping the SQL behind its figures

Built on the authoritative federal record

The primary sources, named on every page.

These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.

  • CMS
  • HHS-OIG
  • HRSA
  • FDA
  • NLM
  • NUCC
  • Census
  • BLS
  • BEA

See the full source registry, with license and refresh cadence for each →

Reproducible by design

Every figure traces to its federal source.

14-tuple provenance

Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.

Reproducible SQL

Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.

Daily reconciliation

Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.

Named medical review

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

Read the full provenance and attestation methodology →

Two doors

Use the free API and open data

Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.

Explore the API →Browse the data catalog →

Talk to us

Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.

Talk to us →
Fonteum
Platform
Platform overviewAPIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots
Solutions
All solutionsExclusion & sanctions screeningCredentialing & enrichmentAudit evidenceProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams
Data & sources
Coverage & sourcesBrowse all datasetsFreshnessMethodologyCare CompareSanctionsOwnershipStaffingDeficienciesSpecial Focus Facilities
Developers
Developer hubDocsAPI referenceQuickstartStatusChangelogSDKs & integrationsWebhooks
Research
Research hubGlossaryComparisonsCitationsWhy Fonteum
Company
AboutPressCustomersPricingContactEditorial policyCorrections
Trust & legal
TrustQualitySecurityPrivacy policyTerms of serviceMedical disclaimer

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

© 2026 Fonteum LLC. All rights reserved.

·hello@fonteum.com

The U.S. healthcare graph AI can cite — every fact carries its source.

Request access→