142,889 sites across 241 countries, aggregated from 14 public trial registries and enrichment datasets. Open data, transparent methodology, free API — from the American Institute for Medical Research.
A unified, free database of clinical research sites — the places where clinical trials are actually run — built for pharma business development teams, CROs, academic researchers, health-policy analysts, and anyone who needs to know who's running what, where.
Sites from 14 trial registries are deduplicated and merged — not just U.S. CT.gov, but India's CTRI, Europe's EU CTIS and ISRCTN, Asia's ChiCTR / JPRN / KCT, and more.
Every site gets an A/B/C/D insight quality tier based on trial volume, recency, TA diversity, contact completeness, investigator depth, and data quality. Methodology is public.
Therapeutic area, country, tier, facility type, network, regulator, active status. Multi-select TA. Works on both map and table views.
Filtered results exportable as CSV. No account required.
Every data point carries its source registry. Additive integration model — new sources never overwrite existing verified data.
1. Registry scrape. Each national registry pulled via its public API or ICTRP aggregation, kept as a per-registry source file.
2. Multi-phase deduplication. Multiline cleanup, internal-source dedup, cross-registry match (name + city + country + coordinate clustering), and merge. Reduced ~166K raw to ~143K distinct sites.
3. Condition/TA enrichment. Title keyword extraction across 17 TA categories for registries that don't expose structured condition data.
4. Geocoding. Nominatim + Photon (Komoot) for building-level, city-centroid fallback. 93% mapped.
5. Quality scoring. Six-component weighted formula (trial volume 30%, recent activity 20%, condition diversity 15%, contact quality 15%, investigators 10%, data completeness 10%) → A/B/C/D tier.
6. Cross-source validation. External supplementary datasets add a second independent trial-count signal and catch sites we'd otherwise miss. Additive only — never overwrites.
Update cadence: monthly for each registry; continuous for derived enrichments. Every change is logged for reversibility.
There are commercial site directories with glossier UIs and sales teams. Most charge six-figure annual fees and lock the data behind a contract. Here's the honest picture:
| Feature | AIMR (this) | Commercial directories |
|---|---|---|
| Total sites | ~143K | typically 80K–200K |
| Countries covered | 241 | varies |
| Multi-registry (beyond CT.gov) | Yes — 14 sources | Varies |
| Indian CTRI coverage | ~38K sites | Varies |
| Cost | Free | ~$100K–$500K/yr |
| Open API | Yes, no auth | Contract-gated |
| Methodology transparency | Published | Proprietary |
| CSV export | Free | Gated |
| Ranking signal | Multi-factor tier + score | Proprietary |
Shortlist sites by indication, geography, trial history, and regulatory jurisdiction. Export CSV for CRM import.
Programmatic access via JSON API, country+TA filters, cross-source validation for feasibility work.
Citable, methodologically transparent dataset for clinical-trial ecosystem analysis.
Country-level regulator metadata, registry source tracking, regional distribution.
Explore the interactive map, query the API, or download a CSV. Reach out if you need richer fields, a data partnership, or to report issues.