Fonteum: The Graph

Built on the authoritative federal record

The primary sources, named on every page.

These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.

  • CMS
  • HHS-OIG
  • HRSA
  • FDA
  • NLM
  • NUCC
  • Census
  • BLS
  • BEA

See the full source registry, with license and refresh cadence for each →

Reproducible by design

Every figure traces to its federal source.

14-tuple provenance

Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.
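
As a rough illustration of how such a record can be checked, the sketch below models only the five fields named above and recomputes the SHA-256 over a row. The class name, the function names, and the choice to hash a canonical JSON serialization of the row are assumptions made for this example, not Fonteum's published schema or hashing rule.

```python
import hashlib
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Provenance:
    # Only the five fields named on this page; the remaining fields of the
    # full record are not listed here, so they are left out (assumption).
    source_url: str
    dataset_id: str
    snapshot_date: str  # ISO 8601 date string
    row_key: str
    sha256: str  # hex digest of the row as serialized by canonical_bytes()


def canonical_bytes(row: dict) -> bytes:
    """Serialize a row deterministically: sorted keys, no extra whitespace."""
    return json.dumps(row, sort_keys=True, separators=(",", ":")).encode("utf-8")


def attest(row: dict, source_url: str, dataset_id: str,
           snapshot_date: str, row_key: str) -> Provenance:
    """Build a provenance record whose digest covers the given row."""
    digest = hashlib.sha256(canonical_bytes(row)).hexdigest()
    return Provenance(source_url, dataset_id, snapshot_date, row_key, digest)


def verify(row: dict, prov: Provenance) -> bool:
    """Recompute the digest and compare it with the recorded one."""
    return hashlib.sha256(canonical_bytes(row)).hexdigest() == prov.sha256
```

Anyone holding the same snapshot row can call `verify` and confirm that the rendered fact was derived from exactly that row.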

Reproducible SQL

Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.

Daily reconciliation

Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.
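
A minimal sketch of what a reconciliation pass could look like, assuming counts are available as plain dataset-to-count mappings. The function name, logger name, and log format are hypothetical; the page does not describe the actual job.

```python
import logging

logger = logging.getLogger("reconcile")  # hypothetical logger name


def reconcile(published: dict[str, int], upstream: dict[str, int]) -> dict[str, int]:
    """Return {dataset: upstream - published} for every dataset whose counts differ.

    A dataset missing from either side is treated as having a count of 0.
    """
    drift: dict[str, int] = {}
    for dataset in sorted(published.keys() | upstream.keys()):
        pub = published.get(dataset, 0)
        up = upstream.get(dataset, 0)
        delta = up - pub
        if delta != 0:
            drift[dataset] = delta
            # Log the drift so it can be reviewed later.
            logger.warning(
                "drift dataset=%s published=%d upstream=%d delta=%+d",
                dataset, pub, up, delta,
            )
    return drift
```

An empty result means the published counts match the upstream snapshot for every dataset compared.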

Named medical review

Reviewed by Jennifer Montecillo, MD, non-practicing medical reviewer.

Read the full provenance and attestation methodology →

Two doors

Use the free API and open data

Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.

Explore the API → Browse the data catalog →

Talk to us

Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.

Talk to us →
© 2026 Fonteum LLC. All rights reserved.

The U.S. healthcare graph AI can cite — every fact carries its source.

Request access→

The substrate, by the numbers

  • 9.2M graph entities: providers, organizations, owners, and facilities
  • 13.3M linked identifiers: NPIs, CCNs, LEIs and more, resolved to entities
  • 4.7M graph edges: source-attested relationships between entities
  • 44 federal source families: distinct CMS, OIG, HRSA, FDA and peer datasets
  • 35 dataset pages: citable, downloadable /data catalog pages
  • 52 reproducible studies: each shipping the SQL behind its figures

Federated identity

One canonical identity. Every federal source.

NPPES uses NPIs. PECOS uses PECOS-IDs. CMS Care Compare uses CCNs. OIG LEIE uses NPIs and legacy LEIE-IDs. HRSA HPSA uses Provider-IDs. Fonteum resolves them into one canonical identity per provider across all 5 identity-graph datasets. Each link carries a confidence band, a methodology version, and enough source evidence to audit the link decision manually.

0 providers canonically linked across 5 identity-graph datasets · 0 high-confidence links

Verifier reference + code samples → Cryptographic chain → Snapshot integrity →

Confidence distribution

Public endpoints default to high (≥ 0.9) only.

No identity links yet. The linker runs every 5 minutes against the NPPES specialty caches and 4 source-side tables (pecos_providers, carecompare_providers, leie_providers, hrsa_providers). All 4 source tables currently ship empty. Once Cowork backfills normalize source_snapshots payloads into them, the linker will begin producing links and this chart will populate.

Match methods

Nine strategies across 5 federal sources, confidence-ordered.

NPPES ↔ PECOS

  • exact_npi_in_pecos · confidence 1.0
    PECOS row carries an explicit npi matching an NPPES record. Strongest possible link.
  • exact_name_address · confidence 0.95
    Normalized provider name AND byte-equal normalized address. Not 1.0 because group practices can share an office.
  • name_address_fuzzy · confidence 0.7-0.9
    Jaro-Winkler similarity, gated by exact state + zip5. The fuzzy ceiling is 0.9, so the public default (≥ 0.9) admits only fuzzy links that reach the ceiling and excludes the rest of the band.
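
To make the fuzzy strategy concrete, here is a small sketch of a Jaro-Winkler comparison gated by exact state and ZIP5. The Jaro-Winkler functions follow the standard definition. Everything else is an assumption for illustration: the record field names, taking the lower of the name and address similarities, and clamping the result into the 0.7-0.9 band. The page does not say how similarity is mapped to confidence.

```python
from typing import Optional


def jaro(s1: str, s2: str) -> float:
    """Standard Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order / 2
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, prefix_scale: float = 0.1) -> float:
    """Jaro similarity boosted for a shared prefix of up to 4 characters."""
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return j + prefix * prefix_scale * (1 - j)


def fuzzy_confidence(a: dict, b: dict,
                     floor: float = 0.7, ceiling: float = 0.9) -> Optional[float]:
    """Return a confidence in [floor, ceiling], or None if the pair is rejected."""
    # Gate: state and ZIP5 must match exactly before any fuzzy comparison.
    if a["state"] != b["state"] or a["zip5"] != b["zip5"]:
        return None
    # Assumption: score the pair by the weaker of name and address similarity.
    sim = min(jaro_winkler(a["name"], b["name"]),
              jaro_winkler(a["address"], b["address"]))
    if sim < floor:
        return None
    return min(sim, ceiling)  # assumption: clamp to the fuzzy ceiling
```

Under these assumptions a near-identical pair in the same ZIP5 scores at the 0.9 ceiling, while any pair that differs in state or ZIP5 is never compared at all.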

NPPES ↔ Care Compare CCN

  • exact_npi_in_carecompare · confidence 1.0
    Care Compare row carries an NPI matching the NPPES record.
  • exact_name_address_carecompare · confidence 0.95
    Facility name + byte-equal address. No fuzzy strategy — Care Compare facility names have high lexical variance; deferred to Phase 3.
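
The Care Compare strategies above can be read as a short cascade: try the strongest rule first and stop at the first one that applies. The sketch below shows that shape. The `normalize` function is a hypothetical stand-in (uppercase, drop punctuation, collapse whitespace), and the record field names are assumptions; the actual normalization rules are not described on this page.

```python
import re
from typing import Optional, Tuple


def normalize(text: str) -> str:
    """Hypothetical normalization: uppercase, drop punctuation, collapse spaces."""
    text = re.sub(r"[^A-Z0-9 ]", " ", text.upper())
    return " ".join(text.split())


def link_carecompare(nppes: dict, cc: dict) -> Optional[Tuple[str, float]]:
    """Return (method, confidence) for the strongest applicable strategy, or None."""
    # Strategy 1: the Care Compare row carries the same NPI as the NPPES record.
    if cc.get("npi") and cc["npi"] == nppes["npi"]:
        return ("exact_npi_in_carecompare", 1.0)
    # Strategy 2: normalized name matches and normalized address is byte-equal.
    same_name = normalize(nppes["name"]) == normalize(cc["name"])
    same_addr = (normalize(nppes["address"]).encode("utf-8")
                 == normalize(cc["address"]).encode("utf-8"))
    if same_name and same_addr:
        return ("exact_name_address_carecompare", 0.95)
    # No fuzzy fallback for Care Compare, as stated above.
    return None
```

Returning the method name alongside the confidence keeps each link decision auditable after the fact.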

NPPES ↔ OIG LEIE

  • exact_npi_in_leie · confidence 1.0
    Modern LEIE rows carry an NPI matching the NPPES record.
  • name_dob_leie · confidence 0.95
    Legacy LEIE rows (pre-NPI-era exclusions). Normalized name AND identical DOB. No fuzzy strategy: false-positive cost too high for exclusion claims.

NPPES ↔ HRSA HPSA

  • exact_npi_in_hrsa · confidence 1.0
    HRSA HPSA row carries an NPI matching the NPPES record.

Operator-asserted

  • manual · confidence 1.0
    Operator-asserted via direct SQL. Used for high-value disputed records. UI for self-service operator curation deferred to Phase 3.

Sample queries

Three reads. No credentials.

# Resolve an NPI to its canonical PECOS-ID
curl https://fonteum.com/api/v1/identity/nppes/1245319599

# Resolve a PECOS-ID to its canonical NPI
curl https://fonteum.com/api/v1/identity/pecos/PAC-100001

# Aggregate stats: total linked + confidence-band distribution
curl https://fonteum.com/api/v1/identity/stats

# Opt-in to surface low-confidence (<0.9) links for diligence
curl 'https://fonteum.com/api/v1/identity/nppes/1245319599?include_low_confidence=true'

Full reference + Node + Python + Go samples at /docs/identity.

The Bloomberg ticker for healthcare providers

Once a canonical identity layer exists and is queryable, it becomes the default reference for the category. Every Fonteum consumer (MCP server, webhook events, semantic search, future dbt package) joins against provider_identity_links, not against the federal sources directly.

  • FHIR R4 API: US Core 6.1.0 (live · /api/fhir)
  • Attestation chain: SHA-256 signed (public · /chain)
  • Methodology: version v2026.05.0 (public · /methodology)
  • Provenance: 14-tuple per field (source · date · confidence)
  • Source registry: 23 federal families (all primary-source · /sources)

See also

  • Data source: Identity Graph
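
For readers who prefer Python to curl, the sample queries above could be wrapped roughly as follows. The URL paths and the `include_low_confidence` parameter come from those samples. The helper names are hypothetical, and the response is assumed to be JSON, since its shape is not documented on this page.

```python
import json
from urllib.parse import quote, urlencode
from urllib.request import urlopen

BASE_URL = "https://fonteum.com/api/v1/identity"


def identity_url(source: str, identifier: str,
                 include_low_confidence: bool = False) -> str:
    """Build the lookup URL for one identifier in one source (e.g. 'nppes')."""
    url = f"{BASE_URL}/{quote(source, safe='')}/{quote(identifier, safe='')}"
    if include_low_confidence:
        # Opt in to links below the public 0.9 default.
        url += "?" + urlencode({"include_low_confidence": "true"})
    return url


def stats_url() -> str:
    """URL of the aggregate stats endpoint."""
    return f"{BASE_URL}/stats"


def fetch_json(url: str, timeout: float = 10.0):
    """GET the URL and parse the body as JSON (assumes a JSON response)."""
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

A lookup would then be `fetch_json(identity_url("nppes", "1245319599"))`. Building the URL separately from fetching it keeps the request easy to log and to test without network access.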