Skip to content
FonteumThe Graph
DataResearchCare CompareThe DifferAttestAPI
See the proof
  • Data
  • Research
  • Care Compare
  • The Differ
  • Attest
  • API
See the proof
Integrity runbook

Re-hash a snapshot. Don’t take our word for it.

Every dated snapshot Fonteum publishes is SHA-256 hashed and attested at write time. This runbook walks you through re-deriving that hash yourself from the upstream source archive and matching it against the published attestation — no credentials, no API key, just the bytes and a standard hashing tool.

← Attestation index The signed attestation chain →

What a match proves

Bytes, not promises.

Fonteum stores a SHA-256 hash of every ingested source archive in the snapshot_attestations table. The hash is computed once, at the moment of ingestion, and locked. A 64-character match between your recomputed hash and the published one attests three things at once:

  • Byte-exactness — the archive you fetched is byte-identical to what Fonteum ingested for that snapshot.
  • No silent edit — nothing in the pipeline altered the source bytes between the upstream portal and the published dataset.
  • Provenance — the snapshot ties back to a named federal source archive at a stated date, recorded on the attestation row.

This is the foundation layer. The signed, Certificate-Transparency-style chain at /docs/chain layers Ed25519 signatures and prev-hash linkage on top of these same hashes once enough snapshots have accumulated.

Runbook — shell

Five steps, one command that matters.

On macOS and Linux, shasum -a 256 (or sha256sum) is all you need. On Windows, use certutil -hashfile snapshot.bin SHA256.

# 1. Open the attestation for the snapshot you care about.
#    Browse https://fonteum.com/trust/integrity and follow any
#    "attestation →" link, or hit the attestation page directly:
#      https://fonteum.com/verify/<snapshot_id>
#    Note the published content_hash, hash_algorithm (sha-256),
#    content_size_bytes, and source_archive_url.

# 2. Re-fetch the exact upstream source archive that was ingested.
curl -L -o snapshot.bin "<source_archive_url from the attestation>"

# 3. Confirm the byte count matches content_size_bytes (cheap pre-check).
wc -c snapshot.bin

# 4. Re-hash the bytes with the same algorithm.
shasum -a 256 snapshot.bin
#    → 9f86d081884c7d659a2feaa0c55ad015a3bf4f1b2b0b822cd15d6c15b0f00a08  snapshot.bin

# 5. Compare the 64-character hex against the published content_hash.
#    An exact match attests the bytes you read are byte-identical to what
#    Fonteum ingested at snapshot time. Any mismatch means the upstream
#    file changed, the download is incomplete, or the wrong archive was fetched.
Runbook — Node 20+

Same check, in code.

Any language with a SHA-256 implementation works — the hash is computed over the raw archive bytes, nothing else. Node, stdlib only:

// Node 20+ (stdlib only) — re-hash a downloaded archive and compare.
import { createHash } from 'node:crypto';
import { readFile } from 'node:fs/promises';

// The published content_hash from https://fonteum.com/verify/<snapshot_id>
const PUBLISHED = '<paste the 64-char content_hash here>';

const bytes = await readFile('snapshot.bin');
const recomputed = createHash('sha256').update(bytes).digest('hex');

console.log('recomputed:', recomputed);
console.log('matches:   ', recomputed === PUBLISHED.toLowerCase());
When the hash doesn’t match

A mismatch is a signal, not a dead end.

A non-matching hash usually means one of the following — work down the list before concluding the data drifted:

  • Partial download — the byte count from wc -c does not equal content_size_bytes. Re-fetch.
  • Upstream republished the file — federal portals sometimes re-issue an archive in place. The attestation pins the bytes as they were at snapshot date; a newer upstream file is expected to differ.
  • Wrong archive — confirm you fetched the exact source_archive_url recorded on the attestation, not a sibling or index page.
  • Transform vs. raw — the hash covers the upstream source archive, not the post-ingest parsed table. Hash the archive, not the rendered dataset.

If the archive is byte-complete, fetched from the recorded URL, and still doesn’t match, that is exactly the kind of discrepancy this system exists to surface — email security@fonteum.com with the snapshot id.

Related

Where the hashes live.

  • https://fonteum.com/trust/integrity — the public attestation index: every snapshot, its hash, and a deep-link to its attestation record.
  • https://fonteum.com/docs/chain — the signed chain reference, with Node, Python, and Go samples that check the chain itself.
  • https://fonteum.com/methodology — how each source family is ingested, dated, and attested.

Built on the authoritative federal record

The primary sources, named on every page.

These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.

  • CMS
  • HHS-OIG
  • HRSA
  • FDA
  • NLM
  • NUCC
  • Census
  • BLS
  • BEA

See the full source registry, with license and refresh cadence for each →

Reproducible by design

Every figure traces to its federal source.

14-tuple provenance

Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.

Reproducible SQL

Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.

Daily reconciliation

Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.

Named medical review

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

Read the full provenance and attestation methodology →

Two doors

Use the free API and open data

Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.

Explore the API →Browse the data catalog →

Talk to us

Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.

Talk to us →
Fonteum
Products
The DifferAttestAPIFHIR API
Data
Care CompareResearchData catalogSources
Company
Why FonteumAboutPressEditorial policyCorrections
Legal
Privacy policyTerms of serviceMedical disclaimer

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

© 2026 Fonteum LLC. All rights reserved.

The U.S. healthcare graph AI can cite — every fact carries its source.

Request access→

The substrate, by the numbers

9.2Mgraph entitiesProviders, organizations, owners, and facilities
12.5Mlinked identifiersNPIs, CCNs, LEIs and more, resolved to entities
4.7Mgraph edgesSource-attested relationships between entities
44federal source familiesDistinct CMS, OIG, HRSA, FDA and peer datasets
33dataset pagesCitable, downloadable /data catalog pages
48reproducible studiesEach shipping the SQL behind its figures