Frame Check

Source Network reliability tiers

Per-provider precision, recall, and F1 measured against a seed corpus of claims whose ground truth was established independently. The resulting tier is what Frame Check's trust verdict uses to weight verifications: a 'verified' from a strong provider counts more than a 'verified' from a weak one.

Sample size

The current corpus tests 6 claims total, 4 to 5 per provider. F1 values at that N are preliminary reliability indicators, not population statistics. Tiers are likely to shift as the corpus grows. Cite a specific dated run URL, not this aggregate page, so your citation stays stable when the tiers update.

Current tiers (run 2026-04-17-wikipedia)

Corpus version: 0.1 · Claims tested: 6 · Run: 2026-04-17-wikipedia · Raw data: raw_verdicts.json

Provider Tier F1 Precision Recall N tested
wikipediastrong0.890.801.006

Tier definitions

strong
F1 >= 0.80. Verifications from this provider carry full weight in the trust verdict.
moderate
F1 between 0.55 and 0.79. Verifications count, but large weak-majority sets will lower the trust level.
weak
F1 below 0.55. A trust verdict riding predominantly on weak providers gets lowered one notch and a caveat attached; citers should treat this set as tentative.
uncalibrated
Provider exists in the Source Network but has no measurement yet. Uncalibrated is NOT the same as weak: unmeasured is unknown, not poor. The trust verdict treats uncalibrated as neutral and notes the absence of measurement to the user.

All calibration runs: calibration index. Methodology and detection protocol: methodology. Full citation guide: how to cite. Licensed CC-BY-4.0.