Skip to main content
aifinhub
AI in Markets Calculator Guide

How to use Hallucination Detector

Paste a source document and an LLM's extraction. Every numeric claim in the output is matched against the source — mismatches and unsupported claims are flagged so you catch fabrication before the number reaches a trading rule.

By Orbyd Editorial · AI Fin Hub Team
Best Next MovePlaygrounds

Hallucination Detector

Paste a source document + an LLM's extraction. Every numeric claim in the output is checked against the source. Client-side. Catches silent fabrication.

CalculatorOpen ->

On This Page

What It Does

Use the calculator with intent

Paste a source document and an LLM's extraction. Every numeric claim in the output is matched against the source — mismatches and unsupported claims are flagged so you catch fabrication before the number reaches a trading rule.

Engineers piping LLM extractions into trading or research pipelines who need a deterministic check that the numbers in the output actually appear in the source.

Interpreting Results

Flagged claims are the work. Each flag falls into one of three buckets: number not found in source (hallucination), number found but mis-attributed (paraphrase error), number rounded outside tolerance (precision drift). All three deserve a manual review before action.

Input Steps

Field by field

  1. 1

    Provide

    Provide source context (the document or retrieval result the model was supposed to ground its answer in).

  2. 2

    Provide

    Provide the model's output to be checked.

  3. 3

    Run calculation

    Run the detector. It checks entity grounding, numerical grounding, and self-consistency across N samples.

  4. 4

    Read outputs

    Read flagged spans. Each flag includes the failure type and severity. Review flags before accepting the output.

  5. 5

    For

    For batch use, upload a list of (context, output) pairs and read aggregate flag rates. Flag rate above 10% suggests prompt or model needs adjustment.

Common Scenarios

Use realistic starting points

10-K extraction sanity check

Source

10-K filing

Extraction

LLM-generated financials table

Every dollar figure in the table should match the filing line-item within rounding tolerance; unmatched figures are the hallucinations.

Earnings transcript Q&A summary

Source

Earnings call transcript

Extraction

Bulleted summary with growth rates

Growth rate quotes need to match management's spoken numbers exactly; reconstruction-from-prior-year errors are common.

Try These Tools

Run the numbers next

FAQ

Questions people ask next

The short answers readers usually want after the first pass.

Three checks documented on the methodology page: (1) entity grounding — does every named entity in the output appear in the source context? (2) numerical grounding — do numbers in the output match values in the source within tolerance? (3) self-consistency — does the same prompt produce stable outputs across N samples? Any failure flags the response.

Related Content

Keep the topic connected

Planning estimates only — not financial, tax, or investment advice.