Compare simulated model answers that sound polished against the evidence needed to decide whether they are actually correct. The tool separates fluency, confidence, correctness, and verification.
Live preview · launch for the interactive version
Polished wording, citations, percentages, and orderly steps can make an answer feel reliable before anyone checks it.
A confidence meter can be high even when the citation is invented, the historical claim overreaches, or the math contains one hidden error.
Participants reveal the evidence, write a claim, and name what a human needs to verify before using the answer.
Treat each confident answer as a claim, not a conclusion.
Read the simulated model answer before revealing the correction. Notice what makes it feel trustworthy.
Mark whether the answer sounds confident and whether you think it is actually correct. Keep those judgments separate.
Use the evidence panel to identify the unsupported source, wrong step, overclaim, or missed visual detail.
Save an observation that explains what a student, editor, or teacher would need to verify next.