
Fine-grained Hallucination Detection and Editing for Language Models

· 2 min read
Zain Hasan

A preview of the paper

A breakdown of the different types of hallucinations from AI2 (a code sketch of the taxonomy follows the list): 🍄

  1. Verifiably Factually Wrong ❌
  • Entity: an entity in a statement is incorrect (e.g. Christmas falls on Nov. 25th)

  • Relation: a semantic relationship in a statement is incorrect (e.g. The mouse ate the cat.)

  • Contradictory: a statement that entirely contradicts relevant evidence from the web (e.g. The Raptors are yet to win the NBA Finals.)

  2. Unverifiable Types of Hallucinations ⁉️
  • Invented: a statement about a concept that does not exist in world knowledge (e.g. MJ created the sideways somersault)

  • Subjective: a statement that lacks universal validity, i.e. an opinion (e.g. The Raptors are the best NBA team)

  • Unverifiable: a potentially factual statement that cannot be grounded in world evidence (e.g. Jensen sleeps in a leather jacket.)
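
To make the taxonomy concrete, here is a minimal sketch of the six error types as a Python enum. The names are illustrative, not taken from the paper's code:

```python
# A minimal sketch of the paper's taxonomy; names are illustrative.
from enum import Enum

class HallucinationType(Enum):
    # 1. Verifiably factually wrong
    ENTITY = "entity"                # wrong entity in a statement
    RELATION = "relation"            # wrong semantic relation in a statement
    CONTRADICTORY = "contradictory"  # contradicts relevant web evidence
    # 2. Unverifiable
    INVENTED = "invented"            # concept that does not exist in world knowledge
    SUBJECTIVE = "subjective"        # an opinion; lacks universal validity
    UNVERIFIABLE = "unverifiable"    # possibly factual, but cannot be grounded in evidence
```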

πŸ”Word vs. Sentence Level:​

Entity and Relation errors are usually word-level, so they can be fixed with small edits once you know where they occur.

Contradictory, Invented, Subjective, and Unverifiable errors are often sentence-level, so the offending sentence has to be removed entirely to fix the issue.
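
Here is a minimal sketch of how that distinction drives the repair, assuming a detector has already tagged error spans. `Span` and `apply_edits` are hypothetical names for illustration, not the paper's API:

```python
# Sketch: word-level errors get a small in-place edit; sentence-level errors
# cause the whole sentence to be dropped. Assumes spans come from a detector.
from dataclasses import dataclass

WORD_LEVEL = {"entity", "relation"}
SENTENCE_LEVEL = {"contradictory", "invented", "subjective", "unverifiable"}

@dataclass
class Span:
    sentence_idx: int      # which sentence the error occurs in
    start: int             # character offsets within that sentence
    end: int
    error_type: str        # one of the six types above
    replacement: str = ""  # suggested fix, used only for word-level errors

def apply_edits(sentences: list[str], spans: list[Span]) -> str:
    # Sentences containing any sentence-level error are removed entirely.
    drop = {s.sentence_idx for s in spans if s.error_type in SENTENCE_LEVEL}
    fixed = []
    for i, sent in enumerate(sentences):
        if i in drop:
            continue
        word_spans = [s for s in spans
                      if s.sentence_idx == i and s.error_type in WORD_LEVEL]
        # Apply replacements right-to-left so earlier offsets stay valid.
        for s in sorted(word_spans, key=lambda s: s.start, reverse=True):
            sent = sent[:s.start] + s.replacement + sent[s.end:]
        fixed.append(sent)
    return " ".join(fixed)

sentences = ["Christmas falls on Nov. 25th.",
             "The Raptors are the best NBA team."]
spans = [Span(0, 19, 28, "entity", "Dec. 25th"),  # word-level: small edit
         Span(1, 0, 34, "subjective")]            # sentence-level: remove
print(apply_edits(sentences, spans))  # -> Christmas falls on Dec. 25th.
```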

💻 Code

🔗 arXiv Link

📜 Download paper
