
Hallucination Detection #2

@joelpaulkoch

Description


After generating the response, we can check whether it contains hallucinations.
In a RAG context, there are several points to consider:

  • does the response answer the question?
  • is it based on the provided context?
  • are there obvious hallucinations?
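The three checks above could be posed directly to an LLM as yes/no questions. A minimal sketch, assuming an LLM-as-judge setup: only the prompt construction and answer parsing are shown, and `build_judge_prompt` / `parse_judge_answer` are hypothetical helper names, not part of the existing implementation.

```python
# Sketch: ask an LLM the three checks as YES/NO questions and parse the result.
# The actual LLM call is left out; any client could be plugged in.

JUDGE_PROMPT = """You are verifying a RAG response. Answer each question with YES or NO.

Question: {question}
Context: {context}
Response: {response}

1. Does the response answer the question?
2. Is the response based only on the provided context?
3. Is the response free of obvious hallucinations?

Answer as three lines, e.g.:
1. YES
2. NO
3. YES
"""

def build_judge_prompt(question: str, context: str, response: str) -> str:
    return JUDGE_PROMPT.format(question=question, context=context, response=response)

def parse_judge_answer(raw: str) -> dict:
    """Parse the three YES/NO lines into named booleans."""
    keys = ["answers_question", "grounded_in_context", "no_hallucination"]
    verdicts = {}
    for key, line in zip(keys, raw.strip().splitlines()):
        verdicts[key] = "YES" in line.upper()
    return verdicts
```

The structured answer format keeps parsing trivial; a production version would want a stricter output contract (e.g. JSON mode) and retries on malformed answers.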

There are different ways to implement hallucination detection:

  1. Models specifically trained for this task, for instance from Vectara
  2. LLMs answering the points above
  3. Other techniques, e.g. SelfCheckGPT
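To illustrate option 3, here is a deliberately simplified sketch of the SelfCheckGPT idea: sample several responses to the same prompt and flag sentences of the main response that the samples do not support. The original method scores consistency with NLI or QA models; plain token overlap is used here only to keep the example self-contained, so treat it as an illustration of the sampling idea rather than the real scoring.

```python
# SelfCheckGPT-style consistency check (simplified): a sentence that no
# sampled response supports is likely hallucinated.

def token_overlap(sentence: str, sample: str) -> float:
    """Fraction of the sentence's tokens that also appear in the sample."""
    a = set(sentence.lower().split())
    b = set(sample.lower().split())
    return len(a & b) / len(a) if a else 0.0

def selfcheck_scores(response_sentences, sampled_responses):
    """Per-sentence score: 1 - best overlap with any sample (higher = more suspect)."""
    return [
        1.0 - max(token_overlap(s, sample) for sample in sampled_responses)
        for s in response_sentences
    ]
```

Sampling the model several times costs extra inference, but needs no labeled data and no separate judge model, which is the main appeal of this family of techniques.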

For now I've implemented a simple hallucination detection that makes use of an LLM in ef1763b.

The RAG Triad (context relevance, groundedness, answer relevance) is another way to check for hallucinations.
This could also be done by an LLM: we ask it to return a score for each criterion and classify the response as a hallucination based on thresholds.
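The threshold step could look like the sketch below, assuming the LLM returns a 0.0–1.0 score per RAG Triad criterion (context relevance, groundedness, answer relevance). The threshold values are illustrative, not tuned.

```python
# Classify a response as a hallucination from per-criterion RAG Triad scores.
# Scores are assumed to be in [0.0, 1.0]; thresholds are placeholders.

THRESHOLDS = {
    "context_relevance": 0.5,
    "groundedness": 0.7,
    "answer_relevance": 0.5,
}

def classify_triad(scores: dict) -> dict:
    """Flag each failing criterion, plus an overall hallucination verdict."""
    failures = {name: scores[name] < t for name, t in THRESHOLDS.items()}
    failures["hallucination"] = any(failures.values())
    return failures
```

Keeping the per-criterion flags alongside the overall verdict makes it easy to report *why* a response was rejected, e.g. grounded but answering the wrong question.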
