Stopping hallucinations by feeding LLMs the right information first.
The way to avoid hallucination is to stop the model from reaching into the model layers for information. If you give it structured or unstructured information in raw format and then ask it to summarize or list, it gives you exactly what you provided. Without that context, it makes things up because it’s doing probabilistic retrieval from its training. This is retrieval augmented generation. You get the information first, augment the prompt with it, then generate the response based on that specific context.
— Large Language Models: one interface to rule them all? · The Edge