Skip to main content

Retrieval-Augmented Generation for Large Language Models: A Survey

· 2 min read
Leonie Monigatti

A preview of the paper

A recent survey on Retrieval-Augmented Generation (RAG) mentions an evolving paradigm: Modular RAG.

Modular RAG is comprised of various functional modules. Thus, modular RAG is not standalone. Instead, different RAG patterns are composed of different modules.

For example, the following animation shows: 🥚 The original naive RAG paradigm consists of the “Retrieval”, "Augmentation," and "Generation" modules.

🐣 After naive RAG has shown some limitations, advanced RAG has emerged as a new paradigm. A typical pattern of Advanced RAG builds upon the foundation of Naive RAG by adding “Rewrite” and “Rerank” modules.

🐓 Different RAG patterns, such as DSP, can be composed of entirely different modules.

The modular RAG paradigm is slowly becoming the norm in the RAG domain due to its versatility and flexibility, allowing:

  • the adaption of modules within the RAG process to suit your specific problem,
  • for a serialized pipeline or an end-to-end training approach across multiple modules.

I definitely recommend checking out the full survey if you want to catch up on recent advancements in the RAG domain.

🔗 arXiv Link

📜 Download paper

Ready to start building?

Check out the Quickstart tutorial, or build amazing apps with a free trial of Weaviate Cloud (WCD).

Don't want to miss another blog post?

Sign up for our bi-weekly newsletter to stay updated!


By submitting, I agree to the Terms of Service and Privacy Policy.