Retrieval Augmented Generation

Jun 2024

Assess

Why?

Retrieval Augmented Generation (RAG) is an architecture that augments a Large Language Model (LLM), such as the models behind ChatGPT, with an information retrieval system that supplies grounding data. Because retrieval happens before generation, you control which data the LLM uses when it formulates a response. For an enterprise solution, this means you can constrain generative AI to your own enterprise content: vectorized documents and images, plus other data formats for which you have embedding models.
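The retrieve-then-generate flow described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the sample documents, the function names (embed, cosine, retrieve, build_prompt), and the toy bag-of-words "embedding", which stands in for a real embedding model and vector database. The final call to an LLM is left out; the sketch stops at the grounded prompt that would be sent to it.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in for enterprise content; a real system would index
# many documents in a vector database.
DOCUMENTS = [
    "Expense reports must be submitted within 30 days of purchase.",
    "The VPN client is required for remote access to internal systems.",
    "Annual leave requests are approved by the line manager.",
]


def embed(text):
    """Toy embedding: word counts. A real system would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=1):
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query, passages):
    """Assemble a prompt that restricts the LLM to the retrieved passages."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


query = "When must expense reports be submitted?"
passages = retrieve(query, DOCUMENTS, k=1)
prompt = build_prompt(query, passages)  # this string would be sent to the LLM
```

Only the retrieved passage reaches the prompt, which is what gives you control over the grounding data. Swapping the toy embed function for a real embedding model and the list scan for a vector index would not change the overall shape of the flow.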

Relates to VectorDB

What?

References

"Retrieval Augmented Generation" -- Confluence page in the AD Space under "Large Language Models/LLMs Theory/Inference" section

Mar 2024

Assess

Why?

What?

References

"Retrieval Augmented Generation" -- Confluence page in the AD Space under "Large Language Models/LLMs Theory/Inference" section