Retrieval Augmented Generation

Retrieval-augmented generation (RAG) for large language models (LLMs) aims to improve prediction quality by using an external datastore at inference time to build a richer prompt that includes some combination of context, history, and recent/relevant knowledge.
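As a rough illustration of the idea, the sketch below retrieves the most relevant passages from a small in-memory datastore and folds them, together with any conversation history, into a prompt. Everything here is an assumption made for illustration: the function names `tokenize`, `retrieve`, and `build_prompt` are hypothetical, the sample passages are made up, and simple keyword overlap stands in for whatever retriever (for example, a vector or full-text search index) a real deployment would use. It is not an Azure API.

```python
import re
from typing import List, Sequence, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, datastore: Sequence[str], top_k: int = 2) -> List[str]:
    """Return up to top_k passages that share at least one token with the query.

    Keyword overlap is only a stand-in for a real retriever.
    """
    query_tokens = tokenize(query)
    scored = [(len(query_tokens & tokenize(doc)), doc) for doc in datastore]
    # Drop passages with no overlap, then rank by overlap count (highest first).
    relevant = [pair for pair in scored if pair[0] > 0]
    ranked = sorted(relevant, key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in ranked[:top_k]]


def build_prompt(
    query: str,
    datastore: Sequence[str],
    history: Sequence[str] = (),
    top_k: int = 2,
) -> str:
    """Assemble a prompt from retrieved context, prior turns, and the question."""
    context = retrieve(query, datastore, top_k)
    parts = []
    if context:
        parts.append("Context:\n" + "\n".join(f"- {c}" for c in context))
    if history:
        parts.append("History:\n" + "\n".join(history))
    parts.append(f"Question: {query}")
    return "\n\n".join(parts)


if __name__ == "__main__":
    # Hypothetical sample passages standing in for an external datastore.
    docs = [
        "The warranty period is 24 months.",
        "Shipping takes 3 to 5 business days.",
        "Returns are accepted within 30 days.",
    ]
    print(build_prompt("How long is the warranty period?", docs, history=["User: Hi"]))
```

The resulting string is what would be sent to the LLM in place of the bare question; the model itself is unchanged, and only the prompt is enriched at inference time.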

In Azure, the RAG pattern is implemented with the following components: