12 Essential RAG Types
(A Comprehensive Guide)
Karn Singh
What is Retrieval-Augmented Generation (RAG)?
Retrieval-Augmented Generation (RAG) is a powerful technique that enhances the capabilities of large language models
(LLMs) by integrating external knowledge sources. There are various types of RAG models, each designed to address specific
needs and improve the performance of information retrieval and generation tasks.
Types of RAG Models
Naive RAG
Definition: Naive RAG enhances large language models (LLMs) by integrating external knowledge into their responses.
Purpose: Addresses LLM limitations, particularly the inability to access real-time or updated information.
Key Steps in Naive RAG
• Document Chunking: Large documents are divided into smaller, manageable chunks for efficient retrieval.
• Embedding Model: Both document chunks and user queries are converted into numerical representations (embeddings) for semantic comparison.
• Retrieval: Relevant document chunks are retrieved from an indexed database based on user query embeddings.
• Response Generation: The LLM generates coherent responses using the retrieved chunks and the original query, ensuring relevance and context.
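A minimal end-to-end sketch of these four steps, assuming you supply your own embed (texts to vectors) and llm (prompt to text) callables; the names are placeholders, not any particular library's API:

```python
import numpy as np

def chunk(document: str, size: int = 500) -> list[str]:
    # Document chunking: split a long document into fixed-size pieces.
    return [document[i:i + size] for i in range(0, len(document), size)]

def retrieve(query: str, chunks: list[str], chunk_vecs: np.ndarray, embed, k: int = 3) -> list[str]:
    # Retrieval: cosine similarity between the query embedding and chunk embeddings.
    q = np.asarray(embed([query]))[0]
    sims = chunk_vecs @ q / (np.linalg.norm(chunk_vecs, axis=1) * np.linalg.norm(q) + 1e-9)
    return [chunks[i] for i in np.argsort(sims)[::-1][:k]]

def naive_rag(query: str, document: str, embed, llm) -> str:
    # embed maps a list of texts to an (n, d) array; llm maps a prompt string to text.
    chunks = chunk(document)
    chunk_vecs = np.asarray(embed(chunks))
    context = "\n\n".join(retrieve(query, chunks, chunk_vecs, embed))
    # Response generation: the LLM answers using the retrieved context plus the query.
    return llm(f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")
```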
Advanced RAG
Definition: Advanced RAG enhances Naive RAG by integrating sophisticated techniques for better retrieval and generation.
Techniques Used
• Re-ranking: Prioritizes retrieved documents based on relevance.
• Dynamic Embeddings: Adjusts embeddings for specific tasks or domains.
• Hierarchical Indexing: Organizes data into a structured hierarchy for improved retrieval.
• Corrective RAG (CRAG): Scores and filters documents for relevance and accuracy.
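One way the re-ranking and query-rewriting ideas might fit together, sketched with hypothetical retriever, score_fn (for example a cross-encoder or LLM grader), rewrite_fn, and llm callables:

```python
def rerank(query: str, docs: list[str], score_fn, top_n: int = 3) -> list[str]:
    # Re-ranking: score every (query, document) pair and keep the most relevant docs.
    # score_fn stands in for a cross-encoder or an LLM-based relevance grader.
    return sorted(docs, key=lambda d: score_fn(query, d), reverse=True)[:top_n]

def advanced_rag(query: str, retriever, score_fn, llm, rewrite_fn=None) -> str:
    # Optional query rewriting: reformulate the user query before retrieval.
    search_query = rewrite_fn(query) if rewrite_fn else query
    candidates = retriever(search_query, k=20)                # broad first-stage retrieval
    context = "\n\n".join(rerank(query, candidates, score_fn))
    return llm(f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")
```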
Goals:
• Enhance efficiency, accuracy, and relevance of information retrieval.
• Tackle complex queries and handle diverse data sources effectively.
Advantages Over Naive RAG:
• Better relevance and coherence in responses due to advanced filtering and re-ranking.
• Enhanced query optimization through methods like query rewriting.
• Improved scalability for handling larger datasets efficiently.
Applications: Suitable for complex applications requiring higher precision, such as advanced question-answering systems and AI
chatbots.
Advanced RAG represents a significant evolution in the capabilities of RAG systems, addressing key challenges faced by Naive
RAG implementations and providing a more robust framework for generating accurate, contextually rich responses.
Modular RAG
Definition: Modular RAG enhances traditional RAG systems by introducing modularity for improved flexibility and performance.
Key Components
• Customizable Retrievers: Tailored retrieval mechanisms for specific use cases, enhancing efficiency and relevance.
• Adaptive Generators: Generative models that integrate with various retrievers for better coherence and accuracy.
• Plug-and-Play Modules: Components that can be easily added or replaced for system customization.
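A small sketch of the plug-and-play idea, assuming retriever and generator components that satisfy two simple interfaces (the concrete class names in the trailing comment are made up for illustration):

```python
from typing import Protocol

class Retriever(Protocol):
    def retrieve(self, query: str, k: int) -> list[str]: ...

class Generator(Protocol):
    def generate(self, query: str, context: list[str]) -> str: ...

class ModularRAG:
    """A pipeline whose retriever and generator are swappable, plug-and-play modules."""

    def __init__(self, retriever: Retriever, generator: Generator):
        self.retriever = retriever
        self.generator = generator

    def answer(self, query: str, k: int = 5) -> str:
        context = self.retriever.retrieve(query, k)
        return self.generator.generate(query, context)

# Swapping a module is just constructing the pipeline with a different component,
# e.g. (hypothetical classes) ModularRAG(BM25Retriever(index), LocalLLMGenerator(model))
# versus ModularRAG(VectorStoreRetriever(store), HostedLLMGenerator(client)).
```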
Applications: Suitable for diverse applications, including customer support chatbots and advanced question-answering
systems, where tailored solutions are essential.
Modular RAG represents a significant evolution in retrieval-augmented techniques, addressing the limitations of Naive RAG
and providing a robust framework for building adaptable AI systems.
Query-Based Retrieval-Augmented Generation (QB-RAG)
Definition: QB-RAG optimizes retrieval by pre-computing a database of potential queries, improving alignment between user
questions and relevant content.
Key Features
• Query Pre-computation: Generates a comprehensive set of potential queries from the knowledge base to facilitate efficient retrieval.
• Vector Search: Utilizes vector search techniques to match incoming user queries against the pre-generated query database, enhancing retrieval accuracy.
• Semantic Alignment: Focuses on aligning user queries with content across distinct semantic representations, addressing gaps in traditional retrieval methods.
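A minimal sketch of the question-to-question matching idea, assuming generate_queries (e.g. an LLM prompted to write the questions a chunk can answer) and embed callables that you provide:

```python
import numpy as np

def build_query_index(chunks: list[str], generate_queries, embed):
    # Offline step: for every chunk, generate the questions it can answer
    # and embed those questions rather than the chunk itself.
    pairs = []                                    # (candidate question, source-chunk index)
    for i, text in enumerate(chunks):
        for q in generate_queries(text):
            pairs.append((q, i))
    vecs = np.asarray(embed([q for q, _ in pairs]))
    return pairs, vecs

def qb_retrieve(user_query: str, pairs, vecs, chunks, embed, k: int = 3) -> list[str]:
    # Online step: vector-search the user query against the pre-generated questions,
    # then return the chunks behind the best-matching questions, so alignment happens
    # question-to-question instead of question-to-document.
    q = np.asarray(embed([user_query]))[0]
    sims = vecs @ q / (np.linalg.norm(vecs, axis=1) * np.linalg.norm(q) + 1e-9)
    top = np.argsort(sims)[::-1][:k]
    return [chunks[pairs[i][1]] for i in top]
```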
Advantages
• Improved Accuracy: Empirical evaluations show that QB-RAG significantly enhances the accuracy of
responses, particularly in healthcare question-answering applications.
• Robustness: Provides a more reliable framework for applications requiring trustworthy responses from
LLMs.
Applications
• Particularly effective in digital health chatbots and other domains where accurate, real-time
information is critical.
QB-RAG represents a significant advancement in retrieval techniques for RAG systems, addressing existing
challenges in aligning user queries with relevant knowledge effectively.
Logit-based RAG
Definition: Combines retrieval information with generative models using logits (raw output values before softmax) during the
decoding process.
Key Features
• Logit Integration: Integrates relevant retrieved information into generation through logits, enabling nuanced decision-making.
• Augmentation Methodology: Allows retrieved results to influence generation stages, either as input or by modifying logits directly.
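A deliberately simplified illustration of influencing generation "by modifying logits directly": add a bonus to the raw logits of vocabulary tokens that occur in the retrieved passages before the softmax. This is a toy example of the idea, not the exact scheme any particular system uses.

```python
import numpy as np

def softmax(x: np.ndarray) -> np.ndarray:
    z = np.exp(x - x.max())
    return z / z.sum()

def retrieval_biased_next_token(logits: np.ndarray,
                                retrieved_token_ids: set[int],
                                alpha: float = 2.0) -> np.ndarray:
    # Boost the raw logits of tokens that appear in the retrieved passages,
    # then renormalise into next-token probabilities.
    bias = np.zeros_like(logits)
    bias[list(retrieved_token_ids)] = alpha
    return softmax(logits + bias)

# Toy usage: vocabulary of 10 tokens, tokens 3 and 7 appear in the retrieved text.
probs = retrieval_biased_next_token(np.random.randn(10), {3, 7})
```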
Advantages
• Enhanced Relevance: By using logits, this approach can better determine which retrieved information is most relevant to
the current query.
• Improved Output Quality: The integration of retrieval data through logits helps generate more accurate and contextually
appropriate responses.
Applications
• Particularly useful in scenarios requiring high accuracy and contextual relevance, such as advanced question-answering
systems and AI-generated content.
Logit-Based RAG represents a significant advancement in retrieval techniques, effectively leveraging the strengths of
generative models to improve response quality and relevance.
Latent Representation-based RAG
Definition: Incorporates retrieved data as latent representations within generative models to enhance comprehension and
output quality.
Key Features
• Latent Representation Integration: Integrates retrieved objects at a deeper level, influencing the model's hidden states during generation.
• Enhanced Comprehension: Provides a nuanced understanding of context through the use of latent representations.
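A toy illustration of injecting retrieved content at the hidden-state level: a single-head cross-attention step that lets generator states attend over pre-encoded passage vectors. Real systems learn these projections end to end; this only shows the shape of the computation.

```python
import numpy as np

def softmax(x: np.ndarray, axis: int = -1) -> np.ndarray:
    z = np.exp(x - x.max(axis=axis, keepdims=True))
    return z / z.sum(axis=axis, keepdims=True)

def fuse_latents(hidden: np.ndarray, passage_vecs: np.ndarray) -> np.ndarray:
    # hidden:       (seq_len, d)  generator hidden states
    # passage_vecs: (n_docs, d)   encoded retrieved passages (latent representations)
    # Each hidden state attends over the retrieved representations and mixes
    # them back in through a residual connection.
    d = hidden.shape[-1]
    scores = hidden @ passage_vecs.T / np.sqrt(d)          # (seq_len, n_docs)
    weights = softmax(scores, axis=-1)
    return hidden + weights @ passage_vecs

# Toy shapes: 4 decoder positions, 3 retrieved passages, model dimension 8.
fused = fuse_latents(np.random.randn(4, 8), np.random.randn(3, 8))
```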
Advantages
• Improved Output Quality: Enhances the relevance and quality of generated outputs.
• Adaptability Across Modalities: Suitable for various applications, including text, code, image, and audio generation.
Applications
• Effective in fields such as natural language processing, computer vision, and audio processing, where accurate and
contextually relevant outputs are essential.
Latent Representation-Based RAG signifies a notable advancement in retrieval techniques, leveraging sophisticated
algorithms to incorporate external information into generative processes effectively.
Speculative RAG
Definition: Speculative RAG enhances retrieval-augmented generation by using a larger generalist language model (LM) to verify
multiple drafts generated in parallel by a smaller, specialized LM.
Key Features
• Drafting and Verification: Separates drafting (specialist LM) from verification (generalist LM) to improve efficiency.
• Parallel Draft Generation: Creates multiple drafts from distinct subsets of retrieved documents, allowing for diverse
perspectives.
• Efficiency: Offloads drafting to a smaller model, accelerating response generation while maintaining accuracy.
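A minimal sketch of the draft-then-verify loop, assuming draft_lm (small specialist model) and verify_lm (larger generalist model returning a numeric trust score) are callables you supply:

```python
def speculative_rag(query: str, docs: list[str], draft_lm, verify_lm, n_drafts: int = 3) -> str:
    # Split the retrieved documents into distinct subsets, one per draft,
    # so each draft sees a different slice of the evidence.
    subsets = [docs[i::n_drafts] for i in range(n_drafts)]

    # Drafting: the smaller specialist LM writes one candidate answer per subset
    # (in a real system these calls would run in parallel).
    drafts = [draft_lm(query, subset) for subset in subsets]

    # Verification: the larger generalist LM scores each draft; the best one wins.
    scores = [verify_lm(query, draft) for draft in drafts]
    return max(zip(scores, drafts), key=lambda pair: pair[0])[1]
```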
Advantages
• Enhanced Accuracy: Improves accuracy by up to 12.97% on benchmarks like TriviaQA and PubHealth.
• Reduced Latency: Lowers response times by 51% compared to traditional RAG systems.
Applications
• Particularly effective in knowledge-intensive tasks such as question answering, where timely and accurate information
retrieval is crucial.
Self-Reflective RAG (Self-RAG)
Definition: Enhances language models (LMs) by enabling on-demand retrieval of relevant passages and self-reflection on
generated outputs using reflection tokens.
Key Features
• Reflection Tokens: These tokens signal the need for retrieval or assess the quality of generated outputs, allowing the model to adapt its behavior during inference.
• Adaptive Retrieval: The framework allows LMs to determine when to retrieve additional information based on the context of the input and previous generations.
• End-to-End Training: Self-RAG trains a single LM to generate text informed by retrieved passages while also critiquing its own outputs.
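A rough sketch of the retrieve-and-critique loop. The reflection-token strings and the lm/retriever callables below are illustrative stand-ins; the actual Self-RAG special tokens and training setup are more involved.

```python
def self_rag(query: str, lm, retriever, max_rounds: int = 3) -> str:
    RETRIEVE, SUPPORTED = "[Retrieve]", "[Fully-Supported]"   # illustrative tokens only
    context: list[str] = []
    answer = ""
    for _ in range(max_rounds):
        answer = lm(query, context)                    # model may emit a retrieval token
        if RETRIEVE in answer:
            context += retriever(query, k=3)           # adaptive, on-demand retrieval
            continue
        critique = lm(f"Critique whether this answer is supported by the context: {answer}",
                      context)
        if SUPPORTED in critique:                      # self-reflection passed
            break
    return answer.replace(RETRIEVE, "").strip()
```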
Advantages
• Improved Factuality: Significantly enhances the accuracy of responses, outperforming state-of-the-art LMs in tasks like open-domain QA and fact
verification.
• Versatility: Maintains the original creativity and versatility of LMs while improving their factual accuracy.
Applications
• Effective in various tasks requiring high-quality, factual responses, such as question answering, reasoning, and content generation.
Branched RAG
Definition: An advanced framework that enhances standard RAG through a structured, multi-step approach to retrieval and response generation.
Key Features
• Multiple Retrieval Steps: Conducts sequential retrievals to gather information progressively.
• Hierarchical Structure: Uses a branching pattern where each retrieval informs the next, allowing deeper topic exploration.
• Specialized Knowledge Bases: Different branches can query distinct knowledge bases tailored to specific sub-topics.
• Dynamic Query Refinement: Refines queries based on intermediate results for focused and relevant retrievals.
Workflow
1. Initial Broad Retrieval: Captures potentially relevant information for context.
2. Intermediate Retrievals: Narrows the search space based on initial results.
3. Final Focused Retrieval: Yields highly relevant information, improving precision.
4. Generation Step: Synthesizes information from multiple retrieval steps for comprehensive responses.
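One way this workflow could look in code, assuming a dictionary of per-branch retrieval functions plus refine and llm callables that you provide:

```python
from typing import Callable

def branched_rag(query: str, branches: dict[str, Callable], refine, llm, depth: int = 3) -> str:
    # branches maps a branch name to a retrieval function over its own knowledge base;
    # refine turns (original query, evidence gathered so far) into a narrower query.
    evidence: list[str] = []
    current_query = query
    for _ in range(depth):                               # broad -> intermediate -> focused
        for retrieve in branches.values():               # each branch queries its own KB
            evidence += retrieve(current_query, k=2)
        current_query = refine(query, evidence)          # dynamic query refinement
    # Generation step: synthesize across everything gathered along the branches.
    return llm("Question: " + query + "\nEvidence:\n" + "\n".join(evidence))
```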
Applications
• Effective for complex queries requiring multi-step reasoning or synthesis of information, such as in legal tools or multidisciplinary research.
Agentic RAG
Definition: Integrates AI agents into the RAG framework to enhance information retrieval and processing capabilities for complex
tasks.
Key Features
• Agent-Based Architecture: Uses agents to orchestrate retrieval processes and make sourcing decisions.
• Tool Integration: Agents access various tools (e.g., vector search engines, web searches, APIs) for information gathering.
• Dynamic Decision-Making: Agents evaluate when to retrieve information and which tools to use based on context.
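A bare-bones sketch of an agent loop over a toolbox. The "USE <tool>: <input>" / "ANSWER: <text>" mini-protocol and the agent_llm and tool callables are invented for illustration; real agent frameworks use richer tool-calling interfaces.

```python
from typing import Callable

def agentic_rag(query: str, agent_llm, tools: dict[str, Callable], max_steps: int = 5) -> str:
    # tools maps tool names (e.g. "vector_search", "web_search") to functions.
    scratchpad = f"Question: {query}"
    for _ in range(max_steps):
        decision = agent_llm(scratchpad, list(tools))      # dynamic decision-making
        if decision.startswith("ANSWER:"):
            return decision[len("ANSWER:"):].strip()
        if decision.startswith("USE "):
            tool_name, tool_input = decision[len("USE "):].split(":", 1)
            observation = tools[tool_name.strip()](tool_input.strip())
            scratchpad += f"\nObservation from {tool_name.strip()}: {observation}"
    return agent_llm(scratchpad + "\nGive your best final answer.", [])
```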
Advantages
• Enhanced Flexibility: Supports multi-step retrieval processes and adaptive responses to complex queries.
• Improved Accuracy: Agents validate retrieved information, leading to more robust outputs.
Applications
• Effective in real-time adaptive responses, such as automated customer support, internal knowledge management, and
research assistance.
Adaptive RAG
Definition: Adaptive RAG dynamically adjusts its retrieval strategy based on the complexity or nature of the query, enhancing the
accuracy and relevance of responses.
Key Features
• Dynamic Retrieval Strategy: Alters retrieval methods in real time, using a single source for simple queries and multiple sources for complex ones.
• Query Classification: Determines the type of query (factual, analytical, opinion-based, contextual) to apply appropriate retrieval strategies.
• LLM Integration: Utilizes language models at different stages to optimize document ranking and response generation.
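A small sketch of the routing idea, assuming a classify callable (possibly an LLM-based classifier) and your own retrievers; the "simple"/"complex" labels are illustrative, not a fixed taxonomy:

```python
def adaptive_rag(query: str, classify, single_retriever, multi_retrievers, llm) -> str:
    label = classify(query)                                # e.g. "simple" or "complex"
    if label == "simple":
        context = single_retriever(query, k=3)             # one source is enough
    else:
        context = []
        for retrieve in multi_retrievers:                  # fan out across sources
            context += retrieve(query, k=3)
    return llm("Context:\n" + "\n".join(context) + f"\n\nQuestion: {query}")
```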
Advantages
• Tailored Responses: Ensures customized responses for diverse query types, bridging the gap between precision and breadth.
• Enhanced Efficiency: Improves information retrieval speed and relevance by adapting to query characteristics.
Applications
• Suitable for environments with varied query types, such as search engines and AI assistants, where dynamic adjustments are crucial for effective information retrieval.
Corrective RAG
Definition: Corrective RAG (CRAG) incorporates self-reflection and self-grading of retrieved documents to enhance the accuracy and relevance of generated responses.
Key Features
• Self-Reflection: Evaluates the relevance of retrieved documents before generation.
• Knowledge Refinement: Partitions documents into "knowledge strips" and grades each for relevance.
• Supplementary Retrieval: If documents fall below a relevance threshold, CRAG performs additional retrievals, such as web searches.
Workflow
1. Initial Retrieval: Retrieves documents based on the input query.
2. Grading Documents: Evaluates each document's relevance to the query.
3. Knowledge Refinement: Filters out irrelevant knowledge strips.
4. Supplemental Search: Uses web searches if necessary to find additional relevant information.
5. Response Generation: Generates a response using refined and relevant information.
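A minimal sketch of this five-step workflow, assuming retriever, grade (returns a 0-1 relevance score), web_search, and llm callables that you supply:

```python
def corrective_rag(query: str, retriever, grade, web_search, llm,
                   threshold: float = 0.5) -> str:
    # 1. Initial retrieval.
    docs = retriever(query, k=5)
    # 2-3. Grade each retrieved document / knowledge strip and keep the relevant ones.
    relevant = [d for d in docs if grade(query, d) >= threshold]
    # 4. Supplemental search if nothing retrieved was judged relevant enough.
    if not relevant:
        relevant = web_search(query)
    # 5. Response generation from the refined evidence.
    return llm("Evidence:\n" + "\n".join(relevant) + f"\n\nQuestion: {query}")
```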
Applications
• Ideal for scenarios requiring high factual accuracy, such as legal document generation and medical diagnosis support.