
Introduction:

● Retrieval-Augmented Generation (RAG) is a technique that enhances language model generation by incorporating external knowledge.
● This is typically done by retrieving relevant information from a large corpus of documents and using that information to inform the generation process.

1. Challenge:

● Clients often have vast proprietary documents.
● Extracting specific information from them is like finding a needle in a haystack.

2. GPT4-Turbo Introduction:

● OpenAI’s GPT4-Turbo can process large documents.

3. Efficiency Issue:

● The “Lost in the Middle” phenomenon hampers efficiency.
● The model forgets content in the middle of its context window.

4. Alternative Approach: Retrieval-Augmented Generation (RAG):

● Create an index for each document paragraph.
● Swiftly identify the pertinent paragraphs.
● Feed only the selected paragraphs into a Large Language Model (LLM) such as GPT-4 (see the sketch after this list).

5. Advantages:

● Prevents information overload.
● Enhances result quality by providing only the relevant paragraphs.
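A minimal sketch of this index-then-retrieve loop is shown below. The embed() helper is a hypothetical stand-in for a real embedding model, and the prompt wiring is illustrative rather than any particular vendor's API:

```python
import numpy as np

def embed(text: str) -> np.ndarray:
    """Hypothetical embedding function; replace with a real embedding model."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    return rng.standard_normal(384)

def build_index(paragraphs: list[str]) -> np.ndarray:
    # Step 1: create an index entry (unit-normalized embedding) per paragraph.
    vecs = np.stack([embed(p) for p in paragraphs])
    return vecs / np.linalg.norm(vecs, axis=1, keepdims=True)

def retrieve(query: str, paragraphs: list[str], index: np.ndarray, k: int = 3):
    # Step 2: identify the k most pertinent paragraphs by cosine similarity.
    q = embed(query)
    q = q / np.linalg.norm(q)
    top = np.argsort(index @ q)[::-1][:k]
    return [paragraphs[i] for i in top]

paragraphs = [
    "RAG retrieves relevant text before generation.",
    "LLMs can forget content in the middle of long contexts.",
    "Vector search compares embeddings for semantic similarity.",
]
index = build_index(paragraphs)
context = retrieve("How does RAG find information?", paragraphs, index)
# Step 3: feed only the selected paragraphs to the LLM as context.
prompt = "Answer using only this context:\n" + "\n".join(context)
```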

Why is Retrieval-Augmented Generation important?

● You can think of the LLM as an over-enthusiastic new employee who refuses to stay informed about current events but will always answer every question with absolute confidence.
● Unfortunately, such an attitude can negatively impact user trust and is not something you want your chatbots to emulate!
● RAG is one approach to solving some of these challenges. It redirects the LLM to retrieve relevant information from authoritative, pre-determined knowledge sources.
● Organizations gain greater control over the generated text output, and users gain insight into how the model generates its responses.
Why use RAG?
RAG offers several advantages when augmenting traditional methods of text generation, especially when dealing with factual information or data-driven responses. Here are some key reasons why using RAG can be beneficial:

Access to fresh information

LLMs are limited to their pre-training data, which leads to outdated and potentially inaccurate responses. RAG overcomes this by supplying up-to-date information to LLMs.

Factual grounding

LLMs are powerful tools for generating creative and engaging text, but they
can sometimes struggle with factual accuracy. This is because LLMs are
trained on massive amounts of text data, which may contain inaccuracies or
biases.

Providing “facts” to the LLM as part of the input prompt can mitigate “gen AI hallucinations.” The crux of this approach is ensuring that the most relevant facts are provided to the LLM, and that the LLM output is entirely grounded in those facts while also answering the user’s question and adhering to system instructions and safety constraints.
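As a concrete illustration, a grounded prompt might be assembled like this. The build_grounded_prompt() helper and its wording are assumptions for demonstration, not a prescribed template:

```python
def build_grounded_prompt(question: str, facts: list[str]) -> str:
    # Present the retrieved facts, then instruct the model to stay within them.
    fact_block = "\n".join(f"- {f}" for f in facts)
    return (
        "Answer the question using ONLY the facts below. "
        "If the facts are insufficient, say so.\n\n"
        f"Facts:\n{fact_block}\n\n"
        f"Question: {question}\nAnswer:"
    )

prompt = build_grounded_prompt(
    "When was the policy last updated?",
    ["The policy was last updated in March 2024."],
)
```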

Using Gemini’s long context window (LCW) is a great way to provide source
materials to the LLM. If you need to provide more information than fits into the
LCW, or if you need to scale up performance, you can use a RAG approach
that will reduce the number of tokens, saving you time and cost.

Search with vector databases and relevancy re-rankers

RAGs usually retrieve facts via search, and modern search engines now
leverage vector databases to efficiently retrieve relevant documents. Vector
databases store documents as embeddings in a high-dimensional space,
allowing for fast and accurate retrieval based on semantic similarity.
Multimodal embeddings can be used for images, audio, video, and more, and these media embeddings can be retrieved alongside text or multilingual embeddings.

Advanced search engines like Vertex AI Search combine semantic search and keyword search (called hybrid search) with a re-ranker that scores search results to ensure the top returned results are the most relevant. Additionally, searches perform better with a clear, focused query free of misspellings, so prior to lookup, sophisticated search engines will transform the query and fix spelling mistakes.
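A toy version of hybrid scoring is sketched below. The keyword_score() heuristic and the alpha weighting are assumptions standing in for BM25 and a learned re-ranker; this is not how Vertex AI Search works internally:

```python
import numpy as np

def keyword_score(query: str, doc: str) -> float:
    # Crude lexical overlap standing in for a BM25-style keyword score.
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / max(len(q), 1)

def hybrid_rank(query: str, docs: list[str], doc_vecs: np.ndarray,
                query_vec: np.ndarray, alpha: float = 0.5) -> list[str]:
    # Semantic score: cosine similarity (vectors assumed unit-normalized).
    sem = doc_vecs @ query_vec
    # Lexical score: keyword overlap per document.
    lex = np.array([keyword_score(query, d) for d in docs])
    # Blend the two signals and rank documents by the combined score.
    scores = alpha * sem + (1 - alpha) * lex
    return [docs[i] for i in np.argsort(scores)[::-1]]
```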

Relevance, accuracy, and quality

The retrieval mechanism in RAG is critically important. You need the best
semantic search on top of a curated knowledge base to ensure that the
retrieved information is relevant to the input query or context. If your retrieved
information is irrelevant, your generation could be grounded but off-topic or
incorrect.
By fine-tuning or prompt-engineering the LLM to generate text based entirely on the retrieved knowledge, RAG helps minimize contradictions and inconsistencies in the generated text. This significantly improves its quality and the overall user experience.

The Vertex Eval Service now scores LLM-generated text and retrieved chunks on metrics like “coherence,” “fluency,” “groundedness,” “safety,” “instruction_following,” “question_answering_quality,” and more. These metrics help you measure the quality of the grounded text you get from the LLM (for some metrics, by comparison against a ground-truth answer you have provided). Implementing these evaluations gives you a baseline measurement, and you can optimize for RAG quality by configuring your search engine, curating your source data, improving source layout parsing or chunking strategies, or refining the user’s question prior to search. A metrics-driven RAG Ops approach like this will help you hill-climb to high-quality RAG and grounded generation.
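To make the idea of a groundedness metric concrete, here is a deliberately crude sketch (not the Vertex Eval Service, which uses model-based judgments): it counts the fraction of answer sentences whose content words all appear in the retrieved chunks.

```python
def groundedness(answer: str, chunks: list[str]) -> float:
    # Toy baseline: a sentence counts as "supported" if every content
    # word (longer than 3 characters) appears somewhere in the chunks.
    source = " ".join(chunks).lower()
    sentences = [s.strip() for s in answer.split(".") if s.strip()]
    if not sentences:
        return 0.0
    supported = sum(
        all(w in source for w in s.lower().split() if len(w) > 3)
        for s in sentences
    )
    return supported / len(sentences)

score = groundedness(
    "The policy was updated in March 2024.",
    ["The policy was last updated in March 2024."],
)
```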

RAGs, agents, and chatbots

RAG and grounding can be integrated into any LLM application or agent that needs access to fresh, private, or specialized data. By accessing external information, RAG-powered chatbots and conversational agents provide more comprehensive, informative, and context-aware responses, improving the overall user experience.
Your data and your use case are what differentiate what you are building with gen AI. RAG and grounding bring your data to LLMs efficiently and scalably.
