RAG and LangChain

The document discusses using LangChain and OpenAI to perform retrieval question answering (RetrievalQA) on PDF documents. It covers loading documents, chunking text, storing chunks in a vector database, performing similarity search on the database, and using different 'chain types' to pass retrieved chunks to an LLM for question answering.

RAG_and_LangChain_RetrievalQA

December 7, 2023

1 Install libs
[ ]: !pip install langchain
!pip install pypdf
!pip install openai

[3]: from google.colab import userdata

openai_api_key = userdata.get('OPENAI_API_KEY')

2 Loading PDFs
[4]: from langchain.document_loaders import PyPDFLoader

# I will load this summary of the "Deep Work" book
# (https://briefer.com/books/deep-work/pdf);
# `path` is assumed to have been set to the local data directory earlier.
pdf1 = path + "Deep_Work_summary.pdf"

# and also the RAG paper, to diversify the source documents
pdf2 = "https://arxiv.org/pdf/2005.11401.pdf"

loaders = [
    # Duplicate documents on purpose - messy data
    PyPDFLoader(pdf1),
    PyPDFLoader(pdf1),
    PyPDFLoader(pdf2),
]

docs = []
for i, loader in enumerate(loaders):
    pages = loader.load()
    print(f"For doc = {i}, number of pages: {len(pages)}")
    docs.extend(pages)  # reuse the loaded pages instead of calling load() twice

print(f"length of docs {len(docs)}")

For doc = 0, number of pages: 8
For doc = 1, number of pages: 8
For doc = 2, number of pages: 19
length of docs 35

3 Chunking documents
[5]: from langchain.text_splitter import RecursiveCharacterTextSplitter

text_splitter = RecursiveCharacterTextSplitter(
    chunk_size=1500,
    chunk_overlap=150,
    separators=['. ']
)

chunks = text_splitter.split_documents(docs)
len(chunks)

[5]: 80
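
As a quick sanity check (my addition, not in the original run), you can peek at one chunk to confirm the size cap and that page metadata survived the split:

[ ]: # Each chunk is a Document: content capped near chunk_size (1500),
# with source/page metadata carried over from the PDF loader.
print(len(chunks[0].page_content))
print(chunks[0].metadata)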

4 Storing docs using Vector stores + Embeddings


[ ]: !pip install chromadb
!pip install tiktoken

[9]: from langchain.vectorstores import Chroma
from langchain.embeddings.openai import OpenAIEmbeddings

embedding = OpenAIEmbeddings(openai_api_key=openai_api_key)

[12]: persist_directory = '/content/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/chroma/'

# !rm -rf persist_directory  # remove old database files if any

vectordb = Chroma.from_documents(
    documents=chunks,
    embedding=embedding,
    persist_directory=persist_directory
)

[13]: print(vectordb._collection.count())

80
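
If you want the index to survive a runtime restart, the Chroma wrapper in this LangChain version exposes a persist() call (a small optional step, assuming this legacy API):

[ ]: # Flush the collection to disk; it can later be reloaded with
# Chroma(persist_directory=persist_directory, embedding_function=embedding)
vectordb.persist()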

5 RetrievalQA
5.1 Retriever
A simple similarity search to test our question against the vector store.

[14]: question = "What is a deep work"
docs_similarity_search = vectordb.similarity_search(question, k=3)

for doc in docs_similarity_search:
    print(doc.page_content[:200], f"==> metadata = {doc.metadata}")

All of the best, and most creative work, emerges from a state of clear
focus and careful attention. So, perhaps deep work, along with restorative
rest is just the antidote we need. Deep Work is a guid ==> metadata = {'page':
7, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}
All of the best, and most creative work, emerges from a state of clear
focus and careful attention. So, perhaps deep work, along with restorative
rest is just the antidote we need. Deep Work is a guid ==> metadata = {'page':
7, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}
We've all heard the phrase, "work smarter, not harder." It's a big
adjustment to make, because we've put so much value into working
longer hours. Just because you're spending more time at the office,
==> metadata = {'page': 4, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}
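
Note that the first two hits are identical, a consequence of loading the same PDF twice. As a preview of the fix used later, the vector store also exposes a diversity-aware search directly (a quick sketch, assuming this wrapper's max_marginal_relevance_search method):

[ ]: # MMR first fetches fetch_k candidates by similarity, then picks k of
# them that are both relevant and mutually diverse, dropping duplicates.
docs_mmr = vectordb.max_marginal_relevance_search(question, k=3, fetch_k=6)
for doc in docs_mmr:
    print(doc.page_content[:100], doc.metadata)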

5.2 Initialize LLM using GPT-3.5-Turbo


We initialize the LLM that we'll use to answer the question.
[18]: from langchain.chat_models import ChatOpenAI
llm = ChatOpenAI(model_name='gpt-3.5-turbo', openai_api_key=openai_api_key)
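
One optional tweak (my addition): setting temperature=0 makes the answers more deterministic, which helps when comparing chain types below:

[ ]: # temperature=0 removes most sampling randomness, so reruns of the
# same query over the same chunks give near-identical answers.
llm = ChatOpenAI(model_name='gpt-3.5-turbo', temperature=0,
                 openai_api_key=openai_api_key)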

5.3 RetrievalQA chain: include chunks in the context window for QA

This chain performs question answering by retrieving chunks from the vector store and passing them to the LLM.
There are different ways to send (chain) the retrieved docs to the LLM, set via chain_type:
• stuff : the base chain
• map_reduce
• refine
• map_rerank
The supported chain types: dict_keys(['stuff', 'map_reduce', 'refine', 'map_rerank'])
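
For reference, these names come from the question-answering chain loader that RetrievalQA delegates to; you can also build such a chain directly (a minimal sketch of that lower-level API):

[ ]: from langchain.chains.question_answering import load_qa_chain

# Build only the combine-documents chain; RetrievalQA wraps one of
# these together with a retriever.
qa = load_qa_chain(llm, chain_type="stuff")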

5.4 1- Base chain: Include the whole context in the query to the LLM
By default, the chain type is "stuff".
It processes a list of documents (in our case 4) by combining them into a single prompt and then
submits that combined prompt to a language model.
It’s well-suited for applications where documents are small.

[16]: from langchain.chains import RetrievalQA

Base retriever
[19]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    return_source_documents=True,
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to a state of focused and uninterrupted concentration on a
cognitively demanding task. It is a term coined by author and professor Cal
Newport in his book "Deep Work: Rules for Focused Success in a Distracted
World." Deep work involves eliminating distractions, such as social media or
constant interruptions, and dedicating uninterrupted time to work on tasks that
require intense focus and cognitive effort. The goal of deep work is to maximize
productivity, creativity, and the quality of work output.
If you take a closer look at the result object, we have 3 keys: query, result, and source_documents (which contains the context from the retriever).
[20]: # A closer look at "source_documents" ==> there are 4 documents
for doc in result['source_documents']:
    print(doc.page_content[:200], f"==> metadata = {doc.metadata}\n")

All of the best, and most creative work, emerges from a state of clear
focus and careful attention. So, perhaps deep work, along with restorative
rest is just the antidote we need. Deep Work is a guid ==> metadata = {'page':
7, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

All of the best, and most creative work, emerges from a state of clear
focus and careful attention. So, perhaps deep work, along with restorative
rest is just the antidote we need. Deep Work is a guid ==> metadata = {'page':
7, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

We've all heard the phrase, "work smarter, not harder." It's a big
adjustment to make, because we've put so much value into working
longer hours. Just because you're spending more time at the office,
==> metadata = {'page': 4, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

We've all heard the phrase, "work smarter, not harder." It's a big
adjustment to make, because we've put so much value into working
longer hours. Just because you're spending more time at the office,
==> metadata = {'page': 4, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

One can see that there are redundant documents that you don't want to pass to the LLM, since you pay per token. We can avoid this by using the MMR retriever, as explained in the previous notebook, which gives more diversified chunks to use in the context.

MMR retriever
[21]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(search_type="mmr"),
    return_source_documents=True,
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to a state of focused and uninterrupted concentration on a
cognitively demanding task. It is the ability to work in a state of flow, where
one can fully immerse themselves in their work and produce high-quality,
valuable output. Deep work requires eliminating distractions, such as social
media or interruptions, and dedicating uninterrupted time to engage in intense
cognitive activities. It is contrasted with shallow work, which consists of low-
value, easily replicable tasks that can be done while distracted. Deep work is
considered crucial for producing meaningful and impactful work.

[22]: # A closer look at "source_documents" ==> again 4 documents, now more diverse
for doc in result['source_documents']:
    print(doc.page_content[:200], f"==> metadata = {doc.metadata}\n")

All of the best, and most creative work, emerges from a state of clear
focus and careful attention. So, perhaps deep work, along with restorative
rest is just the antidote we need. Deep Work is a guid ==> metadata = {'page':
7, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

Cal Newport, Associate Professor in computer science, popular author,
and social media avoider, delves into the world of work, focus, and
productivity. By distinguishing the two fundamental types of w ==> metadata =
{'page': 1, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

. This kind of work
means we don't create anything of value. So why is it that we gravitate
towards shallow work?
The truth is that shallow work is easy, and deep work is difficult.
Furthermore, shall ==> metadata = {'page': 1, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

. Living in the digital age means that
we're hyper-connected, but ironically, this can disconnect us from
completing the essential tasks at hand. ==> metadata = {'page': 2, 'source':
'/drive/MyDrive/02-Articles_ChatGPT/03_notebooks/data/Deep_Work_summary.pdf'}

I'll let you compare the LLM's answers from the two retrievers yourself: result['result']

[33]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(search_type="mmr"),
    return_source_documents=True,
    chain_type="stuff"
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to the ability to focus without distraction on a cognitively
demanding task. It is a state of flow where you can fully immerse yourself in
your work and produce high-quality and valuable output. Deep work requires
extended periods of uninterrupted concentration and intense focus, allowing you
to push your cognitive abilities to their limits. Unlike shallow work, which
consists of mundane and easily replicable tasks, deep work involves tackling
complex problems, generating new ideas, and producing meaningful work that
requires deep thinking and creativity.
Base chain:
Deep work refers to a state of focused and uninterrupted concentration on a cognitively demanding
task. It is the ability to work in a state of flow, where one can fully immerse themselves in their
work and produce high-quality, valuable output. Deep work requires eliminating distractions, such
as social media or interruptions, and dedicating uninterrupted time to engage in intense cognitive
activities. It is contrasted with shallow work, which consists of low-value, easily replicable tasks
that can be done while distracted. Deep work is considered crucial for producing meaningful and
impactful work.
Stuff chain:
Deep work refers to the ability to focus without distraction on a cognitively demanding task. It
is a state of flow where you can fully immerse yourself in your work and produce high-quality and
valuable output. Deep work requires extended periods of uninterrupted concentration and intense
focus, allowing you to push your cognitive abilities to their limits. Unlike shallow work, which
consists of mundane and easily replicable tasks, deep work involves tackling complex problems,
generating new ideas, and producing meaningful work that requires deep thinking and creativity.
==> we get almost the same results, as expected, since "stuff" is the default chain type

5.5 2- Map-reduce chain

Each individual chunk is sent to the LLM to get a base answer. Then those answers are combined to get the final answer.
As seen above, the retriever returns 4 source documents each time.
So RetrievalQA with map_reduce will make 4 calls to the OpenAI model, one call per document.
Then it gathers the 4 intermediate answers and makes a final call:
**Inputs:**

**System**: Given the following extracted parts of a long document and a question, create a final answer.
If you don't know the answer, just say that you don't know. Don't try to make up an answer.

*********************
<summary of question made to doc 1>
<summary of question made to doc 2>
<summary of question made to doc 3>
<summary of question made to doc 4>
*********************

**Human**: What is a deep work

Then we get the model output:
**ASSISTANT**: There is no clear answer to this question....
[3]: from IPython import display
# Diagram of the map_reduce chain (source: LangChain documentation);
# image not reproduced here, `path_image` points to a local copy.
display.Image(path_image)
[25]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    chain_type="map_reduce"
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to a state of focused and uninterrupted concentration on a
cognitively demanding task. It involves working on a task without any
distractions or interruptions, allowing for maximum productivity and high-
quality output. Deep work requires a state of flow, where the individual is
fully immersed in the task at hand and able to work at their highest level of
cognitive ability. This type of work is often associated with creativity,
problem-solving, and producing high-value work.
In the result object there are no source_documents this time, only the answer (we did not pass return_source_documents=True):
[26]: result

[26]: {'query': 'What is a deep work',
'result': 'Deep work refers to a state of focused and uninterrupted
concentration on a cognitively demanding task. It involves working on a task
without any distractions or interruptions, allowing for maximum productivity and
high-quality output. Deep work requires a state of flow, where the individual is
fully immersed in the task at hand and able to work at their highest level of
cognitive ability. This type of work is often associated with creativity,
problem-solving, and producing high-value work.'}

[27]: qa_chain_mr = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(search_type="mmr"),
    chain_type="map_reduce"
)

question = "What is a deep work"

result = qa_chain_mr({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to the ability to focus without distraction on a cognitively
demanding task. It is a state of flow where one can fully engage in meaningful
work, free from interruptions and distractions. Deep work requires intense
concentration and can lead to high-quality outputs and significant progress in
one's work.
Cons of map_reduce ==>
When using map_reduce, since we send each chunk separately to the LLM, there is a possibility that the answer to our question is split between 2 different chunks (the end of one chunk and the beginning of another). This could result in the LLM being unable to find a relevant answer, leading to responses like "I don't know"… One mitigation is sketched below.
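
A simple mitigation (my suggestion, not from the original notebook) is to retrieve more chunks, so that, combined with the 150-character chunk_overlap configured in the splitter, an answer straddling a boundary still appears whole in at least one chunk:

[ ]: # Retrieve 6 chunks instead of the default 4; with the splitter's
# overlap this lowers the odds that an answer is cut across chunks.
qa_chain_mr = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(search_kwargs={"k": 6}),
    chain_type="map_reduce"
)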

5.6 3- Refine chain


We can improve on map-reduce results by using another chain type, "refine", which makes sequential calls to the OpenAI API.
With this chain, we also call the OpenAI Chat API 4 times, but in a different way from map_reduce.
Each time we call the LLM, we give it:
• the current document,
• the LLM's answer from the previous call (on the previous document),
• a prompt template adapted to explicitly ask the LLM to refine the answer with the new context (the current document).
Here are the steps:
FIRST CALL
SYSTEM: Context information is below. ****<doc1>****
Given the context information and not prior knowledge, answer any questions.
HUMAN: "What is a deep work"

Model output:
ASSISTANT: answer1

SECOND CALL: a second sequence of messages that contains the former answer from the model:
HUMAN: "What is a deep work"
AI (could be assistant role): answer1
HUMAN (could be system role): We have the opportunity to refine the existing answer (only if needed) with some more context below.
****<doc2>****
Given the new context, refine the original answer to better answer the question. If the context isn't useful, return the original answer.

Model output:
ASSISTANT: answer2

THIRD CALL: a third sequence of messages that contains the former answer from the model:
HUMAN: "What is a deep work"
AI (could be assistant role): answer2
HUMAN (could be system role): We have the opportunity to refine the existing answer (only if needed) with some more context below.
****<doc3>****
Given the new context, refine the original answer to better answer the question. If the context isn't useful, return the original answer.

Model output:
ASSISTANT: answer3

[5]: from IPython import display
# Diagram of the refine chain (source: LangChain documentation);
# image not reproduced here, `path_image` points to a local copy.
display.Image(path_image)
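
If you want to customize those refine prompts, chain_type_kwargs lets you pass your own (a sketch, assuming this legacy refine chain accepts question_prompt and refine_prompt; the templates here are illustrative, not LangChain's defaults):

[ ]: from langchain.prompts import PromptTemplate

# Illustrative templates; the refine chain fills {context_str},
# {question} and {existing_answer} at each step.
question_prompt = PromptTemplate.from_template(
    "Context:\n{context_str}\nAnswer the question: {question}"
)
refine_prompt = PromptTemplate.from_template(
    "Question: {question}\nExisting answer: {existing_answer}\n"
    "Refine it with this new context if useful:\n{context_str}"
)

qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    chain_type="refine",
    chain_type_kwargs={
        "question_prompt": question_prompt,
        "refine_prompt": refine_prompt,
    },
)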

[28]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    chain_type="refine"
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work, as described by Cal Newport in his book "Deep Work," is a concept
that emphasizes the importance of focused attention and eliminating distractions
to produce high-quality and creative work. It encourages individuals to work
smarter rather than harder by prioritizing deep, concentrated work over shallow,
easily interruptible tasks. Newport provides practical tips to boost focus and
productivity, such as making deep work a routine, scheduling dedicated time for
it, finding a distraction-free environment, and practicing digital minimalism.
By incorporating deep work into their routine and creating a dedicated space,
individuals can enhance their ability to produce meaningful work and maximize
their output.
Refine gives a better answer than map_reduce. This is because each call incorporates the answer from the previous context, which carries information through the chain.

5.7 4- Map rerank

With map_rerank, we also call the LLM multiple times (once per document). The difference from the other methods is that the prompt asks the model both to answer the question and to score its answer ("How certain is the LLM in its answer?"). The answer with the highest score is then returned as the final answer.
[7]: from IPython import display
# Diagram of the map_rerank chain (source: LangChain documentation);
# image not reproduced here, `path_image` points to a local copy.
display.Image(path_image)

[32]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    chain_type="map_rerank"
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

/usr/local/lib/python3.10/dist-packages/langchain/chains/llm.py:344:
UserWarning: The apply_and_parse method is deprecated, instead pass an output
parser directly to LLMChain.
warnings.warn(
Answer:
Deep Work is a guide that helps individuals regain control of their time,
eliminate distractions, and improve their overall focus. It emphasizes the
importance of clear focus and careful attention in producing the best and most
creative work. Deep Work suggests that by practicing deep work and incorporating
restorative rest, individuals can enhance their ability to do meaningful work.
The book emphasizes that focus, not time, is the key to accomplishing important
tasks.
Here are the different results:
map_reduce: base retriever:
Deep work refers to a state of focused and uninterrupted concentration on a cognitively demanding
task. It involves working on a task without any distractions or interruptions, allowing for maximum
productivity and high-quality output. Deep work requires a state of flow, where the individual is
fully immersed in the task at hand and able to work at their highest level of cognitive ability. This
type of work is often associated with creativity, problem-solving, and producing high-value work.
map_reduce: MMR:
Deep work refers to the ability to focus without distraction on a cognitively demanding task. It
is a state of flow where one can fully engage in meaningful work, free from interruptions and
distractions. Deep work requires intense concentration and can lead to high-quality outputs and
significant progress in one’s work.
refine:
Deep work, as described by Cal Newport in his book “Deep Work,” is a concept that emphasizes the
importance of focused attention and eliminating distractions to produce high-quality and creative
work. It encourages individuals to work smarter rather than harder by prioritizing deep, concen-
trated work over shallow, easily interruptible tasks. Newport provides practical tips to boost focus
and productivity, such as making deep work a routine, scheduling dedicated time for it, finding
a distraction-free environment, and practicing digital minimalism. By incorporating deep work
into their routine and creating a dedicated space, individuals can enhance their ability to produce
meaningful work and maximize their output.

map_rerank:
Deep Work is a guide that helps individuals regain control of their time, eliminate distractions, and
improve their overall focus. It emphasizes the importance of clear focus and careful attention in
producing the best and most creative work. Deep Work suggests that by practicing deep work and
incorporating restorative rest, individuals can enhance their ability to do meaningful work. The
book emphasizes that focus, not time, is the key to accomplishing important tasks.

6 Prompt Template: Under the hood


LangChain uses a prompt that combines the question with the context retrieved from the vector store. Here is an example of how to use your own prompt with RetrievalQA.
[23]: from langchain.prompts import PromptTemplate

template = """Use the provided context to respond to the question posed at the end.
If you're unsure of the answer, please feel free to acknowledge that you don't know rather than attempting to provide a fabricated response.
Please provide a brief and concise response.
{context}
Question: {question}
Helpful Answer:"""
QA_CHAIN_PROMPT = PromptTemplate.from_template(template)
QA_CHAIN_PROMPT

[23]: PromptTemplate(input_variables=['context', 'question'], template="Use the
provided context to respond to the question posed at the end. \nIf you're unsure
of the answer, please feel free to acknowledge that you don't know rather than
attempting to provide a fabricated response.\nPlease provide a brief and concise
response.\n{context}\nQuestion: {question}\nHelpful Answer:")

Use this template to ask questions to the LLM:


[24]: qa_chain = RetrievalQA.from_chain_type(
    llm,
    retriever=vectordb.as_retriever(),
    return_source_documents=True,
    chain_type_kwargs={"prompt": QA_CHAIN_PROMPT}
)

question = "What is a deep work"

result = qa_chain({"query": question})
print(f"Answer:\n {result['result']}")

Answer:
Deep work refers to a state of focused and uninterrupted work that allows for
maximum productivity and creativity. It involves eliminating distractions and
dedicating substantial time and effort to tasks that require deep concentration
and attention.
You can see that the answer is more concise than the other examples.
