CANCEL
Subscription
0
Your Cart
(0 item)
You have no products in your basket yet
Save more on your purchases!
Buy 2 products and get 15% off
Buy 3-4 products and get 20% off
Buy 5+ products and get 50% off
Savings automatically calculated. No voucher code required.
Checkout
Account
Sign in
New User?
Create Account
Your Account
Your Orders
Change country
United States
Great Britain
India
Germany
France
Canada
Russia
Spain
Brazil
Australia
Singapore
Canary Islands
Hungary
Ukraine
Luxembourg
Estonia
Lithuania
South Korea
Turkey
Switzerland
Colombia
Taiwan
Chile
Norway
Ecuador
Indonesia
New Zealand
Cyprus
Denmark
Finland
Poland
Malta
Czechia
Austria
Sweden
Italy
Egypt
Belgium
Portugal
Slovenia
Ireland
Romania
Greece
Argentina
Netherlands
Bulgaria
Latvia
South Africa
Malaysia
Japan
Slovakia
Philippines
Mexico
Thailand
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
SALE ENDS IN
0
Days
:
00
Hours
:
00
Minutes
:
00
Seconds
GO TO
TOP
You're reading from
RAG-Driven Generative AI
Build custom retrieval augmented generation pipelines with LlamaIndex, Deep Lake, and Pinecone
Product type
Paperback
Published in
Sep 2024
Publisher
Packt
ISBN-13
9781836200918
Length
338 pages
Edition
1st Edition
Languages
Python
Tools
Docker
Concepts
GPT/LLMs
Author (1):
Denis Rothman
Denis Rothman
Denis Rothman
Expert in AI Transformers including ChatGPT/GPT-4, Bestselling Author
Read more
View More author details
Table of Contents
(14) Chapters
Preface
Who this book is for
What this book covers
To get the most out of this book
Get in touch
1. Why Retrieval Augmented Generation?
FREE CHAPTER
What is RAG?
Naïve, advanced, and modular RAG configurations
RAG versus fine-tuning
The RAG ecosystem
Naïve, advanced, and modular RAG in code
Summary
Questions
References
Further reading
2. RAG Embedding Vector Stores with Deep Lake and OpenAI
From raw data to embeddings in vector stores
Organizing RAG in a pipeline
A RAG-driven generative AI pipeline
Building a RAG pipeline
Evaluating the output with cosine similarity
Summary
Questions
References
Further reading
3. Building Index-Based RAG with LlamaIndex, Deep Lake, and OpenAI
Why use index-based RAG?
Building a semantic search engine and generative agent for drone technology
Vector store index query engine
Tree index query engine
List index query engine
Keyword index query engine
Summary
Questions
References
Further reading
4. Multimodal Modular RAG for Drone Technology
What is multimodal modular RAG?
Building a multimodal modular RAG program for drone technology
Summary
Questions
References
Further reading
5. Boosting RAG Performance with Expert Human Feedback
Adaptive RAG
Building hybrid adaptive RAG in Python
Summary
Questions
References
Further reading
6. Scaling RAG Bank Customer Data with Pinecone
Scaling with Pinecone
Pipeline 1: Collecting and preparing the dataset
Pipeline 2: Scaling a Pinecone index (vector store)
Pipeline 3: RAG generative AI
Summary
Questions
References
Further reading
7. Building Scalable Knowledge-Graph-Based RAG with Wikipedia API and LlamaIndex
The architecture of RAG for knowledge-graph-based semantic search
Pipeline 1: Collecting and preparing the documents
Pipeline 2: Creating and populating the Deep Lake vector store
Pipeline 3: Knowledge graph index-based RAG
Summary
Questions
References
Further reading
8. Dynamic RAG with Chroma and Hugging Face Llama
The architecture of dynamic RAG
Installing the environment
Activating session time
Downloading and preparing the dataset
Embedding and upserting the data in a Chroma collection
Querying the collection
Prompt and retrieval
RAG with Llama
Total session time
Summary
Questions
References
Further reading
9. Empowering AI Models: Fine-Tuning RAG Data and Human Feedback
The architecture of fine-tuning static RAG data
Installing the environment
1. Preparing the dataset for fine-tuning
2. Fine-tuning the model
3. Using the fine-tuned OpenAI model
Metrics
Summary
Questions
References
Further reading
10. RAG for Video Stock Production with Pinecone and OpenAI
The architecture of RAG for video production
The environment of the video production ecosystem
Pipeline 1: Generator and Commentator
Pipeline 2: The Vector Store Administrator
Pipeline 3: The Video Expert
Summary
Questions
References
Further reading
11. Other Books You May Enjoy
12. Index
Appendix
References
Pinecone documentation:
https://docs.pinecone.io/guides/get-started/quickstart
OpenAI embedding and generative models:
https://platform.openai.com/docs/models
The rest of the chapter is locked
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Start free trial
Previous Section
Section 8 of 9
Next Section
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Sign up now
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Start free trial
Renews at
$19.99/month
. Cancel anytime
Authors (1)
Denis Rothman
Denis Rothman
Expert in AI Transformers including ChatGPT/GPT-4, Bestselling Author
Read more
See other products by Denis Rothman
Build Your Future-Ready Stack!
Every eBook is
$9.99
- master what's next.
SHOP NOW
Personalised recommendations for you
Based on your interests and search pattern
Modern Computer Vision with PyTorch
Read more
This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.
Read more
Jun 2024
24h 52m
Data Governance Handbook
Read more
This book provides a highly focused view of real business outcomes powered by data governance, that resonate with non-data executives such as CFOs and CEOs. You'll also find useful insights into how to implement data governance initiatives.
Read more
May 2024
13h 12m
Data Engineering with Databricks Cookbook
Read more
This book shows you how to use Apache Spark, Delta Lake, and Databricks to build data pipelines, manage and transform data, optimize performance, and more. Additionally, you'll implement DataOps and DevOps practices, and orchestrate data workflows.
Read more
May 2024
14h 36m
Azure Data Engineer Associate Certification Guide
Read more
Unlock the power of Azure data engineering with this certification guide, elevating your skills in data processing, storage, and security with the help of practical insights, hands-on exercises, and the latest advancements.
Read more
May 2024
18h 16m
Microsoft Power BI Cookbook
Read more
Microsoft Power BI is the most sought-after platform for BI professionals' visualization needs. Explore the latest Power BI features, future AI enhancements, and integration with other Power Platform tools via new recipes in this updated edition.
Read more
Jul 2024
19h 56m
Python Data Cleaning Cookbook
Read more
The book shows you how to clean, wrangle, and view data from multiple perspectives, including dataset and column attributes. You will cover common and not-so-common challenges that are faced while cleaning messy data for complex situations and learn to manipulate data to get it down to a form that can be useful for making the right decisions.
Read more
May 2024
16h 12m
Microsoft Azure AI Fundamentals AI-900 Exam Guide
Read more
This AI-900 study guide will help you prepare and practice for the certification exam. You'll delve into AI workloads, ML principles, computer vision, NLP, knowledge mining, and generative AI using Azure cloud services.
Read more
May 2024
9h 36m
Using Stable Diffusion with Python
Read more
This book shows you how to use Python to control Stable Diffusion and generate high-quality images. In addition to covering the basic usage of the diffusers package, the book provides solutions for extending the package for more advanced purposes.
Read more
Jun 2024
11h 44m
Getting Started with DuckDB
Read more
This hands-on book teaches you to analyze large datasets with blazing speed and ease. You will learn how to use DuckDB to quickly load, query, transform, analyze, and visualize data effectively through a series of practical examples.
Read more
Jun 2024
12h 44m
Databricks Certified Associate Developer for Apache Spark Using Python
Read more
This guide gets you ready for certification with expert-backed content, key exam concepts, and topic reviews. Additionally, you'll be able to make the most of Apache Spark 3.0 to modernize workloads and more using specific tools and techniques.
Read more
Jun 2024
9h 8m