Semantic Search in AI Documents

This document presents a Python program that uses the sentence-transformers library (the all-MiniLM-L6-v2 model) to run semantic search over a small set of AI-related documents. It defines one function that embeds the documents and another that ranks them against a query by cosine similarity, returning the top matches. An example query demonstrates the program.

Uploaded by

venkat Mohan

PROGRAM:

!pip install sentence-transformers scikit-learn

from sentence_transformers import SentenceTransformer
from sklearn.metrics.pairwise import cosine_similarity
import numpy as np

# Load the pre-trained model
model = SentenceTransformer('all-MiniLM-L6-v2')

# Example AI lab documents (you can replace these with your own data)
documents = [
    "Machine learning models can be trained to recognize patterns in data.",
    "AI research focuses on building intelligent systems that mimic human behavior.",
    "The field of artificial intelligence has seen major advances in recent years.",
    "Deep learning involves training neural networks on large datasets."
]

# Function to embed a list of documents
def embed_documents(documents):
    return model.encode(documents)

# Embedding the documents
document_embeddings = embed_documents(documents)

# Function to perform semantic search given a query
# (looks up the matching text in the module-level `documents` list)
def semantic_search(query, document_embeddings, top_k=3):
    # Embed the query; encoding a one-item list yields a 2-D array of shape (1, dim)
    query_embedding = model.encode([query])

    # Compute cosine similarities between the query and all document embeddings
    cosine_similarities = cosine_similarity(query_embedding, document_embeddings)

    # Get indices of the top_k most similar documents, highest score first
    top_indices = cosine_similarities[0].argsort()[-top_k:][::-1]

    # Return the top_k most similar documents with their scores
    return [(documents[i], cosine_similarities[0][i]) for i in top_indices]

# Example query
query = "What are the recent advancements in AI?"

# Perform semantic search
top_documents = semantic_search(query, document_embeddings)

# Print the results
print("Top similar documents for the query:", query)
for doc, score in top_documents:
    print(f"Score: {score:.4f}, Document: {doc}")
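The ranking step inside semantic_search can be tried without downloading a model. The sketch below is an illustration and not part of the original program: it recomputes cosine similarity with plain NumPy and applies the same argsort slicing to pick the top matches. The helper names (cosine_similarity_matrix, top_k_indices) and the 2-D toy vectors are hypothetical stand-ins for real sentence embeddings.

```python
import numpy as np

def cosine_similarity_matrix(a, b):
    """Cosine similarity between every row of a and every row of b."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    # Scale each row to unit length; the dot product of unit vectors is the cosine
    a_unit = a / np.linalg.norm(a, axis=1, keepdims=True)
    b_unit = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a_unit @ b_unit.T

def top_k_indices(scores, top_k=3):
    """Indices of the top_k largest scores, best first (same slicing as the program)."""
    return np.argsort(scores)[-top_k:][::-1]

# Hypothetical 2-D "embeddings" standing in for model output
toy_docs = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]])
toy_query = np.array([[2.0, 0.0]])  # shape (1, dim), like an encoded one-item list

scores = cosine_similarity_matrix(toy_query, toy_docs)[0]
ranked = top_k_indices(scores, top_k=3)

for i in ranked:
    print(f"doc {i}: {scores[i]:.4f}")
```

Because cosine similarity ignores vector length, the query [2, 0] matches [1, 0] perfectly even though the two differ in magnitude; the vector pointing the opposite way scores lowest and falls outside the top three.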

OUTPUT:
