
MEDIA CAPTION GENERATOR USING DEEP LEARNING TECHNIQUES

RAHUL N MANESH
SARANG C SANTHOSH
SHYAMKUMAR S
SREEHARI E S
CONTENTS
INTRODUCTION
OBJECTIVES
ARCHITECTURE
METHODOLOGY
ALGORITHMS USED
MODEL
FLOWCHART
RESULTS
TASK IDENTIFICATION & ALLOCATION
CONCLUSION
INTRODUCTION
In modern communication, images and videos are key tools for conveying messages and
narratives effectively.

Descriptive captions greatly improve accessibility, aiding visually impaired individuals and
diverse audiences.

The media caption generator uses deep learning techniques to automatically craft contextually
appropriate captions.

Captions are presented as text overlays and converted into audio files for a comprehensive
accessibility approach.
OBJECTIVES
Detailed Image Descriptions

Spatial Awareness

Object Recognition

Affordability and Accessibility

Emphasis on Audio Cues


LITERATURE SURVEY
https://www.freecodecamp.org/news/building-an-image-caption-generator-with-deep-learning-in-tensorflow-a142722e9b1f/
ARCHITECTURE
METHODOLOGY
MODULE 1: DATA COLLECTION AND PREPROCESSING -

Data cleaning: replace '-' with ' ' in the caption text.
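
A minimal cleaning sketch (assuming the captions are plain Python strings; the helper name and the extra lowercasing/punctuation steps are illustrative, not necessarily the project's exact pipeline):

import string

def clean_caption(caption):
    # Replace '-' with ' '; lowercasing and punctuation removal are assumed extra steps
    caption = caption.replace('-', ' ').lower()
    caption = caption.translate(str.maketrans('', '', string.punctuation))
    return ' '.join(w for w in caption.split() if w.isalpha())

print(clean_caption("A black-and-white dog jumps over a log."))
# -> a black and white dog jumps over a log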

MODULE 2: FEATURE EXTRACTION -

The most informative features are extracted from each image by selecting and combining variables
into features, effectively reducing the amount of data: each image is converted into a fixed-length
informative vector using a CNN.
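
A sketch of this feature extraction step with a pretrained CNN (assuming Keras/TensorFlow and InceptionV3, which gives a 2048-dimensional vector per image; the exact backbone used in the project may differ):

import numpy as np
from tensorflow.keras.applications.inception_v3 import InceptionV3, preprocess_input
from tensorflow.keras.preprocessing import image
from tensorflow.keras.models import Model

# Pretrained ImageNet weights; drop the classification head, keep the pooled features
base = InceptionV3(weights='imagenet')
encoder = Model(inputs=base.input, outputs=base.layers[-2].output)

def extract_features(img_path):
    img = image.load_img(img_path, target_size=(299, 299))
    x = preprocess_input(np.expand_dims(image.img_to_array(img), axis=0))
    return encoder.predict(x, verbose=0).flatten()  # fixed-length vector, shape (2048,)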

MODULE 3: LOADING THE TRAINING SET AND DATA GENERATOR MODEL -

To train the model, we use the 6000+ training images, generating the input and output sequences
in batches.
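
A sketch of the batch generator (assuming precomputed image features, a fitted Keras Tokenizer, and a fixed maximum caption length; the names and batch size are illustrative):

import numpy as np
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.utils import to_categorical

def data_generator(captions, features, tokenizer, max_len, vocab_size, batch_size=64):
    X_img, X_seq, y = [], [], []
    while True:
        for img_id, caps in captions.items():
            for cap in caps:
                seq = tokenizer.texts_to_sequences([cap])[0]
                # Each caption yields several (prefix -> next word) training pairs
                for i in range(1, len(seq)):
                    X_img.append(features[img_id])
                    X_seq.append(pad_sequences([seq[:i]], maxlen=max_len)[0])
                    y.append(to_categorical(seq[i], num_classes=vocab_size))
                    if len(y) == batch_size:
                        yield [np.array(X_img), np.array(X_seq)], np.array(y)
                        X_img, X_seq, y = [], [], []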
MODULE 4: TESTING THE MODEL AND EVALUATING -

The model is trained and caption predictions are generated. The image captioning model is
evaluated using metrics such as BLEU.
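
A sketch of BLEU scoring with NLTK (the tokenized reference and generated captions below are made-up examples):

from nltk.translate.bleu_score import corpus_bleu

# One list of tokenized reference captions per test image, one hypothesis per image
references = [[['a', 'dog', 'runs', 'on', 'the', 'beach']]]
hypotheses = [['a', 'dog', 'is', 'running', 'on', 'the', 'beach']]

bleu1 = corpus_bleu(references, hypotheses, weights=(1.0, 0, 0, 0))
bleu2 = corpus_bleu(references, hypotheses, weights=(0.5, 0.5, 0, 0))
print('BLEU-1: %.3f  BLEU-2: %.3f' % (bleu1, bleu2))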

MODULE 5: TEXT TO VOICE -

Finally, the generated captions are converted to speech using the Python library gTTS
(Google Text-to-Speech).
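
A minimal gTTS sketch (the caption string and output file name are placeholders):

from gtts import gTTS

caption = "a dog is running on the beach"
tts = gTTS(text=caption, lang='en')
tts.save("caption.mp3")  # audio file served alongside the on-screen caption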
ALGORITHMS USED
1. CNN (Convolutional Neural Network) -
A Convolutional Neural Network (ConvNet/CNN) is a deep learning algorithm that takes an input
image, assigns importance (learnable weights and biases) to various aspects and objects in the
image, and differentiates one from another.

2. LSTM (Long Short-Term Memory) -


Long Short-Term Memory (LSTM) networks are a type of recurrent neural network
capable of learning order dependence in sequence prediction problems. This is a
behavior required in complex problem domains like machine translation, speech
recognition, and more.
MODEL
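
A minimal sketch of a merge-style captioning model in Keras, combining the CNN image feature with an LSTM over the partial caption (the vocabulary size, maximum caption length, feature dimension, and layer sizes below are assumptions, not the project's exact settings):

from tensorflow.keras.layers import Input, Dense, Dropout, Embedding, LSTM, add
from tensorflow.keras.models import Model

vocab_size, max_len, feat_dim = 8000, 34, 2048  # illustrative values

# Image branch: compress the CNN feature vector to the decoder dimension
img_in = Input(shape=(feat_dim,))
img_vec = Dense(256, activation='relu')(Dropout(0.5)(img_in))

# Text branch: embed the partial caption and run it through an LSTM
seq_in = Input(shape=(max_len,))
seq_vec = LSTM(256)(Dropout(0.5)(Embedding(vocab_size, 256, mask_zero=True)(seq_in)))

# Merge the two branches and predict the next word
decoder = Dense(256, activation='relu')(add([img_vec, seq_vec]))
out = Dense(vocab_size, activation='softmax')(decoder)

model = Model(inputs=[img_in, seq_in], outputs=out)
model.compile(loss='categorical_crossentropy', optimizer='adam')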
FLOWCHART
RESULTS
TASK IDENTIFICATION & ALLOCATION

SARANG C SANTHOSH - BACKEND DEVELOPMENT

SREEHARI E S - FRONTEND DEVELOPMENT

SHYAMKUMAR S - QA AND DOCUMENTATION

RAHUL N MANESH - QA AND DOCUMENTATION


CONCLUSION

Real-Time Accessibility

Continuous Improvement

Fostering Independence
THANK YOU
