Synopsis SEM4
Snehal Barkale 02
Shivraj Chavan 06
Omkar Chendge 07
University of Mumbai
(AY 2023-24)
CERTIFICATE
This is to certify that the Mini Project entitled “Voice Assistant” is a bonafide
University of Mumbai.
This Mini Project entitled “Voice Assistant” by Snehal Barkale (02), Shivraj
Chavan (06), and Omkar Chendge (07) is approved for S.E. (Artificial Intelligence
& Machine Learning Engineering) for the academic year 2023-24.
Examiners
1………………………………………
(Internal Examiner Name & Sign)
2…………………………………………
(External Examiner name & Sign)
Date:
Place:
Contents
Abstract
Acknowledgments
List of Figures
1 Introduction
1.1 Introduction
1.2 Motivation
1.3 Problem Statement & Objectives
1.4 Organization of the Report
2 Literature Survey
3 Proposed System
3.1 Introduction
3.2 Architecture/ Framework
3.3 Algorithm and Process Design
3.4 Details of Hardware & Software
3.5 Experiment and Results for Validation and Verification
3.6 Analysis
3.7 Conclusion and Future Work
References
4 Annexure
Abstract
Virtual assistants are designed to mimic human interactions, enabling users to engage
in natural conversations with these digital entities. They can perform a wide range of
tasks, including setting reminders, scheduling appointments, answering questions,
managing emails, and even controlling smart home devices. Their adaptability and
versatility make them an indispensable tool for both individual users and businesses.
Virtual assistants are also becoming increasingly integrated into various devices and
platforms, including smartphones, smart speakers, and chatbots. They can understand
user preferences and tailor responses to specific needs, which fosters a more user-
centric experience. This abstract delves into the technologies that power virtual
assistants, including machine learning, deep learning, and data analytics, which enable
them to continuously improve their performance and expand their capabilities.
Acknowledgment
I would like to express my sincere gratitude and appreciation to all those who have
supported and guided me throughout the completion of this college project. Without the
assistance and encouragement of our guide, Prof. Sneha Ingale, and our HOD,
Dr. Renuka Deshpande, this project would not have been possible.
I express my gratitude to our principal, Dr. Pramod Rodge, for inspiring and
supporting us during our project.
I would also like to thank my college faculty for their invaluable guidance and
mentorship. Their expertise and dedication to the subject matter have been
instrumental in shaping the direction of this project.
In conclusion, I acknowledge all those who have played a part, big or small, in the
successful completion of this college project. Your contributions have enriched the
quality of this work, and I am truly grateful for your support.
List of figures
3.2 Architecture
3.3 Algorithm
List of Abbreviations
1.1 Introduction
Virtual assistants come in various forms and are often tailored to specific applications
and ecosystems. Some of the most popular virtual assistants include Apple's Siri,
Amazon's Alexa, Google Assistant, and Microsoft's Cortana. These platforms can
perform a multitude of tasks, such as answering questions, setting reminders, sending
messages, playing music, providing directions, and controlling smart home devices.
1.2 Motivation
The idea behind this project is a web-based program built for user convenience. I first
saw an advertisement for Alexa in late 2019. I do not remember it exactly, but it showed
a woman giving a voice command to a device, and the device responded and carried out
the task. I found it fascinating, and that day I dreamt of building something like it
one day.
Outside the project, when I studied the history of voice assistants, I came across
Dr. Raj Reddy from Andhra Pradesh, who had a deep interest in AI and speech recognition
technology. He started working on AI in the 1960s, an era when a computer had only
4 KB of storage. After graduation, Dr. Reddy pursued his PhD under John McCarthy.
The lab at that university had only one computer, and it was free only at night, so
Dr. Reddy worked on speech recognition at night after his day's work. After some time
of continuous work, the computer started recognizing vowels.
This story is the motivation for our project.
1.3 Problem Statement & Objectives
To create a virtual voice assistant that users can rely on to interact with new
technologies, control their devices, and apply this technology for educational and
business purposes and for future development.
1.4 Organization of Report
The voice assistant will be a program in which the user talks to the assistant by voice,
and the assistant provides the information the user asks for through Chrome, YouTube,
Wikipedia, etc. The user can also ask for the current time, and the assistant will
speak it. The voice assistant can be accessed online. The system will be able to
analyze a problem and find a solution or information for it. The proposed system will
provide a user interface that the user can both see and hear. To create this system we
will use Visual Studio. It will be written in Python, and MySQL will be the database.
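The behaviour described above can be sketched as a small command router. This is only an illustration, not the project's actual code: the function name `handle_query`, the keyword rules, and the search URL templates are assumptions, and the spoken input is represented by an already transcribed text string.

```python
from datetime import datetime
from urllib.parse import quote_plus

# Hypothetical URL templates; the real project may open these sites differently.
SEARCH_URLS = {
    "youtube": "https://www.youtube.com/results?search_query={q}",
    "wikipedia": "https://en.wikipedia.org/wiki/Special:Search?search={q}",
    "google": "https://www.google.com/search?q={q}",
}


def handle_query(query, now=None):
    """Return (spoken_reply, url_to_open_or_None) for a transcribed query."""
    text = query.lower().strip()
    if "time" in text:
        now = now or datetime.now()
        return "The time is " + now.strftime("%H:%M"), None
    for site, template in SEARCH_URLS.items():
        if site in text:
            # Everything except the site name and the word "search" is the topic.
            topic = text.replace(site, "").replace("search", "").strip()
            reply = f"Searching {site} for {topic}"
            return reply, template.format(q=quote_plus(topic))
    return "Sorry, I did not understand that.", None
```

When a URL is returned, it could be passed to `webbrowser.open(url)` from the standard library to open it in the default browser, and the reply string could be handed to the text-to-speech engine.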
2
Literature Survey
1. Unintelligible Words
It can be very difficult for voice assistants to comprehend spoken words that are not
in their dictionary, though this has been improving over time.
When you are doing a voice search, you expect to get the correct result immediately.
However, if the voice assistant does not understand what you’re asking for, you will
have to repeat yourself or put in your search query manually.
Voice assistants are good at providing factual information, but some of them give poor
answers because they misinterpret or misidentify a query. The problem is especially
prominent when the assistant misidentifies the source of the voice command.
Even though it’s good that you can use voice assistants to make your life a little bit
easier, there are times when they can be counterproductive when trying to have a
conversation with someone. This is especially true when you’re talking to someone
face-to-face.
The major players in the voice assistant sphere are Siri, Google Assistant, and Alexa.
These assistants are still far from perfect, which is why it’s amazing that they were
integrated into so many different types of devices. However, despite the fact that there
are many devices and applications that are compatible with voice assistants, there are
still numerous problems with software integration.
2.3 Mini project contribution
The voice assistant has been developed for educational, business, and personal use.
It addresses the objectives and scope stated for this project, including the following:
It can open Chrome, YouTube, Wikipedia, etc. to search for information and read 2 or 3
lines to the user. It can open a presentation. It can tell the current time. It can
send mails, play music, etc.
3
Proposed system
3.1 Introduction
A virtual assistant is a software program that eases day-to-day tasks such as showing
weather forecasts, listing reminders, and creating to-do lists. It can take commands as
voice or text. A voice-based intelligent assistant needs an invoking word, or wake
word, to activate the listener, followed by the command. For our project the wake word
is “MANAV”. Our voice assistant is designed to be used efficiently by all users.
This personal assistant software improves users' productivity by managing day-to-day
tasks and providing information from online sources.
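The wake-word step can be illustrated on already transcribed text. This is a minimal sketch under stated assumptions: the helper name `extract_command` is hypothetical, and a real listener would check the audio stream continuously rather than a finished string.

```python
WAKE_WORD = "manav"  # wake word named in this report


def extract_command(transcript, wake_word=WAKE_WORD):
    """Return the command that follows the wake word, or None if it is absent."""
    words = transcript.lower().replace(",", " ").split()
    if wake_word not in words:
        return None  # no wake word heard, so the listener stays idle
    start = words.index(wake_word) + 1
    return " ".join(words[start:])
```

For example, `extract_command("Manav, what is the time")` yields the command text after the wake word, while a sentence without “MANAV” is ignored.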
3.2 Architecture/ Framework
• Audio Input/Output:
• Speech Recognition:
o Intent Recognition: Identify the user's intent from the transcribed text.
This involves understanding what the user wants to do or know.
o Named Entity Recognition (NER): Identify important entities like dates,
locations, and proper nouns in the user's command.
o Dialog Management: Keep track of the conversation context, including
the user's previous requests and responses.
• Knowledge Base and Data Sources:
o Use the NLP results and the context to generate a meaningful response.
o Choose the platform for your voice assistant. It can be a mobile app, a
web-based interface, a smart speaker, or a custom hardware device.
The diagram shows the main process flow of how Voice Assistant works.
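The intent recognition, named entity recognition, and dialog management components listed above can be sketched with simple rules. Everything here is an assumption made for illustration: the intent names, the keyword table, the time pattern, and the `DialogState` class are hypothetical, and a production assistant would normally use trained models instead of keyword matching.

```python
import re
from dataclasses import dataclass, field

# Hypothetical keyword rules standing in for a trained intent classifier.
INTENT_KEYWORDS = {
    "set_reminder": ("remind", "reminder"),
    "get_weather": ("weather", "forecast"),
    "get_time": ("time",),
}
TIME_PATTERN = re.compile(r"\b(\d{1,2})(?::(\d{2}))?\s*(am|pm)\b", re.IGNORECASE)


def recognize_intent(text):
    """Return the first intent whose keywords appear in the text."""
    lowered = text.lower()
    for intent, keywords in INTENT_KEYWORDS.items():
        if any(keyword in lowered for keyword in keywords):
            return intent
    return "unknown"


def extract_entities(text):
    """Very small stand-in for NER: pulls out clock times such as '3 PM'."""
    match = TIME_PATTERN.search(text)
    return {"time": match.group(0)} if match else {}


@dataclass
class DialogState:
    """Keeps the conversation context as a list of processed turns."""
    history: list = field(default_factory=list)

    def update(self, text):
        turn = {
            "text": text,
            "intent": recognize_intent(text),
            "entities": extract_entities(text),
        }
        self.history.append(turn)
        return turn
```

Keeping each turn in `history` is what allows a later request to refer back to an earlier one, which is the role the dialog management component plays in the list above.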
3.3 Algorithm and Process Design
Creating a flowchart for a voice assistant project is a helpful way to visualize and plan
how the assistant will respond to user inputs and interact with various components.
Below, I'll provide a step-by-step guide on how to create a basic flowchart for a voice
assistant project:
1. Define the Objective: Start by defining the primary objective of your voice
assistant. What tasks or functions should it perform? Common voice assistant
tasks include answering questions, setting reminders, controlling smart devices,
providing weather updates, and more.
2. Identify User Inputs: List the possible user inputs or commands your voice
assistant should be able to handle. These inputs will be the starting points for
your flowchart. For example, users might say, "What's the weather like today?"
or "Set a reminder for 3 PM."
3. Create Start and End Points: Draw a "start" symbol at the top of your flowchart
and an "end" symbol at the bottom. Both are terminators, usually drawn as ovals or
rounded rectangles, and represent the beginning and end of the interaction.
4. Add Decision Points: Depending on the complexity of your voice assistant, you
may need to add decision points where the assistant needs to make choices or
validate information. Decision points are represented as diamonds. For
example, you might have a decision point to check if the user's command is
clear.
5. Connect User Inputs: Connect the user inputs (from step 2) to the decision
points and actions in the flowchart. Use arrows to indicate the flow of the
conversation. For example, connect "What's the weather like today?" to a
decision point that validates the location and date.
6. Define Actions and Responses: At each step in the flowchart, describe the action
your voice assistant should take and the response it should provide. This may
include retrieving information from a database, performing calculations, or
generating a spoken or text response.
7. Error Handling: Include branches for error handling. What should the voice
assistant do if it doesn't understand the user's command or if there's an error in
processing? Define these paths in the flowchart.
8. Integration with APIs or Services: If your voice assistant interacts with external
services or APIs (e.g., weather data, calendar, or home automation), represent
these interactions in your flowchart with the appropriate symbols and labels.
9. Looping and Repetition: If your voice assistant supports repeated interactions
or loops, use loop symbols in your flowchart to represent these behaviors.
10. Testing and Validation: After creating the initial flowchart, review it to ensure
that it covers all possible user inputs and provides clear paths for the assistant's
responses. Conduct testing to validate that the flowchart functions as expected.
11. Refinement: As you develop your voice assistant, you may need to refine and
expand the flowchart to accommodate new features or scenarios.
12. Documentation: Finally, document your flowchart, ensuring that anyone
working on the project can understand the interaction logic.
Flowcharts are essential for designing and communicating the logic of your voice
assistant project. They serve as a blueprint for development, helping you visualize and
plan how the assistant will process user commands and provide responses.
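The flow described in the steps above (input, decision point, action, error handling, and looping) can be expressed as a short loop. This is a sketch under stated assumptions, not the project's algorithm: the function name `run_session`, the retry limit, and the reply strings are hypothetical, and the user inputs are given as already transcribed text.

```python
def run_session(inputs, max_retries=2):
    """Walk a list of transcribed inputs through the flow and return the replies."""
    replies = []
    retries = 0
    for text in inputs:                      # loop: one pass per user input
        command = text.strip().lower()
        if not command:                      # decision: is the command clear?
            retries += 1
            if retries > max_retries:        # error path: stop after repeated failures
                replies.append("Ending session: no clear command received.")
                break
            replies.append("Sorry, please repeat that.")
            continue
        retries = 0                          # a clear command resets the error count
        if command in ("exit", "stop"):      # decision: end of the interaction
            replies.append("Goodbye.")
            break
        replies.append(f"Executing: {command}")  # action and response
    return replies
```

Each branch in the code corresponds to one flowchart symbol: the `for` loop is the repetition arrow, the `if` statements are the decision diamonds, and the `break` statements lead to the end symbol.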
3.4 Details of Hardware and Software
Software Requirements:
Hardware Requirements:
Computer (Desktop/Laptop)
100 MB HDD storage
4 GB RAM
The expected result of our project is a voice assistant that will be useful for
educational, business, and personal purposes.
3.6 Analysis
Python is a suitable language for script writers and developers. The queries the
assistant handles can be adapted to the user's needs.
Modules needed
We set our engine to pyttsx3, a Python library for text-to-speech, with sapi5, the
Microsoft Speech API, as the driver for the text-to-speech function.
The voice ID can be changed to “0” for the male voice; here we use the female voice,
i.e. “1”, for all text-to-speech output.
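The voice setup described above might look like the following sketch. It assumes pyttsx3 is installed and that the program runs on Windows, where the sapi5 driver is available. Which index corresponds to a male or female voice depends on the voices installed on the machine, so the helper `choose_voice_id` (a hypothetical name) falls back to the first voice when the requested index is missing.

```python
def choose_voice_id(voice_ids, index=1):
    """Return the voice id at `index`, or the first voice if that index is missing."""
    if not voice_ids:
        raise ValueError("no text-to-speech voices are installed")
    return voice_ids[index] if 0 <= index < len(voice_ids) else voice_ids[0]


def speak(text, voice_index=1):
    """Speak `text` through pyttsx3 using the SAPI5 driver (Windows only)."""
    import pyttsx3  # third-party package: pip install pyttsx3

    engine = pyttsx3.init("sapi5")
    voices = engine.getProperty("voices")
    voice_id = choose_voice_id([voice.id for voice in voices], voice_index)
    engine.setProperty("voice", voice_id)
    engine.say(text)
    engine.runAndWait()  # blocks until the sentence has been spoken
```

Calling `speak("Hello")` would use index 1 as in this report, and `speak("Hello", voice_index=0)` would switch to the other voice.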
Snapshots:
3.7 Conclusion