0% found this document useful (0 votes)
32 views

Synopsis SEM4

The document describes a mini project report on developing a voice assistant. It discusses the motivation behind creating the assistant, which was to develop a program for user convenience. It also outlines the problem statement and objectives of the project.

Uploaded by

barkalesnehal1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views

Synopsis SEM4

The document describes a mini project report on developing a voice assistant. It discusses the motivation behind creating the assistant, which was to develop a program for user convenience. It also outlines the problem statement and objectives of the project.

Uploaded by

barkalesnehal1
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 24

Mini-Project Report On

Mini Project Title : Voice Assistant


Submitted in partial fulfillment of the requirements of
S.E ARTIFICIAL INTELLIGENCE & MACHINE
LEARNING ENGINEERING
By

Snehal Barkale 02
Shivraj Chavan 06
Omkar Chendge 07

Name of the Mentor

Prof. Sneha Ingale

Department of Artificial Intelligence & Machine Learning


Engineering
Shivajirao S. Jondhale College of Engineering Dombivli (E)

University of Mumbai
(AY 2023-24)
CERTIFICATE

This is to certify that the Mini Project entitled “ Voice Assistant ” is a bonafide

work of Snehal Barkale(02) Shivraj Chavan(06) Omkar Chendge(07)

submitted to the University of Mumbai in partial fulfillment for SE (Artificial

Intelligence & Machine Learning

Engineering) semester IV during the academic year 2023-24 as prescribed by

University of Mumbai.

(Prof. Sneha Ingale)


Mentor

Prof Sneha Ingale Dr. Renuka Deshpande Dr. Pramod Rodge


Project Coordinator Head of Department Principal
Mini Project Approval

This Mini Project entitled “Voice Assistant”by Snehal Barkale (02) Shivraj
Chavan (06) Omkar Chendge (07) is approved for S.E. (Artificial Intelligence
& Machine Learning Engineering) for the academic year 2023-24.

Examiners

1………………………………………
(Internal Examiner Name & Sign)

2…………………………………………
(External Examiner name & Sign)

Date:

Place:
Contents
Abstract i

Acknowledgments ii

List of Abbreviations iii

List of Figures iv

1 Introduction
1.1 Introduction
1.2 Motivation
1.3 Problem Statement & Objectives
1.4 Organization of the Report
2 Literature Survey

2.1 Survey of Existing System/SRS


2.2 Limitation Existing system or Research gap
2.3 Mini Project Contribution

3 Proposed System (eg New Approach of Data Summarization )

3.1 Introduction
3.2 Architecture/ Framework
3.3 Algorithm and Process Design
3.4 Details of Hardware & Software
3.4 Experiment and Results for Validation and Verification
3.5 Analysis
3.6 Conclusion and Future work.
References
4 Annexure

4.1 Published Paper /Camera Ready Paper/ Business pitch/proof of concept


Abstract

A voice assistant is a type of artificial intelligence (AI) software application or virtual


assistant that is designed to respond to voice commands and interact with users using
natural language processing (NLP) technology. Voice assistants are typically integrated
into various devices and platforms, such as smartphones, smart speakers, tablets, and
even certain appliances, to provide users with hands-free access to information,
perform tasks, and control connected devices.

The rise of voice assistants represents a significant advancement in artificial


intelligence and human-computer interaction.

Virtual assistants are designed to mimic human interactions, enabling users to engage
in natural conversations with these digital entities. They can perform a wide range of
tasks, including setting reminders, scheduling appointments, answering questions,
managing emails, and even controlling smart home devices. Their adaptability and
versatility make them an indispensable tool for both individual users and businesses.

Virtual assistants are also becoming increasingly integrated into various devices and
platforms, including smartphones, smart speakers, and chatbots. They can understand
user preferences and tailor responses to specific needs, which fosters a more user-
centric experience. This abstract delves into the technologies that power virtual
assistants, including machine learning, deep learning, and data analytics, which enable
them to continuously improve their performance and expand their capabilities.
Acknowledgment

I would like to express my sincere gratitude and appreciation to all those who have
supported and guided me throughout the completion of this college project. Our guide
Prof. Sneha Ingale ,our HOD Dr. Renuka Deshpande ,without their assistance and
encouragement, this project would not have been possible.

I express my gratitude to our principle Dr. Pramod Rodge for inspiring us and
supporting us during our project .

First and foremost, I would like to thank my college faculty, for their invaluable
guidance and mentorship. Their expertise and dedication to the subject matter have been
instrumental in shaping the direction of this project.

In conclusion, I acknowledge all those who have played a part, big or small, in the
successful completion of this college project. Your contributions have enriched the
quality of this work, and I am truly grateful for your support.
List of figures

Figure no. Title Page no.

3.2 Architecture 11

3.3 Algorithm 12

List of Abbreviations

1.AI Artificial intelligence

2.NLP Natural language processing

3.IOT Internet of Things

4.WI Web Intelligence

5.IROS Intelligent Robots and Systems


1
Introduction

A voice assistant is a type of artificial intelligence (AI) software application or virtual


assistant that is designed to respond to voice commands and interact with users using
natural language processing (NLP) technology. Voice assistants are typically integrated
into various devices and platforms, such as smartphones, smart speakers, tablets, and
even certain appliances, to provide users with hands-free access to information,
perform tasks, and control connected devices.

The rise of voice assistants represents a significant advancement in artificial


intelligence and human-computer interaction.

1.1 Introduction

A voice assistant is a type of artificial intelligence (AI) software application or virtual


assistant In the fast-paced world of today, the demand for efficiency and convenience
has led to the rise of virtual assistants, revolutionizing the way we interact with
technology and manage our daily tasks. A virtual assistant is a computer program or
application that uses artificial intelligence (AI) and natural language processing (NLP)
to provide users with a wide range of services and support, often mimicking the role
of a human personal assistant. These digital companions have transformed the way we
work, stay organized, and access information. The concept of a virtual assistant can be
traced back to the advent of speech recognition and text-to-speech technology. Over
the years, advancements in machine learning, data analytics, and AI have allowed
virtual assistants to become increasingly sophisticated and versatile. These digital
helpers are now integrated into various devices and platforms, including smartphones,
smart speakers, smartwatches, and even cars, making them accessible to a wide range
of users.

Virtual assistants come in various forms and are often tailored to specific applications
and ecosystems. Some of the most popular virtual assistants include Apple's Siri,
Amazon's Alexa, Google Assistant, and Microsoft's Cortana. These platforms can
perform a multitude of tasks, such as answering questions, setting reminders, sending
messages, playing music, providing directions, and controlling smart home devices.

The future of virtual assistants is incredibly promising. As AI technology continues to


evolve, virtual assistants are expected to become more personalized and context-aware,
providing users with increasingly tailored and proactive assistance. They are likely to
play a pivotal role in the development of smart cities, healthcare, education, and
various other sectors, making our lives more efficient and convenient.

1.2 Motivation

The idea of developing this project is a web based program for user convenience . When
I first time saw the advertisement of Alexa in late 2019 I don’t remember exactly but I
saw a pretty girl giving voice command to the some kind of machine and the machine
is reacting and doing the tasks according to that command. Personally for me it was
very fascinating and mind blowing .That day I dreamt of making this one day in future
Let’s talk about outside the project. When I personally studied the history of voice
assistant then I heard about the person whose name is Dr. Raj Reddy from
Andhrapradesh this guy had a very much interest in AI and speech recognition
technology. He started working on AI in 1960s itself. That era when computer only have
4kb storage.After graduation Dr. Raj Raddy was doing PHD under John Mccarthy.
At that university only computer in his lab which used to be free only at night. In such
situation Dr. Raddy used to work on speech recognition technology at night after days
work.After some time of continuous working, computer start recognizing vowels.
So the motivation comes from this story

1.3 Problem Statement

Create a virtual voice assistant, so that the system can be used by them to interact with
new technologies, control their devices and use this technology for educational
purposes and also for business purpose and future developments.
1.4 Organization of Report

Voice assistant is going to be a program where the user can chat with our voice assistant
via voice and our voice assistant can provide information asked by the user through
chrome, youtube ,Wikipedia ,etc. The user can also ask for current time and our voice
assistant can speak the current time for the user .Our voice assistant can be accessed
online .This system will be able to analyze problem and will find solution or
information to it.The proposed system will be able to provide user interface which the
user will able to see and also hear. To create this system we will use visual studio. It
will be written in python, MySQL will be the database used.
2
Literature Survey

2.1 Survey of existing systems

1. Title: "Voice Assistant Classification and Evaluation"


• Authors: Lin, F., and Ko, E.
• Published In: 2019 IEEE/RSJ International Conference on Intelligent
Robots and Systems (IROS)
• Abstract: This paper discusses the classification and evaluation of voice
assistants, providing insights into their performance and capabilities.
2. Title: "Efficient Voice Command Recognition for IoT and Mobile Devices"
• Authors: Perez, A., et al.
• Published In: 2017 IEEE Internet of Things Journal
• Abstract: This paper presents a study on the efficient recognition of
voice commands for Internet of Things (IoT) and mobile devices, which
is closely related to voice assistant technology.
3. Title: "Voice Assistant Conversational Intelligence: A Study on Google Home,
Amazon Echo, and Apple HomePod"
• Authors: Ali, W., et al.
• Published In: 2019 IEEE/WIC/ACM International Conference on Web
Intelligence (WI)
• Abstract: This paper explores the conversational intelligence of popular
voice assistant devices like Google Home, Amazon Echo, and Apple
HomePod.
4. Title: "Speech Recognition in Noisy Environments: A Survey"
• Authors: F. Visin, et al.
• Published In: 2016 IEEE/ACM Transactions on Audio, Speech, and
Language Processing
• Abstract: While not specifically about voice assistants, this survey paper
delves into speech recognition, a fundamental technology behind voice
assistants, especially in challenging environments.
2.2 Limitations of existing systems

1. Unintelligible Words

It can be very difficult for voice assistants to comprehend words that are said that are
not in the dictionary. Though this trend has been improving over time.

2. Not Understanding Commands

When you are doing a voice search, you expect to get the correct result immediately.
However, if the voice assistant does not understand what you’re asking for, you will
have to repeat yourself or put in your search query manually.

3. Providing the Wrong Information

Voice assistants are good at providing factual information, but some of them struggle
to provide decent information due to misinterpreting or misidentifying a particular
query. This problem seems to be more prominent with voice assistants, especially when
it comes to misidentifying the source of the voice command.

4. Getting Distracted during In-Person Conversation

Even though it’s good that you can use voice assistants to make your life a little bit
easier, there are times when they can be counterproductive when trying to have a
conversation with someone. This is especially true when you’re talking to someone
face-to-face..

5. Software Integration Problems

The major players in the voice assistant sphere are Siri, Google Assistant, and Alexa.
These assistants are still far from perfect, which is why it’s amazing that they were
integrated into so many different types of devices. However, despite the fact that there
are many devices and applications that are compatible with voice assistants, there are
still numerous problems with software integration

.
2.3 Mini project contribution

The voice assistance has been developed for users for educational,business and for
personal use . It has achieved the objectives and scope that were stated in this project
the project will achieve some of the below objectives :

It can open chrome, Youtube , Wikipedia , etc to search information and read 2 or 3
lines for the user. It can open presentation .It can tell current time.It can send mails,play
music,etc.
3
Proposed system

3.1 Introduction

Virtual assistant is software program that helps you ease your day-to-day tasks, such as
showing weather forecasting, listing reminders, creating to do list etc. They can take
commands as voice or text. Voice based intelligent assistant need an invoking words or
wake words to active the listener, followed by commands. For my project the wake, up
word is “MANAV”. Our voice assistant is designed to be used efficiently for all users.
This personal assistant software improves users’ productivity by managing day to day
tasks & providing information from online sources to users.

3.2 Architecture / Framework

Creating a voice assistant involves several components, including speech


recognition, natural language processing, and interaction design. Here's a high-
level architecture for a voice assistant project:

• Audio Input/Output:

o Microphone: To capture user voice commands.

o Speaker: To provide audio responses.

• Speech Recognition:

o Use a Speech-to-Text engine (ASR - Automatic Speech Recognition) to


convert spoken words into text. Popular ASR engines include Google's
Speech-to-Text, Microsoft Azure Speech Service, or open-source
solutions like Mozilla DeepSpeech.
• Natural Language Processing (NLP):

o Intent Recognition: Identify the user's intent from the transcribed text.
This involves understanding what the user wants to do or know.
o Named Entity Recognition (NER): Identify important entities like dates,
locations, and proper nouns in the user's command.
o Dialog Management: Keep track of the conversation context, including
the user's previous requests and responses.
• Knowledge Base and Data Sources:

o Store information or connect to external data sources to provide answers


to user queries. This can include APIs, databases, or web scraping.
• Response Generation:

o Use the NLP results and the context to generate a meaningful response.

o You can use pre-defined templates for common responses or generate


responses dynamically using NLG (Natural Language Generation)
techniques.
• Text-to-Speech (TTS):

o Convert the generated text response into speech using a Text-to-Speech


engine. Examples include Amazon Polly, Google Text-to-Speech, or
opensource TTS solutions.
• User Interface:

o Choose the platform for your voice assistant. It can be a mobile app, a
webbased interface, a smart speaker, or a custom hardware device.
The diagram shows the main process flow of how Voice Assistant works.
3.3 Algorithm and Process Design

Creating a flowchart for a voice assistant project is a helpful way to visualize and plan
how the assistant will respond to user inputs and interact with various components.
Below, I'll provide a step-by-step guide on how to create a basic flowchart for a voice
assistant project:

1. Define the Objective: Start by defining the primary objective of your voice
assistant. What tasks or functions should it perform? Common voice assistant
tasks include answering questions, setting reminders, controlling smart devices,
providing weather updates, and more.
2. Identify User Inputs: List the possible user inputs or commands your voice
assistant should be able to handle. These inputs will be the starting points for
your flowchart. For example, users might say, "What's the weather like today?"
or "Set a reminder for
3 PM."

3. Create Start and End Points: Draw a "start" symbol (usually a circle) at the top
of your flowchart and an "end" symbol (usually a rounded rectangle) at the
bottom. These symbols represent the beginning and end of the interaction.
4. Add Decision Points: Depending on the complexity of your voice assistant, you
may need to add decision points where the assistant needs to make choices or
validate information. Decision points are represented as diamonds. For
example, you might have a decision point to check if the user's command is
clear.
5. Connect User Inputs: Connect the user inputs (from step 2) to the decision
points and actions in the flowchart. Use arrows to indicate the flow of the
conversation. For example, connect "What's the weather like today?" to a
decision point that validates the location and date.
6. Define Actions and Responses: At each step in the flowchart, describe the action
your voice assistant should take and the response it should provide. This may
include retrieving information from a database, performing calculations, or
generating a spoken or text response.
7. Error Handling: Include branches for error handling. What should the voice
assistant do if it doesn't understand the user's command or if there's an error in
processing? Define these paths in the flowchart.
8. Integration with APIs or Services: If your voice assistant interacts with external
services or APIs (e.g., weather data, calendar, or home automation), represent
these interactions in your flowchart with the appropriate symbols and labels.
9. Looping and Repetition: If your voice assistant supports repeated interactions
or loops, use loop symbols in your flowchart to represent these behaviors.
10. Testing and Validation: After creating the initial flowchart, review it to ensure
that it covers all possible user inputs and provides clear paths for the assistant's
responses. Conduct testing to validate that the flowchart functions as expected.
11. Refinement: As you develop your voice assistant, you may need to refine and
expand the flowchart to accommodate new features or scenarios.
12. Documentation: Finally, document your flowchart, ensuring that anyone
working on the project can understand the interaction logic.
Flowcharts are essential for designing and communicating the logic of your voice
assistant project. They serve as a blueprint for development, helping you visualize and
plan how the assistant will process user commands and provide responses.
3.4 Details of Hardware and Software

.Software Requirements :

 Programming language: Python


 Tools: PyCharm

.Hardware Requirements :

 Computer (Desktop/Laptop)
 100 MB HDD storage 4GB RAM

3.5 Experiment and actual results

The expected result of our project is we will be developing a voice assistant that will
be useful in educational purposes , business ,personal use ,etc.

 It can open google


 It can open youtube
 It can search on Wikipedia and read 2 lines for the user
 It can play music
 It can send mails
 It tell current time.
 It can open our presentation

3.6 Analysis

As we know Python is a suitable language for scriptwriters and developers. The query
for the assistant can be manipulated as per the user’s need.

Modules needed

• Pyttsx3:- This module is used for the conversion of text to speech in a


program it works offline. To install this module type the below
command in the terminal.
• Wikipedia:- As we all know Wikipedia is a great source of knowledge
to get information from Wikipedia or to perform a Wikipedia search. To
install this module type the below command in the terminal.
• Speech Recognition:- Since we’re building an Application of voice
assistant, one of the most important things in this is that your assistant
recognizes your voice (means what you want to say/ ask). To install this
module type the below command in the terminal.
• Web browser:- To perform Web Search. This module comes built-in
with Python.
• Datetime:- Date and Time are used to showing Date and Time. This
module comes built-in with Python.
• Smtplib:- Simple Mail Transfer Protocol (SMTP) is used as a protocol
to handle the email transfer using Python. It is used to route emails
between email servers. It is an application layer protocol which allows to
users to send mail to another

We will set our engine to Pyttsx3 which is used for text to speech in Python and sapi5
is a Microsoft speech application platform interface we will be using this for text to
speech function.

You can change the voice Id to “0” for the Male voice while using assistant here we
are using a Female voice i.e “1” for all text to speech.
Snapshorts:
3.6 Conclusion

In conclusion, voice assistants represent a transformative technology that has reshaped


the way we interact with devices, access information, and accomplish tasks. Their
significance lies in their ability to provide accessibility, convenience, and efficiency in
a wide range of applications. From simplifying daily tasks and enhancing productivity
to improving accessibility for individuals with disabilities, voice assistants have
become an integral part of our lives.

As voice assistant technology continues to advance, we can expect even more


innovative applications and improvements in natural language understanding, making
these virtual assistants increasingly valuable in our homes, workplaces, and
communities. Voice assistants are not merely a technological convenience; they are a
powerful tool that fosters accessibility, personalization, and safety while driving
innovation in AI and human-computer interaction. Their continued evolution promises
a future where seamless voice-powered interactions will further enrich our lives and
redefine how we interact with the digital world
References

[1] Shaughnessy, IEEE, Interacting with Computers by Voice: Automatic Speech


Recognition and Synthesis proceedings of the IEEE, vol. 91, no. 9, september 2003.

[2] Patrick Nguyen, Georg Heigold, Geoffrey Zweig, Speech Recognition with Flat
Direct Models, IEEE Journal of Selected Topics in Signal Processing, 2010

[3] Mackworth (2019-2020), Python code for voice assistant: Foundations of


Computational Agents- David L. Poole and AlanK. Mackworth.

[4] Nil Goksel, CanbekMehmet ,EminMutlu, On the track of Artificial Intelligence:


Learning with Intelligent Personal Assistant, proceedings of International Journal of
Human Sciences, 2016.

[5] Keerthana S, Meghana H, Priyanka K, Sahana V. Rao, Ashwini B Smart Home


Using Internet of Things , proceedings of Perspectives in Communication , Embedded
-systems and signal processing, 2017.

[6] Sutar Shekhar, P. Sameer, Kamad Neha, Prof. Devkate Laxman, An Intelligent Voice
Assistant Using Android Platform, IJARCSMS, ISSN: 232-7782, 2017.

[7] Rishabh Shah, Siddhant Lahoti, Prof. Lavanya. K, An Intelligent Chatbot using
Natural Language Processing, International Journal of Engineering Research , Vol.6 ,
pp.281-286, 2017.

[8] Luis Javier RodrÃguez-Fuentes, Mikel Peñagarikano, AparoVarona, Germán


Bordel, GTTS-EHU Systems for the Albayzin 2018 Search on Speech Evaluation,

You might also like