Technical Paper
Technical Paper
Abstract—AI desktop assistants like Apple’s ”SIRI” and at the end of the paper, highlighting the sources that provided
Google’s ”Google Voice Search,” can perform tasks and provide information and support for the research project.
services based on user commands. These systems use speech
recognition to respond to synthetic speech, allowing users to
communicate with their devices. The proposed system, which can II. L ITERATURE S URVEY
work with or without internet connectivity, uses voice recognition
to process user input and provide various outputs. AI-based In the realm of technology, researchers are continuously
personal assistants aim to bridge the communication gap between advancing the capabilities of virtual assistants and AI-
humans and machines, creating a more engaging user experience. driven communication systems. Leandro Tibola and Liane
Margarida Rockenbach Tarouco [1] emphasize the importance
Index Terms—AI, SIRI, Google Voice Search, Speech Recog-
of interoperability in virtual worlds, highlighting the role
nition, Internet, Personal Assistants, User Experience
of WWW services using HTTP and XML to enhance
I. I NTRODUCTION communication between virtual and real-world entities
AI assistants have a long history, starting with the Turing while bolstering security measures against modern operating
Test in the 1950s. Advances in machine learning, particularly systems.
deep learning and neural networks, led to breakthroughs like
OpenAI’s GPT-3 in 2020. Today, AI assistants are used in Alec Radford, Karthik Narasimhan, Tim Salimans, and Ilya
customer service, translation, content generation, and personal Sutskever [2] showcase significant advancements in natural
virtual assistants like Siri and Alexa. language understanding through generative pre-training
and discriminative fine-tuning. Their task-agnostic model
The goal of creating an AI desktop assistant that talks like outperforms discriminatively trained models in various
humans is to enhance user experience, facilitate efficient task language understanding tasks, marking a notable stride in
execution, make technology more accessible and inclusive, language processing capabilities.
provide personalized assistance, and push the boundaries of
natural language processing. This innovative project requires Deepak Shende, Ria Umahiya, Monika Raghorte,
advanced natural language understanding and pushes the Aishwarya Bhisikar, and Anup Bhange [3] present an
boundaries of NLP research and development. The goal is AI-based voice assistant project implemented using Python,
to create an AI-powered desktop assistant that communicates leveraging open-source software modules and community
with users in a natural and relatable way, making technology support to ensure adaptability to future updates.
more approachable, efficient, and user-centric.
Rajat Sharma and Adweteeya Dwivedi [4] introduce
The research aims to enhance natural language ”JARVIS,” an AI voice assistant system employing speech
understanding in AI desktop assistants, train models recognition, gTTS, neural networks, and natural language
effectively, mitigate biases, and improve user experience, processing to deliver intelligent and responsive interactions
promoting inclusivity, enhancing productivity, and pushing tailored to specific circumstances.
NLP capabilities.
Afra Ali, Shweta Dubey, Shyam Dwivedi, Divisha Pandey,
The research paper explores the development of artificial in- Md. Saif Raza, and Muskan Srivastava [5] unveil a voice
telligence desktop assistants, focusing on recent developments assistant service for desktop users, integrating internet-
and techniques. It presents a literature survey in section II, of-things technology, speech recognition, and modern AI
discusses the proposed work in section III, and analyzes the technologies to provide enhanced functionalities and a
outcomes. Section V presents the results and discussion that seamless user experience.
provides insights into the assistant’s limitations and efficacy.
Section IV gives the implementation details including the Vedant Kulkarni, Shreyas Kallurkar, Vipul Waikar, Saurabh
design and implementation of the virtual desktop assistant. The Patil, and Swarupa Deshpande [6] present a framework
study concludes with a conclusion and future directions for for a virtual assistant that overcomes existing constraints,
further research in Section VI. A list of references is provided promising improved effectiveness and usability in processing
voice inputs.
Figure 4. The above figure showcases the top 10 most popular programming
languages globally, including Python, Java, and JavaScript.
V. R ESULTS
This section presents visual project results, including
screenshots of our AI assistant’s functionality and user
interface. These provide a firsthand view of the system’s
performance and usability, highlighting user interaction Figure 8. On saying ”WhatsApp”, the assistant enables the user to send
flow, interface design, and tasks executed. WhatsApp messages to specified contacts by dictating the message via voice
input.
Last but not the least we would like to thank to our parents
and friends.
R EFERENCES
[1] Leandro Tibola, Liane Margarida Rockenbach Tarouco (2013). ”Interop-
erability in Virtual World.”
[2] Alec Radford, Karthik Narasimhan, Tim Salimans, Ilya Sutskever (2018).
”Improving Language Understanding by Generative PreTraining.”
[3] Deepak Shende, Ria Umahiya, Monika Raghorte, Aishwarya Bhisikar,
Anup Bhange (2019). ”AI Based Voice Assistant Using Python.”
[4] ajat Sharma, Adweteeya Dwivedi (2022). ”JARVIS - AI Voice Assistant.”
[5] Divisha Pandey, Afra Ali, Shweta Dubey, Muskan Srivastava, Shyam
Dwivedi, Md. Saif Raza (2022). ”Voice Assistant Using Python and AI.”