
AIspeaker

ppt presentation

Uploaded by

Manoj Vattikuti
Copyright
© All Rights Reserved

PES INSTITUTE OF TECHNOLOGY AND MANAGEMENT

DEPARTMENT OF MASTER OF COMPUTER APPLICATIONS

“ARTIFICIAL SPEAKER FOR DEAF AND DUMB PEOPLE USING THEIR MURMURING SOUND”

Presented By:
Manoj V
4PM22MC018
TABLE OF CONTENTS

Introduction to the Project
Existing System
Proposed System
Software and Hardware Requirements
Algorithms
Technology Used
System Architecture
INTRODUCTION

• Smartphones are now popular and easily accessible, and their features can make tedious tasks simpler. Translating voice input into text in real time could bring a drastic change to the lives of deaf and dumb people.
• Gestures are of limited use, because not everyone is familiar with sign language and different regions of the world use different sign languages.
• We are trying to implement a communication mechanism that bridges the gap between the deaf and dumb community and other people.
• Our primary concern is to give the deaf and dumb community a way to benefit from current technology by making use of STT (speech-to-text) and TTS (text-to-speech).
EXISTING SYSTEM
• In one existing system, a module was developed for dumb persons using flex sensors attached to the user's hand.
• Each flex sensor reacts to the bend of an individual finger.
• The controller reads these values and responds with speech: each flex sensor is associated with a unique voice clip stored in the APR Kit, so each sign plays its own clip.
• Other existing systems handle only some letters of the alphabet, not words or sentences, and the accuracy obtained is very low.
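The flex-sensor approach described above amounts to a fixed lookup from bent fingers to stored clips. The sketch below illustrates that idea only. The threshold value, the clip names, and the function `clips_for_reading` are hypothetical and are not taken from the existing system.

```python
# Hypothetical sketch of the flex-sensor approach: one sensor per finger,
# and each bent finger triggers its own pre-recorded clip.

BEND_THRESHOLD = 512  # assumed sensor reading above which a finger counts as bent

# Assumed mapping from finger index to the name of a stored voice clip.
CLIPS = {0: "water", 1: "food", 2: "help", 3: "yes", 4: "no"}


def clips_for_reading(readings: list[int]) -> list[str]:
    """Return the clip names for every finger whose sensor reads as bent."""
    return [
        CLIPS[i]
        for i, value in enumerate(readings)
        if value > BEND_THRESHOLD and i in CLIPS
    ]


# Fingers 1 and 4 are bent in this made-up reading.
print(clips_for_reading([100, 700, 90, 80, 900]))
```

Because the vocabulary is limited to the entries in the table, a design like this cannot produce arbitrary words or sentences, which is the limitation the proposed system sets out to address.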
PROPOSED SYSTEM
• The proposed system aims to translate the murmuring sounds made by deaf and dumb individuals into comprehensible speech, using machine learning, signal processing, STT (Speech to Text) and TTS (Text to Speech).
• This artificial speaker system is intended to enhance communication by converting subtle vocalizations into clear spoken language.
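One way to picture the proposed system is as two chained stages: a recognizer that turns murmur samples into text, and a synthesizer that turns text into audio. The sketch below is a minimal illustration under that assumption. The class `ArtificialSpeaker` and the placeholder stages `fake_recognizer` and `fake_tts` are hypothetical names, not part of the project's code.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ArtificialSpeaker:
    """Chains a recognition stage and a synthesis stage."""

    recognize: Callable[[list[float]], str]  # murmur samples -> text
    synthesize: Callable[[str], bytes]       # text -> audio bytes

    def speak(self, murmur: list[float]) -> tuple[str, bytes]:
        text = self.recognize(murmur)
        return text, self.synthesize(text)


# Placeholder stages so the sketch runs. A real system would plug in a
# trained recognition model and a TTS engine here.
def fake_recognizer(samples: list[float]) -> str:
    energy = sum(s * s for s in samples) / max(len(samples), 1)
    return "hello" if energy > 0.1 else ""


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


speaker = ArtificialSpeaker(recognize=fake_recognizer, synthesize=fake_tts)
text, audio = speaker.speak([0.5] * 10)
print(text, len(audio))
```

Keeping the two stages behind plain callables means either one could be swapped out, for example replacing the recognizer with a CNN-based model, without touching the other.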
SOFTWARE AND HARDWARE REQUIREMENTS

Software requirements:
• Language used: Python 3.10 or later
• Front end: HTML, CSS, JS
• Database: MySQL
• Operating system: Windows 7 and above

Hardware requirements:
• Processor: 64-bit
• RAM: 4 GB
• Storage: 256 GB HDD
ALGORITHMS
CNN (Convolutional Neural Network):
• CNNs play a crucial role in feature extraction from audio data. They can process spectrograms, which are visual representations of how the frequency spectrum of the murmuring sounds changes over time.
• By identifying patterns and hierarchies within these spectrograms, CNNs capture the essential characteristics of the murmuring sounds.
• The extracted features are then used to train models that translate the murmuring sounds into phonemes or words, with the aim of improving the accuracy and efficiency of sound recognition.
• This, in turn, should lead to more precise and intelligible speech synthesis output.
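To make the spectrogram and convolution steps concrete, here is a dependency-free sketch. It computes a magnitude spectrogram with a naive DFT and applies a single hand-picked filter followed by ReLU. This is not the project's model: a real CNN learns many filters during training and would normally be built with a deep learning library. The function names, frame sizes, kernel, and the synthetic tone standing in for a murmur are all assumptions made for illustration.

```python
import cmath
import math


def spectrogram(samples, frame_size=64, hop=32):
    """Magnitude spectrogram: one row per time frame, one column per frequency bin."""
    frames = []
    for start in range(0, len(samples) - frame_size + 1, hop):
        frame = samples[start:start + frame_size]
        # A Hann window reduces spectral leakage at the frame edges.
        windowed = [
            x * (0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_size - 1)))
            for n, x in enumerate(frame)
        ]
        row = []
        for k in range(frame_size // 2 + 1):
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_size)
                for n, x in enumerate(windowed)
            )
            row.append(abs(coeff))
        frames.append(row)
    return frames


def conv2d_relu(image, kernel):
    """One CNN filter: valid 2D cross-correlation (stride 1, no padding), then ReLU."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for i in range(len(image) - kh + 1):
        out_row = []
        for j in range(len(image[0]) - kw + 1):
            acc = sum(
                image[i + a][j + b] * kernel[a][b]
                for a in range(kh)
                for b in range(kw)
            )
            out_row.append(max(0.0, acc))
        out.append(out_row)
    return out


# Synthetic stand-in for a murmur: a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
signal = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(512)]

spec = spectrogram(signal)
# Hand-picked (not learned) kernel that responds to energy peaking in one bin.
kernel = [[-1.0, 2.0, -1.0]] * 3
features = conv2d_relu(spec, kernel)
print(len(spec), len(spec[0]), len(features), len(features[0]))
```

With these settings each frequency bin spans 125 Hz, so the tone's energy concentrates around bin 8 of every frame, and the filter responds most strongly where its centre column lines up with that bin. In a trained network, stacks of such feature maps would feed the layers that predict phonemes or words.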
TECHNOLOGY USED
STT (Speech to Text):
• Speech-to-Text (STT) technology converts spoken language into written text using audio processing, feature extraction, and machine learning models such as neural networks. It is commonly used in applications such as voice recognition, transcription services, and virtual assistants.

TTS (Text to Speech):
• Text-to-Speech (TTS) technology converts written text into spoken words, using models such as neural networks to generate natural-sounding speech. It enhances accessibility and communication, especially for individuals with visual or speech impairments.
SYSTEM DESIGN
BLOCK DIAGRAM:
