This document outlines a lecture on practical applications of speech signal processing. It discusses several topics: speech coding, speech synthesis, speech recognition and understanding, speaker recognition, speech enhancement, and speech modification. The goals of the lecture are to introduce each topic, provide examples of applications, discuss selected topics in more detail, and keep the presentation at a high level.

Practical Applications of Speech Signal Processing

Vishu R Viswanathan
TI Fellow, Director, Speech Technologies Lab
DSP Solutions R&D Center
Texas Instruments, Dallas, Texas

March 2004 Vishu Viswanathan 1

Lecture Outline

Goals of the Lecture

Speech Coding
Speech Synthesis
Speech Recognition & Understanding
Speaker Recognition
Speech Enhancement
Speech Modification


Goals of the Lecture

Introduce and discuss each of a number of speech signal processing areas
List examples of practical applications
Discuss some selected topics in each area
High level presentation only


Speech Coding
Goal
  - Reduce speech signal data rate
  - Maintain high speech quality
General Principle: Take advantage of
  - Redundancies in the speech signal
  - Properties of speech production and perception
Applications
  - Digital cellular telephony, voice over IP, IP phone, audio/video conferencing, PSTN trunking, secure voice communication, digital answering machines, voice mail, voice response systems, talking products

Components of a Speech Coding System

[Block diagram] Sampled speech s(n) -> Analyzer -> x(n) -> Encoder -> y(n) -> Channel or Medium -> y(n) -> Decoder -> x(n) -> Synthesizer -> s(n)

Goal: Minimize data rate of y(n) while maximizing speech quality of s(n)

Types of Speech Coders
Waveform Coders
  - Goal: Reproduce speech on a sample-by-sample basis
  - High data rates, high speech quality
  - Examples: 64 kb/s PCM (G.711), 32 kb/s ADPCM (G.726)
Parametric Coders
  - Speech production characterized by parametric models
  - Low data rates, good speech intelligibility, communications/synthetic speech quality
  - Examples: 2.4 kb/s LPC (FS 1015), 2.4 kb/s MELP (recent NATO standard)
Analysis-by-Synthesis Coders
  - Hybrid between waveform and parametric coders, with medium data rates
  - Parametric models used, with excitation signal computed by minimizing error between synthesized speech and input speech
  - Examples: 16 kb/s G.728, 8 kb/s G.729

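To make the waveform-coder entry concrete, here is a minimal Python sketch of mu-law companding, the idea behind 64 kb/s PCM. It uses the continuous mu-law formula with a uniform 8-bit quantizer, which is an illustrative simplification and not the segmented encoding that G.711 actually specifies. All function names are hypothetical.

```python
import math

MU = 255  # mu-law constant commonly used with 8-bit telephony PCM

def mu_law_compress(x: float, mu: int = MU) -> float:
    """Map a sample in [-1, 1] to the companded domain [-1, 1]."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)

def mu_law_expand(y: float, mu: int = MU) -> float:
    """Inverse of mu_law_compress."""
    return math.copysign(((1 + mu) ** abs(y) - 1) / mu, y)

def quantize_8bit(y: float) -> int:
    """Uniformly quantize a companded value in [-1, 1] to a code 0..255."""
    return min(255, int((y + 1.0) / 2.0 * 256))

def dequantize_8bit(code: int) -> float:
    """Return the midpoint of the quantization cell for a code 0..255."""
    return (code + 0.5) / 256 * 2.0 - 1.0

# Telephone-band PCM: 8000 samples/s at 8 bits per sample.
SAMPLE_RATE_HZ = 8000
BITS_PER_SAMPLE = 8
bit_rate = SAMPLE_RATE_HZ * BITS_PER_SAMPLE  # 64000 b/s

if __name__ == "__main__":
    x = 0.01  # a quiet sample
    code = quantize_8bit(mu_law_compress(x))
    print(code, mu_law_expand(dequantize_8bit(code)), bit_rate)
```

Companding spends more quantizer levels on low-amplitude samples, where most speech energy lies, which is one way of exploiting the properties of speech perception mentioned earlier.
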
Speech Quality
Terms Used
  - Toll quality: High-grade wireline telephone
  - High quality
  - Good quality
  - Communications quality
  - Transparent quality
Formal Subjective Testing Methods
  - Expensive, time consuming
  - Mean opinion score (MOS): Used in all industry standards bodies
  - Diagnostic acceptability measure (DAM): Used by US Dept of Defense
Informal and Semi-Formal Subjective Tests
  - Pairwise or A/B comparisons
  - Rating tests
Objective Methods
  - Signal-to-Noise Ratio, ITU-T P.862 (PESQ)
  - Automatic, repeatable, useful in coder development and optimization
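Of the objective methods listed, signal-to-noise ratio is the simplest to compute. The sketch below assumes the reference and processed signals are time-aligned sequences of equal length; the function name is hypothetical, and perceptual measures such as PESQ are far more involved than this.

```python
import math

def snr_db(reference, processed):
    """Signal-to-noise ratio in dB between a reference signal and its
    processed (for example coded and decoded) version."""
    signal_energy = sum(s * s for s in reference)
    if signal_energy == 0:
        raise ValueError("reference signal has zero energy")
    noise_energy = sum((s - p) ** 2 for s, p in zip(reference, processed))
    if noise_energy == 0:
        return math.inf  # processed signal is identical to the reference
    return 10.0 * math.log10(signal_energy / noise_energy)

if __name__ == "__main__":
    ref = [1.0, -1.0, 1.0, -1.0]
    out = [0.9, -0.9, 0.9, -0.9]
    print(round(snr_db(ref, out), 2))
```

SNR treats every sample error alike, which is why it is useful in coder development but does not replace subjective tests such as MOS.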

Speech Coder Attributes
  - Bit rate (bits/second): low to high, scale 1200, 2400, 4800, 8000, 16000, 32000, 64000
  - Quality (mean opinion score): low to high, scale 2.5, 3.0, 3.5, 4.0
  - Input condition: clean speech (handheld) to noisy speech (hands-free)
  - Delay (milliseconds): low to high, scale 10, 50, 100, 200
  - Complexity (MIPS, memory): low to high
  - Input signal type: human speech, music, sound effects

Speech Coding Standards

ITU Standards
coder     rate (kb/s)   approach
G.711     64            Mu/A-law
G.726     16-40         ADPCM
G.728     16            LD-CELP
G.729     8             CS-ACELP
G.723.1   5.3/6.3       MP/ACELP

ITU standards are targeted for telephone network applications
Also used in Voice over IP applications
All produce toll quality speech

Speech Coding Standards

Digital Cellular Standards
region          coder       rate (kb/s)   chan rate     approach    date
Europe          GSM FR      13            22.8          RPE-LTP     1987
Europe          GSM HR      5.6           11.4          VSELP       1994
Europe          GSM EFR     12.2          22.8          ACELP       1995
Europe          GSM AMR     4.75-12.2     11.4-22.8     ACELP       1998
North America   TIA IS54    7.95          13            VSELP       1989
North America   TIA IS95    0.8-8.55                    QCELP       1993
North America   TIA Q13     0.8-13.3                    QCELP       1995
North America   TIA IS641   7.4           13            ACELP       1996
North America   TIA EVRC    0.8-8.55                    R-ACELP     1996
North America   TIA SMV     0.8-8.5                     R-ACELP     2001
Japan           PDC FR      6.7           11.2          VSELP       1990
Japan           PDC HR      3.45          5.6           PSI-CELP    1993
Japan           PDC EFR     8             11.2          ACELP       1999
Japan           PDC EFR     6.7           11.2          ACELP       2000

Speech Coding Standards

Wideband Standards
coder     rate (kb/s)   approach
G.722     48, 56, 64    SB-ADPCM
G.722.1   24, 32        Transform
ITU WB    16, 24        ACELP
AMR WB    6.60-23.85    ACELP
VMR WB    1.0-13.3      ACELP

Wideband: 50 Hz to 7 kHz (versus narrowband telephone, 300-3200 Hz)


Speech Synthesis
Human Speech Based Systems
  - Suitable for known material
  - Speech coding based
      Talking toys, talking books, voice prompts, voice response systems
  - Concatenation of pre-recorded voice data
      Information retrieval (stock quotes, airline schedules, banking)
Text-to-Speech Systems
  - Suitable for unknown or arbitrary text
  - Applications: e-mail/fax reading, phone access to web based services, spoken telephone directory, car navigation, location-based services, customer service, help desk, reading machines for the blind

Components of a TTS System

[Block diagram] Text -> Text Analysis -> Letter-to-Sound -> Synthesizer -> Speech (with a Dictionary and Rules block)

Text Analysis
  - Numerical expansion (dates, times, money)
  - Abbreviations, acronyms
  - Proper name id
  - Example: "Dr. Smith lives at 23 Lakeshore Dr."
Letter-to-Sound
  - Phonemes
  - Pitch
  - Duration
  - Pauses
  - Loudness/amplitude
Synthesizer
  - Choice of units: words, phones, diphones, dyad, syllables
  - Choice of parameters: LPC, formants, waveform templates, articulatory parameters, sinusoidal parameters
  - Method of computation: rules, concatenation

Courtesy of Larry Rabiner
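The text analysis stage can be illustrated with the slide's own example sentence, where "Dr." must be read once as a title and once as a street suffix. The Python sketch below is a toy rule set under simple assumptions: "Dr." directly before a capitalized word is a title, any other "Dr." is "Drive", and only one- and two-digit numbers are expanded. The names `normalize` and `number_to_words` are hypothetical, and a real system would rely on a dictionary and many more rules.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]

def number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..99."""
    if not 0 <= n <= 99:
        raise ValueError("only 0..99 is supported in this sketch")
    if n < 20:
        return ONES[n]
    tens, ones = divmod(n, 10)
    return TENS[tens] if ones == 0 else f"{TENS[tens]}-{ONES[ones]}"

def normalize(text: str) -> str:
    """Expand 'Dr.' and small numbers into speakable words."""
    # "Dr." directly before a capitalized word is read as a title.
    text = re.sub(r"\bDr\.(?=\s+[A-Z])", "Doctor", text)
    # Any remaining "Dr." is read as a street suffix.
    text = re.sub(r"\bDr\.", "Drive", text)
    # Expand standalone one- and two-digit numbers.
    return re.sub(r"\b\d{1,2}\b",
                  lambda m: number_to_words(int(m.group())), text)

if __name__ == "__main__":
    print(normalize("Dr. Smith lives at 23 Lakeshore Dr."))
```

The capitalization rule fails when a street suffix is followed by a new sentence, which is one reason the diagram pairs text analysis with a dictionary and rules.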


Speech Recognition & Understanding
Problem
  - Recognition: Automatic recognition of human speech by machine
  - Understanding: Interpret the meaning of recognized speech and map it to actions to be taken
Applications
  - Voice dialing (name or number dialing) in telephone, cellphone, PDA, smartphone (safety laws against handheld cellphone use while driving)
  - Voice command & control in telematics, cellphone, PDA, smartphone, PC, toys
  - Voice-enabled web browsing, information retrieval (stock quotes, weather forecast, airline flight information, banking), navigation, e-mail, SMS, dictation
  - Automated customer service and help desks
Benefits: hands-free, eyes-free use; not using keypad; faster task completion; ease of use; part of multi-modal interface; cost savings

Components of a Speech Recognizer

[Block diagram] speech signal -> Feature Extraction -> Acoustic Scoring -> Decoding -> word string
Supporting blocks: Acoustic Models, Language Models
Stage labels: Front end, Back end

Speech Recognizer Attributes
  - Speaker mode: speaker dependent, speaker adaptive, speaker independent
  - Vocabulary size (words): small to large, scale 10, 100, 1000, 10000
  - Speaking style: isolated words, continuous speech, conversational speech
  - Task: recognition (syntax) to understanding (semantics)
  - Input condition: clean speech (handheld) to noisy speech (hands-free)
  - Complexity (MIPS, memory): low to high
  - Architecture: server based, distributed, client based

Performance & Robustness
Performance
  - Recognition accuracy: Word error rate (WER) or task completion rate
  - High enough performance required for user acceptance
Robustness Issues
  - Training versus operational condition differences
  - Background noise: extent of noise, its variability (usually additive)
  - Channel variability: different microphones, different telephone circuits, handheld, handsfree, handheld-handsfree (usually convolutive)
  - Recognizer must have means to compensate for noise and channel variabilities
  - Out-of-vocabulary rejection capability
  - Speaker dialect and accent variability (handled by speaker adaptation)
User Interface: Very important for the success of an application
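Word error rate is usually computed as the word-level edit distance (substitutions, deletions, and insertions) between the reference transcript and the recognizer output, divided by the number of reference words. A minimal Python sketch, assuming whitespace-separated words and a hypothetical function name:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript is empty")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from output
            insertion = curr[j - 1] + 1  # extra word in the output
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)

if __name__ == "__main__":
    print(word_error_rate("call john smith at home", "call jon smith home"))
```

Because insertions count as errors, WER can exceed 1.0 when the recognizer outputs many more words than were spoken.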

Recognition in Multiple Languages
Speaker-Dependent Recognition
  - Language independent (user can enroll names for voice dialing in multiple languages!)
Some Observations for Speaker-Independent Recognition
  - Same recognition engine but different data (models, dictionary) needed
  - Recognition grammar to handle language-specific usage differences (e.g., French speakers say telephone numbers in pairs; natural number dialing needed)
  - Training requires speech databases and dictionary in the new language
  - Automatic training tools to minimize time to develop recognition in a new language


Speaker Recognition
Speaker Verification / Authentication
  - Problem: Use voice input to verify the user's claimed identity
  - Applications: Secure access to premises, information (banking), services (voice dialing), etc.
  - Issues
      - True user acceptance traded off with impostor acceptance
      - Total voice verification
      - Fixed text versus free text
Speaker Identification
  - Problem: Use voice to identify speaker from a closed or open set of speakers
  - Applications: Legal and forensic use, intelligence, security
  - Issues: Uncooperative user, often relatively short-duration speech, noisy and/or distorted speech
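The trade-off between accepting true users and accepting impostors comes from the decision threshold applied to the verification score. The Python sketch below assumes that a higher score means a better match; the function name and the scores in the usage example are made up for illustration.

```python
def error_rates(genuine_scores, impostor_scores, threshold):
    """Return (false rejection rate, false acceptance rate) at a threshold.

    A claim is accepted when its score is at or above the threshold.
    """
    false_rejects = sum(1 for s in genuine_scores if s < threshold)
    false_accepts = sum(1 for s in impostor_scores if s >= threshold)
    return (false_rejects / len(genuine_scores),
            false_accepts / len(impostor_scores))

if __name__ == "__main__":
    # Hypothetical match scores, not measured data.
    genuine = [0.90, 0.80, 0.75, 0.60]
    impostor = [0.70, 0.40, 0.30, 0.20]
    for threshold in (0.50, 0.65, 0.78):
        frr, far = error_rates(genuine, impostor, threshold)
        print(threshold, frr, far)
```

Raising the threshold rejects more impostors but also more true users, so the operating point depends on whether the application values security or convenience more.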


Speech Enhancement

Noise Suppression
Playback Enhancement
Acoustic Echo Cancellation


Noise Suppression
Problem
  - Remove acoustic noise from noisy speech signal for better listenability or for improved performance of speech processing devices
  - Requirements: No speech signal distortion, no loss of speech intelligibility, no artifacts like musical noises, natural sounding residual noise
Methods
  - Single microphone approach: spectral subtraction family of methods
  - Multi-microphone approach: adaptive noise cancellation, microphone array based fixed or adaptive beamforming, blind signal separation
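The core of the single-microphone approach can be sketched as a per-frequency-bin rule. The Python sketch below assumes the magnitude spectrum of a noisy frame and a noise magnitude estimate (for example averaged over speech pauses) are already available from a short-time Fourier transform, which is not shown. The function name and the `over_subtraction` and `floor` parameters are illustrative choices, not taken from the lecture.

```python
def spectral_subtract(noisy_mag, noise_mag, over_subtraction=1.0, floor=0.05):
    """Subtract a noise magnitude estimate from a noisy magnitude spectrum.

    Each output bin is kept at or above floor * noisy magnitude, so bins
    never go negative and isolated residual peaks are less prominent.
    """
    if len(noisy_mag) != len(noise_mag):
        raise ValueError("spectra must have the same number of bins")
    return [max(y - over_subtraction * n, floor * y)
            for y, n in zip(noisy_mag, noise_mag)]

if __name__ == "__main__":
    noisy = [1.0, 0.5, 0.2]   # hypothetical magnitudes for three bins
    noise = [0.2, 0.2, 0.2]   # hypothetical noise estimate
    print(spectral_subtract(noisy, noise))
```

The enhanced magnitudes would then be combined with the noisy phase and inverse-transformed. The spectral floor is one common way to soften the musical-noise artifacts named in the requirements, at the cost of leaving some residual noise.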

Playback Enhancement
Problem
  - Enhanced playback of speech to the listener
Methods
  - Spectrally shape the speech signal prior to playback, for improved intelligibility when the listener is in a noisy environment (PA system in aircraft, airports, sports arenas)
  - Active noise cancellation to cancel noise acoustically in the listener's ears (ANC headsets)
  - Narrowband to wideband speech extension to provide wideband speech perception

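Acoustic Echo Cancellation

[Block diagram] The downlink (far-end) signal drives the loudspeaker, and the acoustic channel H(z) couples it into the microphone. An adaptive filter (AEC) with its own H(z) estimate produces an echo estimate that is subtracted from the microphone signal to form the error signal e(n) on the uplink.
Labeled signals: r(n), s(n), x(n), y(n), e(n); microphone signal v(n) = u(n) + y(n) + n0(n)
Labels: Downlink Signal, Far End Signal, Uplink Signal, Near End Signal, Error Signal

Goal: Cancel feedback from loudspeaker into microphone using adaptive linear filter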
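One widely used adaptive linear filter for this task is normalized LMS (NLMS). The lecture does not name a specific algorithm, so the Python sketch below is an assumption. It also assumes a short FIR echo path, no near-end talker, and no noise, and the function and variable names are hypothetical rather than the diagram's symbols.

```python
import random

def simulate_echo(far_end, echo_path):
    """Convolve the far-end signal with an FIR echo path."""
    return [sum(h * far_end[n - k] for k, h in enumerate(echo_path) if n - k >= 0)
            for n in range(len(far_end))]

def nlms_echo_canceller(far_end, mic, num_taps=8, step=0.5, eps=1e-8):
    """Adapt an FIR filter so its output tracks the echo in the mic signal.

    Returns the error (echo-cancelled) signal and the final filter weights.
    """
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # most recent far-end samples, newest first
    errors = []
    for x, d in zip(far_end, mic):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        e = d - echo_estimate
        norm = sum(h * h for h in history) + eps  # input power normalization
        weights = [w + step * e * h / norm for w, h in zip(weights, history)]
        errors.append(e)
    return errors, weights

if __name__ == "__main__":
    random.seed(0)
    far = [random.uniform(-1.0, 1.0) for _ in range(2000)]
    mic = simulate_echo(far, [0.5, -0.3, 0.1])  # hypothetical echo path
    residual, taps = nlms_echo_canceller(far, mic, num_taps=4)
    print([round(t, 3) for t in taps])
```

In a real handset the microphone also picks up near-end speech and noise, so practical cancellers add double-talk detection and freeze or slow adaptation while the near-end talker is active.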


Speech Modification
Voice Conversion
  - Convert one voice to sound like another
  - Example: a female voice converted to sound like a low-pitched male voice (security)
Time-Scale or Rate Modification
  - Speed up or slow down speech, while preserving naturalness
  - Applications: talking books, pre-recorded lectures, language learning
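A basic way to change speaking rate without resampling is overlap-add (OLA): windowed frames are read from the input at one hop size and written to the output at another. The Python sketch below is a plain OLA under simple assumptions (Hann window, fixed frame length), with hypothetical names. Plain OLA can introduce audible artifacts on voiced speech, and methods that align frames before adding them are usually needed to preserve naturalness.

```python
import math

def ola_time_stretch(signal, rate, frame_len=256, synth_hop=64):
    """Time-scale a signal by overlap-add; rate > 1 speeds speech up."""
    if rate <= 0:
        raise ValueError("rate must be positive")
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    analysis_hop = synth_hop * rate  # input advance per output frame
    num_frames = int((len(signal) - frame_len) / analysis_hop) + 1
    out_len = (num_frames - 1) * synth_hop + frame_len
    output = [0.0] * out_len
    weight = [0.0] * out_len
    for k in range(num_frames):
        start_in = int(k * analysis_hop)
        start_out = k * synth_hop
        for n in range(frame_len):
            output[start_out + n] += window[n] * signal[start_in + n]
            weight[start_out + n] += window[n]
    # Divide by the summed window so overlapping frames keep unit gain.
    return [o / w if w > 1e-8 else 0.0 for o, w in zip(output, weight)]

if __name__ == "__main__":
    tone = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(4000)]
    print(len(tone), len(ola_time_stretch(tone, 2.0)), len(ola_time_stretch(tone, 0.5)))
```

Because frames are copied rather than resampled, the pitch of each frame is unchanged while the overall duration scales by roughly 1/rate.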
