Synthesized Speech Using A Small Microcontroller
Synthesized Speech Using A Small Microcontroller
e-ISSN: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 6, Ver. III (Nov - Dec .2015), PP 77-94
www.iosrjournals.org
Abstract: Speech was the natural form of human communication. Automatic speech signal generation on
computers is commonly called speech synthesis. Speech synthesizer can be created by concatenating pieces of
recorded audio speech that was stored in a database memory. Text-to-speech synthesis was an important object
in speech interface, which allows low-bandwidth text to supply a user with easy to gain the information. Speech
synthesis system has wide range of applications in our daily life. Speech synthesis plays an important role in
signal processing. In this paper a Speech synthesized system is designed which is based on microcontroller, a
project that performs several commands as defined for the synthesis of the speech signal.
Index Terms: Microcontoller, Control System
I.
Introduction
Embedded systems play an important role in our day to day life. Due to limited works this systems can
be highly optimized by particular needs [27]. Some of the embedded systems applications were controlling the
devices with some amount of intelligence built-in [1].
Speech synthesis system has wide range of applications in our daily life. Speech synthesis plays an
important role in signal processing. The natural, intelligible synthetic voices in the expanding field of speech
technology had a lot of demand [4]. Speech was the natural form of human communication. Automatic speech
signal generation on computers is commonly called speech synthesis. Speech synthesis was the artificial
production of human speech [2]. Speech synthesizer can be created by concatenating pieces of recorded audio
speech that was stored in a database memory. Text-to-speech synthesis was an important object in speech
interface, which allows low-bandwidth text to supply a user with easy to gain the information. In speech synthesis
field, a number of research programmes have taken place during the last decade [13]. In 1984 an industry expert
Mr Drinkwater said that about the synthesis synthesis is going to be the next bam-burning technology [2]. Now
a days synthesis speech was useful for many applications. In 1999 an American business magnate said that In a
few years personal computers will talk back to us [2].In the past (Allen et al., 1987), the text to speech synthesis
was done from a complete knowledge of the acoustics of the language being targeted.
For getting text to speech synthesis different techniques were used [5]. Concatenate method was the
simplest method for achieving the synthesis speech. By using this method the speech was generated by linking
pre recorded speech segments to build syllables, phrases, or words [2]. the applicable criteria that follow.
The block diagram for the system was shown in the following figure 1.
INPUT
OUTPUT
www.iosrjournals.org
77 | Page
Problem Analysis
This Paper was highlighted the requirements for developing the system. These requirements were
categorized into two major sections those were hardware requirements and software requirements. Based on the
project deliverables the requirements of the system were given below and clearly explained. The block box
testings were also mentioned in the ending of this Paper.
2.1) Hardware requirements of the system
This project synthesized speech using a small microcontroller was a type embedded system so it
would be a combination of electronic hardware and software code. The following figure was show the system
would have electronic hardware capable of control the ISD board as shown in Fig. 2.1 (a).The following figure
was show the system would have electronic hardware capable of speech synthesizer which can controlled by the
microcontroller. The block diagram for synthesized speech using microcontroller was shown in the Figure
2.1(b)
www.iosrjournals.org
78 | Page
III.
Based on the hardware and software requirements as mentioned before Paper, the solutions were given
for both hardware and software. According to the deliverables mentioned in the previous Paper, the first task
was to control audio record/playback ISD board and second one was generate the synthesized speech within the
microcontroller. The hardware and software solution were given below.
3.1) PIC18F1320 Microcontroller
PIC18F1320 was an 8 bit microcontroller developed by Microchip Company and it was power efficient
because used Nano watt technology. This microcontroller was used in this project because it has external
oscillator and many other features [30]. PIC18F microcontroller support enhanced USART module and supports
RS485, RS232 and LIN1.2 which has auto baud rate and also auto wake up start bit. The microcontroller
PIC188F1320 has four crystal modules (LP, XT, HS and HSPLL) and high computational performance. It could
be works at high speed up to 16-40MHZ. So PIC18F1320 (Fig 3.1) was used rather than PIC16F series (it works
at low speed). For more information about the PIC the references 30 and 27 were studied. It had two ports A and
B and port B pins were used in this project. In this project the microcontroller was used to control the audio
record/playback device and generating the synthesized speech.
DOI: 10.9790/2834-10637794
www.iosrjournals.org
79 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
80 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
81 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
82 | Page
IV.
Problem Implementation
DOI: 10.9790/2834-10637794
www.iosrjournals.org
83 | Page
CONN-H2
C2
10u
D1
GND
R2
U1
1
2
6
7
3
4
15
16
RA0/AN0
RA1/AN1/LVDIN
RA2/AN2/VREFRA3/AN3/VREF+
RA4/T0CKI
RA5/MCLR/VPP
RA6/OSC2/CLKO
RA7/OSC1/CLKI
RB0/AN4/INT0
RB1/AN5/TX/CK/INT1
RB2/P1B/INT2
RB3/CCP1/P1A
RB4/AN6/RX/DT/KBI0
RB5/PGM/KBI1
RB6/PGC/T1OSO/T1CKI/P1C/KBI2
RB7/PGD/T1OSI/P1D/KBI3
8
9
17
18
10
11
12
13
330R
DIODE-LED
PIC18F1320
PACKAGE=DIL18
VDD=GND
VSS=+5V
CONN-H2
R1
10k
+5V
1
2
3
4
8
7
6
5
CONN-DIL8
J2
1
2
3
4
CONN-SIL4
J3
J1
4
3
2
1
GND
CONN-SIL4
DOI: 10.9790/2834-10637794
www.iosrjournals.org
84 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
85 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
86 | Page
www.iosrjournals.org
87 | Page
TTS A
CONN-SIL2
TTS C
5V SUPPLY TO BOARD
7
6
5
4
3
2
1
1
2
1
2
TTS B
1
2
3
4
5
6
7
+5V
CONN-H2
TTS D
1 CONN-SIL7
2
3
4
5
6
7
7
CONN-SIL7
6
5
4
3
2
1
CONN-SIL7
CONN-SIL7
C2
10u
D1
GND
R2
U1
1
2
6
7
3
4
15
16
D2
DIODE-LED
RB0/AN4/INT0
RB1/AN5/TX/CK/INT1
RB2/P1B/INT2
RB3/CCP1/P1A
RB4/AN6/RX/DT/KBI0
RB5/PGM/KBI1
RB6/PGC/T1OSO/T1CKI/P1C/KBI2
RB7/PGD/T1OSI/P1D/KBI3
330R
8
9
17
18
10
11
12
13
DIODE-LED
PIC18F1320
PACKAGE=DIL18
VDD=+5V
VSS=GND
R3
330R
RA0/AN0
RA1/AN1/LVDIN
RA2/AN2/VREFRA3/AN3/VREF+
RA4/T0CKI
RA5/MCLR/VPP
RA6/OSC2/CLKO
RA7/OSC1/CLKI
CONN-H2
R1
10k
+5V
8
7
6
5
CONN-DIL8
MM232 A MM232 C
4 B
MM232
1
2
3
4
1
2
3
4
CONN-SIL4
3
2
1
GND
CONN-SIL4
DOI: 10.9790/2834-10637794
Pin No.
Pin B5
Pin B4 (Rx)
Pin B1 (Tx)
Pin B2
Pin B3
www.iosrjournals.org
88 | Page
DOI: 10.9790/2834-10637794
www.iosrjournals.org
89 | Page
www.iosrjournals.org
90 | Page
V.
Results
In this Paper the system outcomes were explained clearly. Initially the control PCB was designed for
control the audio record/playback device (ISD board). The main task of the project synthesis speech systems
results were given and explained in the last section.
5.1) Results of the control system
Based on the command which was entered by the user on the hyper terminal the microcontroller
controls the ISD board. When the user entered command r on hyper terminal then the microcontroller send the
active low signal to the ISD board the system starts recording. The microcontroller sends the status of the ISD
board to the 7th pin (RX) of the MM232R and it was displayed on the hyper terminal. The red LED of the ISD
board blink until the system recording the voice. The following figures (5.1(a) and 5.1(b)) were show the
process of the system.
Figure 5.1(a): Computer received the data from the PIC when the system starts recording
DOI: 10.9790/2834-10637794
www.iosrjournals.org
91 | Page
Figure 5.1(c): Computer received the data from the PIC when the system start play
Outcome for the speech synthesis system
The final PCB which was the main PCB, designed for the controlling of the ISD board and synthesis speech
system. The following figure which was shows the two LED indicators, one was for power indication and
another one for checking the PIC programmed or not.
VI.
Discussion On Results
Well the results of the complete system were as expected, but in sometimes it was not picking up the
commands entered by the user through the hyper terminal. There was some problem with the playback of the
sound possible causes may be compatibility issues of TTS IC. The Embedded C program was logical right and
DOI: 10.9790/2834-10637794
www.iosrjournals.org
92 | Page
VII.
Conclusion
After going through this task it was evident that to develop complete system even if its looks simple at
first look lots of effort and dedication is required. Various fields required to be combined like good software
knowledge, electronics and analytical skills. Sometimes a simple appearing task may take a long time to work
accordingly, as happened while tried to achieve a proper communication between microcontroller and the TTS
IC. Every element is required to be looked after. Analyzing why something fails or did not worked as required
helps a lot and makes the vision clear brings success closer and enhances knowledge.
The work on the project was quite satisfactory, but still there were some shortcomings like sometimes
system was not working at all, so some changes are suggested in software and electronic design. Another major
addition that could be done in future design is the addition of on board debugging mechanism to attach the ICD
kit to the main board this could be easily done by adding a RJ connector. Various features like audio filter
circuit which would only allow the audible frequencies to pass to the speaker could be added along with some
amplifier and volume control.
In the embedded program additional commands could be added like for volume control, the duration
for which TTS is required to be in play or record mode. Some features like to monitor the errors on the system
could be added and if any error occurs then could be displayed on the hyper terminal, and even intimate the user
by producing speech from the TTS. There is no limit as such to the additions that could be done in the future to
this system as the developed project is just the basic one so many enhancements are possible.
References
[1].
[2].
[3].
[4].
[5].
[6].
[7].
[8].
[9].
[10].
[11].
[12].
[13].
[14].
[15].
[16].
[17].
[18].
[19].
[20].
[21].
[22].
[23].
[24].
[25].
Juang, B.H. Tsuhan Chen, The past, present, and future of speech processing Volume 15, Issue 3, ISSN: 1053-5888, Pages 24-48.
Henton, Caroline, Challenges and Rewards in Using Parametric or Concatenative Speech synthesis, ISSN: 1381-2416.
Kim, Jongkuk - Hahn, Hernsoo Yoon, Uei-Joong-Bae, Myungjin, On a pitch alteration for speech synthesis systems, year 2009,
volume 50, issue 4, pages 435-446, ISSN:0929-6212.
Rahier, M.C. , Defraeye, P.J. Guebels, P.-P, Patovan, B, A 3 m NMOS high performance LPC speech synthesizer chip, Volume
18, Issue 3, Pages 349-359, ISSN:0018-9200.
Henton, Caroline, Challenges and rewards in using parametric or concatenative speech synthesis, Year
2002, Volume 5, Issue 2, Pages 117-131, ISSN: 1381-2416.
Holfelder, Wieland, Interactive remote recording and playback of multicast videoconferences, Year 1998, Volume 21, Issue 15,
Pages 1285-1294, ISSN: 0140-3664.
Kim, Jong Kuk-Hahn, Hern Soo-Bae, Myung Jin, On a speech multiple system implementation foe synthesis, year 2009, volume 49,
issue 4, ISSN: 0929-6212.
Hilt, V. Mauve, M. Vogel, J. Effelsberg, W, Recording and playing back interactive media streams, Date Oct 2005, volume 7 issue
5,pages 960-971, ISSN: 1520-9210
A. K. Rath and P.K. Meher, Design of a merged DSP microcontroller for embedded systems using discrete orthogonal transform,
Year 2006, pages 388-394, ISSN: 1549-3636.
Schroeder, E.F. platte, H. J. Krahe, D., MSC :stereo audio coding with CD-quality and 256 kBit/sec, Date: Nov.1987, Volume
CE-33, Issue 4, Pages 512-519, ISSN: 0098-3063.
Geoff Jackson, Peter Holzmann, A Single-chip Text-to-Speech Synthesis Device Utilizing Analog Nonvolatile Multilevel Flash
Storage, Issue Date: NOV.2002, ISSN: 0018-9200.
Liu, Fu-Hua-Gu, Liang-Gao, Machael, Applications of Language Modelling in Speech-to-Speech Translation table of content, Year2004, ISSN: 1381-2416.
Lee, C, -H; Jung, S.-K. ; Kang, H. G., Applying a speaker-Dependent speech compression Technique to concatenative TTS
synthesizers, Year- 2007, ISSN: 1558-7916.
Henton, Caroline, Challenges and Rewards in using Parametric or concatenative speech synthesis, Year- 2002, Issn: 1381-2416.
Goldsmith, John, Dealing with Prosody in a tex-to-speech system table of content, Year- 1999, ISSN: 1381-2416.
Sotiris Karabetsos, Pirros Tsiakoulis, Embedded unit selection Text-to-speech synthesis, Year- 2009, ISSN: 0098-3063.
Kim, Jong kuk-Hahn, Hern Soo- Bae, Myung Jin, On a speech Multiple system Implementation for speech synthesis table of
content, Year- 2009, ISSN: 0929-6212.
Taylor & Francis, Technical notes on Audio Recording, Year- 2000, ISSN: 0835-1813.
El-Imam, Yousif A. Don, Zuraida Mohammed, Text-to-Speech conversion of standard Malay, Year- 1999, ISSN: 1381-2416.
Steve Heath, Embedded System Design, Second Edition, ISBN 0-7506-5546-1.
Holzmann, P. Jackson, G.B. Raina, A. Hung-chuan pai Ming-Bing Chang Awsare, S.V. Engh, L.D. Kao,O.C. Palmer, C.R. ChunMai Liu Kordesch, A.V. Ken su Hemming,M. An analog and digital record, playback and processing system using a coarse-fine
programming method, Date 18-20 April 2001, pages 192-195, ISBN: 0-7803-6412-0.
Embedded C by Michale J Pont, Pearson International, ISBN-0173290532, year of publication 2002
Rice, S.V. Bailey, S.M, An integrated environment for audio recording,editing, and retrieval, Date Dec 2006, ISBN: 0-7695-27469.
Kumar, K. Raja Ramaiah, P. Seetha, DSP and Microcontroller based speech processor for auditory prosthesis, Date 20-23 Dec
2006, pages 518-522, ISBN: 1-4244-0716-8.
DOI: 10.9790/2834-10637794
www.iosrjournals.org
93 | Page
[28].
Thiang, Implementation of speech recognition on MCS51 microcontroller for controlling wheelchair, Date : 25-28 Nov 2007, pages
1193-1198, ISBN: 978-1-4244-1355-3.
Sarathy, K.P. Ramakrishnan, A.G, A research bed for unit selection based text to speech synthesis, Date 15-19 Dec 2008, pages
229-232, ISBN: 978-1-4244-3471-8.
PIC Microcontroller and Embedded Systems by Mazidi, Muhammad Ali, Pearson International, ISBN- 0136009026, Year of
publication 2007.
DOI: 10.9790/2834-10637794
www.iosrjournals.org
94 | Page