Correlation Speech Recognition PDF
Correlation Speech Recognition PDF
ABSTRACT
Purpose: to check the correlation between Speech Reception Threshold and Index of Speech
Recognition with mean audiometric results. Methods: we selected 241elderly patients who underwent
examinations of the pure tone audiometry and speech audiometry. As inclusion, audiometry should
have a sensorineural hearing loss. The tone thresholds for air obtained were classified according
with the following averages: Average 1 – Average of frequencies of 500, 1000 and 2000 Hz; Average
2 – Average of frequencies of 500, 1000, 2000 and 4000 Hz; Average 3 – average of frequencies of
500, 1000, 2000 and 3000 Hz; and 4 average – average of frequencies of 500, 1000, 2000, 3000
and 4000 Hz. The data were compared with Speech Reception Threshold and Index of Speech
Recognition, and treated statistically. Results: Average 1 showed higher correlation with the Speech
Reception Threshold (rho = 0.934, CI = 0.901 to 0.958; eqm = 52.2). In relation to the Index of Speech
Recognition, it was observed that the average 3 showed the highest degree of correlation with the test
(rho = – 0.768, CI = –0.807 to –0.721; eqm = 245) followed averages 2 and 4. Conclusion: for elderly
people with ski slop sensorineural hearing loss, the Speech Reception Threshold has the strongest
correlation with the average frequencies 500 Hz, 1000 Hz and 2000 Hz, while the Index of Speech
Recognition has the highest correlation with the average which include the frequencies 3000 Hz and
4000 Hz.
contribute 60 % of it, while vowels contribute only correlation with the speech recognition threshold
40%9. Due to the spectral characteristics of these (SRT) and with the speech discrimination score
sounds and the range of human audibility, one can (SDS).
understand why individuals with hearing loss at high
frequencies have impaired speech recognition. METHODS
In audiology, speech recognition ability is
measured by the tests Speech Reception Threshold The present study was approved by the Research
(SRT) and Speech Discrimination Score (SDS). Ethics Committee of the Federal University of
The SRT corresponds to the softest sound intensity Minas Gerais (UFMG) under protocol no. 155 09.
level at which an individual can recognize 50% An informed consent form was prepared explaining
of the common words given. Typically, SRT and the theme of the study, its aims and the importance
SDS values are consistent with the average of the of analyzing the audiometric tests of the individuals
hearing thresholds obtained for the speech-related selected for the study.
frequencies. The SDS evaluates speech discrimi- This comparative study based on the results of
nation using a list of monosyllables and bisyllables audiometric tests was conducted with an elderly
40 dB above the SRT thresholds10. population who received care at the Instituto Jenny
The classification of hearing losses is a widely de Andrade Faria, an annex to the UFMG Hospital
discussed topic in speech-language pathology das Clínicas.
and audiology. In Brazil, hearing losses have The study employed a convenience sample of
been predominantly classified on the basis of the all the elderly individuals with a complaint of hearing
frequencies of speech relative to the tritone average loss who underwent audiometric evaluation with
of the air-conduction thresholds for 500 Hz, 1000 measurements of pure-tone hearing thresholds for
Hz and 2000 Hz11,12. However, a number of authors air and bone conduction and speech audiometry
have advocated the use of hearing loss classifica- consisting of the SRT and SDS tests between April
tions by frequency ranges in order to include the 2011–April 2012. Some exclusion criteria were
high frequencies in composing these averages, established: individuals with mixed or conductive
especially for elderly individuals13,14. hearing loss in at least one ear, or incomplete/
The Bureau Internacional d’ Audiophonologie inconclusive tests, e.g, when pure-tone thresholds
(BIAP, 2005) recommended that the classification of over the frequencies of 500 Hz to 4000 Hz were not
hearing losses should take into account the averages obtained or speech audiometry was not performed.
of the pure-tone thresholds for air conduction at In total, 241 tests were analyzed: 153 (63.5%)
500 Hz, 1000 Hz, 2000 Hz and 4000 Hz in order to of female patients and 88 (36.5%) of male patients.
encompass the high frequencies, as these are the Ages ranged from 60 to 97 years (mean, 77.9 years).
most compromised in the pathologies of hearing15. The results of 482 ears were evaluated.
Because the frequency ranges of consonants The present study examined the pure-tone
above 2000 Hz are the chief constituents of speech hearing thresholds for air conduction obtained
intelligibility, Russo proposed, in 2009, that the through the analysis of pure-tone audiometry based
degree of hearing loss should be classified based on the following averages:
on the average audibility threshold obtained over • Average 1: tritone average of frequencies 500
the frequencies of 500 Hz to 4000 Hz13. Hz, 1000 Hz and 2000 Hz according to Davis
The classifications of hearing losses relying on and Silverman, 197011 and Lloyd and Kaplan,
the tritone average as suggested by Lloyd & Kaplan 197812;
(1978)12 and Davis & Siverman (1970)11 were found • Average 2: pure-tone average of frequencies 500
to be adequate to classify the hearing losses with a Hz, 1000 Hz, 2000 Hz and 4000 Hz according
flat audiometric configuration. However, in ski slope to Recommendation 02/1 of the Bureau
hearing losses, such as presbycusis, this classi- Internacional d’Audiophonologie (BIAP)15;
fication is not always consistent with the patients’ • Average 3: pure-tone average of frequencies
complaints, as these refer to difficulties in speech 500 Hz, 1000 Hz, 2000 Hz and 3000 Hz, and
recognition, mostly. • Average 4: pure-tone average of frequencies
Given that the major auditory function concerns 500 Hz, 1000 Hz, 2000 Hz, 3000 Hz and 4000
verbal communication and speech recognition Hz.
abilities, it is paramount that the speech tests be The data were treated statistically using the R
considered in the classification of hearing losses, software. Distribution of frequencies was performed
thus expressing the actual hearing impairment of for the categorical variable (sex). For the continuous
the patients. The aim of the present study was to variables (age, frequencies, SRT and SDS),
assess which pure-tone averages have a stronger measures of central tendency (mean and median)
Frequency (Hz)
Table 1 shows the descriptive analysis of the 4000 Hz, Average 1, Average 2, Average 3 and
threshold averages by frequency, the four calcu- Average 4 with the SRT and SDS. In addition,
lated pure-tone averages, the SRT in dBHL, and the confidence intervals and mean squared errors were
SDS in percent values. calculated, which served to identify the analyses
Correlations were established for the individual with the highest predictive value for the SRT and
frequencies 500 Hz, 1000 Hz, 2000 Hz, 3000 Hz, SDS. The results can be seen in Table 2.
Table 1 – Descriptive analysis of the pure-tone frequencies, calculated means, mean SRT and SDS in
the study sample
Table 2 – Correlations of the individual frequencies 500 Hz, 1000 Hz, 2000 Hz, 3000 Hz and 4000 Hz,
Average 1, Average 2, Average 3 and Average 4 with the SRT and the SDS
SRT SDS
Variable CI MSE CI MSE
Rho Rho
CI.inf CI.up (dBHL2) CI.inf CI.up (dBHL2)
500 Hz 0.807* 0.765 0.844 108.4 -0.569 -0.634 -0.501 322.9
1000 Hz 0.889* 0.857 0.915 71.5 -0.675** -0.720 -0.620 284.2
2000 Hz 0.841* 0.800 0.875 88.8 -0.735** -0.778 -0.688 282.1
3000 Hz 0.700 0.633 0.752 138.2 -0.696** -0.745 -0.637 323.4
4000 Hz 0.636 0.574 0.691 162.4 -0.653** -0.706 -0.589 348.1
Average 1 0.934* 0.901 0.958 52.2+ -0.734** -0.776 -0.683 251.6
Average 2 0.918* 0.887 0.941 59.0 -0.768** -0.811 -0.720 245.3
Average 3 0.922* 0.886 0.944 56.4 -0.768** -0.807 -0.721 245.0+
Average 4 0.895* 0.861 0.921 68.7 -0.773** -0.813 -0.725 251.2
Average 1- mean of frequencies 500 Hz, 1000 Hz and 2000 Hz
Average 2- mean of frequencies 500 Hz, 1000 Hz, 2000 Hz and 4000 Hz
Average 3- mean of frequencies 500 Hz, 1000 Hz, 2000 Hz and 3000 Hz
Average 4- mean of frequencies 500 Hz, 1000 Hz, 2000 Hz, 3000 Hz and 4000 Hz
Rho- Spearman’s coefficient of correlation
MSE- Mean Squared Error
SRT- Speech Reception Threshold
SDS- Speech Discrimination Score
CI- Confidence Interval
CI inf- lower limit of Confidence Interval
CI up- upper limit of Confidence Interval
*very good correlation
**good correlation
+
best variable as a predictor of correlation
Spearman’s coefficient of correlation test
Mean Squared Error test
Figure 2 – Correlation matrix with hypothesis testing p-values comparing the significance of the
correlations for the SRT and SDS
prediction error of Average 1 relative to the SRT. In are more frequent in the patient’s vocabulary, which
other words, Average 1 is the best predictor of SRT increases redundancy and the chances of correct
values. Additionally, Fig. 2 shows that the correlation guesses. This makes guessing with the SRT test
of SRT and Average 1 differs statistically from the easier than with the SDS.
correlations between SRT and the other averages.
This proves that, even with such narrow confidence A recent study compared the audiologic perfor-
intervals (Table 2), the correlation of Average 1 with mance of elderly individuals based on the classifi-
the SRT can be regarded as the strongest. Thus, cations of Davis and Silverman, 197011 and on the
this result indicates that the averages of 500 Hz, Recommendation 02/1 of the Bureau Internacional
1000 Hz and 2000 Hz used to compose Average 1 d’Audiophonologie (BIAP)15. The authors reviewed
are key frequencies to estimating the SRT. This fact the records of 140 elderly individuals focusing on
can be confirmed by the analysis of the individual the clinical history and the pure-tone audiometry
frequencies in Table 2. results. Predominance of mild to moderate sensori-
The correlation values for the SDS are negative, neural hearing loss was found, with slight differences
since they are inversely proportional: as averages regarding its prevalence depending on the hearing
increase, SDS percentages decline. All the averages loss classification adopted. Based on Davis and
had good correlation with the SDS, with statisti- Silverman, 99 cases of ears with normal thresholds
cally superior correlation values for Averages 2, 3
were found, while only 66 cases were identified
and 4 (Table 2 and Fig. 2). As Fig. 2 illustrates, the
according to the BIAP recommendation. The study
correlation between Average 1 and the SDS, albeit
good, is statistically inferior. On the other hand, no concluded that the classifications achieved similar
statistically significant difference was found when results; however, the BIAP recommendation was
comparing the correlations between Averages 2 found to be more sensitive in detecting hearing
and 3; 2 and 4, and 3 and 4. This finding indicates losses in the elderly27. A study conducted in Finland
that the correlations of the SDS and Averages with 5400 volunteers aged 55–75 years demon-
2, 3 and 4, in addition to being superior, are also strated a remarkable difference between the hearing
quite similar; therefore, it is impossible to infer loss classification by the World Health Organization
which of these three averages is the best predictor. (WHO) criteria compared with the European Union
Nevertheless, these results allow us to conclude criteria. In that study, the percentage of individuals
that the frequencies 3000 Hz and 4000 Hz are
with normal hearing was greater when the WHO
important for speech recognition, since the corre-
lation of Average 1 was lower with the SDS, and classification was used28.
statistically different as compared with the other The results of the analyses enabled us to
three averages. To corroborate these findings, the conclude that speech discrimination as measured
MSE (Table 2) was lower for Average 3, followed by the SDS is influenced by the frequencies 3000
by Average 2—albeit with quite similar values. This Hz and 4000 Hz. For that reason, the importance
shows that both averages, 2 and 3, have a greater of including these frequencies in the pure-tone
predictive value regarding the SDS, i.e., the more average used for classifying the hearing loss
severe the impairment at 3000 Hz and 4000 Hz, the
should be emphasized, since one of the objectives
worse the performance in the SDS test. This finding
of this classification is to reflect a person’s hearing
is explained in the literature7,9 which indicates that
60% of speech intelligibility relies on frequencies abilities and difficulties especially with regard to
above 1000 Hz. communication.
The contribution of high frequencies to speech
recognition was more evident in the SDS than in the CONCLUSION
SRT. We believe this occurred because in the SDS
test, whose purpose is to assess the percentage of
For the elderly population with ski slope senso-
speech recognition, one-syllable words are used,
as they provide fewer clues in speech and are less rineural hearing loss, the SRT is more strongly
redundant, which reduces the likelihood of correct correlated with the average of frequencies 500
guesses. In the SRT evaluation, the purpose is Hz, 1000 Hz and 2000 Hz, while the SDS shows
to assess the threshold of speech recognition; greater correlation with the average that includes
therefore, three-syllable words are used, as they frequencies 3000 Hz and 4000 Hz.
RESUMO
Objetivo: verificar quais médias tonais possuem maior correlação com o Limiar de Recepção da Fala
e com o Índice de Reconhecimento da Fala. Métodos: foram selecionados 241 exames de pacientes
idosos com perda auditiva neurossensorial que realizaram audiometria tonal liminar e logoaudiome-
tria. As avaliações audiométricas foram classificados com base nos limiares tonais de via aérea das
seguinte formas: Média 1- Média das frequências de 500, 1000 e 2000 Hz; Média2-Média das frequ-
ências de 500, 1000, 2000 e 4000 Hz; Média 3 – Média das frequências de 500, 1000, 2000 e 3000 Hz
e Média 4 –Média das frequências de 500, 1000, 2000, 3000 e 4000 Hz. Os dados foram comparados
com os testesLimiar de Recepção da Fala e Índice de Reconhecimento da Fala e tratados estatistica-
mente. Resultados: a Média 1 apresentou maior valor de correlação com o Limiar de Recepção da
Fala (rho=0,934; IC=0,901 a 0,958; eqm=52,2). Em relação ao Índice de Reconhecimento da Fala, foi
observado que amédia 3 apresentou omaior grau de correlação com o teste(rho= – 0,768; IC= –0,807
a –0,721; eqm = 245) seguido das médias 2 e 4. Conclusão: para a população idosa com perda audi-
tiva neurossensorial descendente, o Limiar de Recepção de Fala possui correlação mais forte com a
média das frequências 500 Hz, 1000 Hz e 2000 Hz, enquanto o Índice de Reconhecimento de Fala
possui maior correlação com as média que incluem as frequências de 3000 Hz e 4000 Hz.
frequentam a Universidade Aberta à Terceira Idade. 24. Studebaker GA, Sherbecoe RL, McDaniel DM,
Salusvita. 2004;23(2):279-90. Gwaltney CA. Monosyllabic word recognition at
19. Mattos LC, Veras RP. A prevalência da perda higher-than-normal speech and noise levels. J.
auditiva em uma população de idosos da cidade Acoust. Soc. Am. 1999;105(4):2431-44.
do Rio de Janeiro: um estudo seccional. Braz J
Otorhinolaryngol. 2007;73(5):654-9. 25. Martini A, Mazzoli M, Rosignoli M, Trevisi P,
20. Russo ICP. Achados audiométricos em uma Maggi S, Enzi G, et al. Hearing in the elderly: a
população de idosos presbiacúsicos brasileiros em population study. Audiology. 2001;40(6):285-93.
função do sexo e da faixa etária. Pró-Fono R Atual 26. Kano CE, Mezzena LH, Guida HL. Estudo
Cient. 1993;5(1):8-10. comparativo da classificação do grau de perda
21. Pedalini M, Liberman P, Piranas S, Jacob W,
auditiva em idosos institucionalizados. Rev CEFAC.
Câmara J, Miniti A Análise do perfil audiológico de
2009;11(3):473-7.
idosos através de testes de função auditiva
periférica e central. Rev Soc Bras Fonoaudiol. 27. Tenório JP, Guimarães JATL, Flores NGC, Iório
1997;5(63):489-95. MCM. Comparação entre critérios de classificação
22. Felder E, Schrott, EA. Quantitative evaluation dos achados audiométricos em idosos. J. Soc. Bras.
of myelinated nerve fibres and hair cells in cochlea Fonoaudiol. 2011;23(2):114-8.
of humans with age-related high-tone hearing loss.
28. Uimonen S, Maki-Torkko E, Jounio-Ervasti
Hear. Res. 1995;91:19-32.
23. Fish I. The selective and differential vulnerability K, Sorri M. Hearing in 55 to 75 year old people in
of the auditory system. In: Wolstenholmeand GEW, northern Finland-a comparison of two classifications
Knight J (eds). Sensorioneural Hearing Loss. New of hearing impairment. Acta Otolaryngol.
York, Churchill-Livingstone, 1970. 1997;33:53-9.
Mailing address:
Letícia Pimenta Costa Guarisco
Rua Ouro Preto, 1275/ 04, Santo Agostinho
Belo Horizonte – MG – Brasil
CEP: 30170-041
E-mail: [email protected]