Learning discriminative features in sequence training without requiring framewise labelled data

Wang, Jun; Su, Dan; Chen, Jie; Feng, Shulin; Ma, Dongpeng; Li, Na; Yu, Dong

arXiv:1905.06907 (cs)

[Submitted on 16 May 2019]

Abstract: In this work, we try to answer two questions: Can deeply learned features with discriminative power benefit an ASR system's robustness to acoustic variability? And how can they be learned without framewise labelled sequence training data? Because existing methods usually require knowing where the labels occur in the input sequence, their applicability to many real-world sequence learning tasks has so far been limited. We propose a novel method that models sequence discriminative training and feature discriminative learning simultaneously within a single network architecture, so that discriminative deep features can be learned during sequence training without presegmented training data. Our experiment on a realistic industrial ASR task shows that, without any specific fine-tuning or additional complexity, the proposed models consistently outperform state-of-the-art models and significantly reduce Word Error Rate (WER) under all test conditions. The largest improvements occur under unseen noise conditions, with relative WER reductions of 12.94%, 8.66% and 5.80%, showing that the proposed models generalize better to acoustic variability.
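The WER improvements in the abstract are stated as relative reductions, which measure the gain as a fraction of the baseline error rather than as a difference in percentage points. A minimal sketch of that arithmetic follows; the function name and the WER values are hypothetical and are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, new_wer: float) -> float:
    """Return the relative WER reduction, in percent, of new_wer versus baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Hypothetical WER values (in percent), chosen only to illustrate the formula.
baseline = 20.0
improved = 18.0

# An absolute drop of 2.0 points from a 20.0% baseline is a 10.00% relative reduction.
print(f"{relative_wer_reduction(baseline, improved):.2f}%")
```

Under this convention, the same absolute drop counts as a larger relative reduction when the baseline WER is lower.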

Comments:	Accepted in ICASSP 2019 lecture session
Subjects:	Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1905.06907 [cs.LG]
	(or arXiv:1905.06907v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.06907

Submission history

From: Jun Wang
[v1] Thu, 16 May 2019 16:58:25 UTC (259 KB)
