Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

Mozaffari, M. Hamed; Lee, Won-Sook

doi:10.1121/1.5137211

Computer Science > Machine Learning

arXiv:1906.04301 (cs)

[Submitted on 10 Jun 2019]

Title:Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

Authors:M. Hamed Mozaffari, Won-Sook Lee

View PDF

Abstract:Medical ultrasound technology is widely used in routine clinical applications such as disease diagnosis and treatment as well as other applications like real-time monitoring of human tongue shapes and motions as visual feedback in second language training. Due to the low-contrast characteristic and noisy nature of ultrasound images, it might require expertise for non-expert users to recognize tongue gestures. Manual tongue segmentation is a cumbersome, subjective, and error-prone task. Furthermore, it is not a feasible solution for real-time applications. In the last few years, deep learning methods have been used for delineating and tracking tongue dorsum. Deep convolutional neural networks (DCNNs), which have shown to be successful in medical image analysis tasks, are typically weak for the same task on different domains. In many cases, DCNNs trained on data acquired with one ultrasound device, do not perform well on data of varying ultrasound device or acquisition protocol. Domain adaptation is an alternative solution for this difficulty by transferring the weights from the model trained on a large annotated legacy dataset to a new model for adapting on another different dataset using fine-tuning. In this study, after conducting extensive experiments, we addressed the problem of domain adaptation on small ultrasound datasets for tongue contour extraction. We trained a U-net network comprises of an encoder-decoder path from scratch, and then with several surrogate scenarios, some parts of the trained network were fine-tuned on another dataset as the domain-adapted networks. We repeat scenarios from target to source domains to find a balance point for knowledge transfer from source to target and vice versa. The performance of new fine-tuned networks was evaluated on the same task with images from different domains.

Comments:	3 figures, 9 pages, 1 table, 16 references
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
Cite as:	arXiv:1906.04301 [cs.LG]
	(or arXiv:1906.04301v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.04301
Journal reference:	The Journal of the Acoustical Society of America 146, 2940 (2019)
Related DOI:	https://doi.org/10.1121/1.5137211

Submission history

From: Mohammad Hamed Mozaffari [view email]
[v1] Mon, 10 Jun 2019 22:17:08 UTC (567 KB)

Computer Science > Machine Learning

Title:Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators