Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

Alberti, Michele; Seuret, Mathias; Pondenkandath, Vinaychandran; Ingold, Rolf; Liwicki, Marcus

doi:10.1145/3151509.3151519

Computer Science > Computer Vision and Pattern Recognition

arXiv:1710.07363 (cs)

[Submitted on 19 Oct 2017]

Title:Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

Authors:Michele Alberti, Mathias Seuret, Vinaychandran Pondenkandath, Rolf Ingold, Marcus Liwicki

View PDF

Abstract:In this paper, we present a novel approach to perform deep neural networks layer-wise weight initialization using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with: random values, greedy layer-wise pre-training (usually as Deep Belief Network or as auto-encoder) or by re-using the layers from another network (transfer learning). Hence, many training epochs are needed before meaningful weights are learned, or a rather similar dataset is required for seeding a fine-tuning of transfer learning. In this paper, we describe how to turn an LDA into either a neural layer or a classification layer. We analyze the initialization technique on historical documents. First, we show that an LDA-based initialization is quick and leads to a very stable initialization. Furthermore, for the task of layout analysis at pixel level, we investigate the effectiveness of LDA-based initialization and show that it outperforms state-of-the-art random weight initialization methods.

Comments:	5 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1710.07363 [cs.CV]
	(or arXiv:1710.07363v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1710.07363
Journal reference:	ICDAR-HIP 2017
Related DOI:	https://doi.org/10.1145/3151509.3151519

Submission history

From: Michele Alberti [view email]
[v1] Thu, 19 Oct 2017 22:43:47 UTC (6,052 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

Submission history

Access Paper:

Current browse context:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators