Occam's Gates

Raiman, Jonathan; Sidor, Szymon

Computer Science > Machine Learning

arXiv:1506.08251 (cs)

[Submitted on 27 Jun 2015]

Title:Occam's Gates

Authors:Jonathan Raiman, Szymon Sidor

View PDF

Abstract:We present a complimentary objective for training recurrent neural networks (RNN) with gating units that helps with regularization and interpretability of the trained model. Attention-based RNN models have shown success in many difficult sequence to sequence classification problems with long and short term dependencies, however these models are prone to overfitting. In this paper, we describe how to regularize these models through an L1 penalty on the activation of the gating units, and show that this technique reduces overfitting on a variety of tasks while also providing to us a human-interpretable visualization of the inputs used by the network. These tasks include sentiment analysis, paraphrase recognition, and question answering.

Comments:	In review at NIPS
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1506.08251 [cs.LG]
	(or arXiv:1506.08251v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1506.08251

Submission history

From: Szymon Jozef Sidor [view email]
[v1] Sat, 27 Jun 2015 03:03:10 UTC (385 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2015-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jonathan Raiman
Szymon Sidor

export BibTeX citation

Computer Science > Machine Learning

Title:Occam's Gates

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Occam's Gates

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators