Unsupervised Learning in Recurrent Neural Networks
M. Klapper-Rybicka, N. N. Schraudolph, and J. Schmidhuber. Unsupervised Learning in Recurrent Neural Networks. In Proc. Intl. Conf. Artificial Neural Networks (ICANN), pp. 674–681, Springer Verlag, Berlin, Vienna, Austria, 2001.
Download
| 186.8kB | 89.1kB | 74.8kB |
Abstract
While much work has been done on unsupervised learning in feedforward neural network architectures, its potential with (theoretically more powerful) reecurrent networks and time-varying inputs has rarely been explored. Here we train Long Short-Term Memory (LSTM) recurrent networks to maximize two information-theoretic objectives for unsupervised learning: Binary Information Gain Optimization and Nonparametric Entropy Optimization. LSTM learns to discriminate different types of temporal sequences and group them according to a variety of features.
BibTeX Entry
@inproceedings{KlaSchSch01,
author = {Magdalena Klapper-Rybicka and Nicol N. Schraudolph
and J\"urgen Schmid\-huber},
title = {\href{http://nic.schraudolph.org/pubs/KlaSchSch01.pdf}{
Unsupervised Learning in Recurrent Neural Networks}},
pages = {674--681},
editor = {Georg Dorffner and Horst Bischof and Kurt Hornik},
booktitle = icann,
address = {Vienna, Austria},
volume = 2130,
series = {\href{http://www.springer.de/comp/lncs/}{
Lecture Notes in Computer Science}},
publisher = {\href{http://www.springer.de/}{Springer Verlag}, Berlin},
year = 2001,
b2h_type = {Top Conferences},
b2h_topic = {>Entropy Optimization},
abstract = {
While much work has been done on unsupervised learning in feedforward
neural network architectures, its potential with (theoretically more
powerful) reecurrent networks and time-varying inputs has rarely been
explored. Here we train Long Short-Term Memory (LSTM) recurrent networks
to maximize two information-theoretic objectives for unsupervised
learning: \href{http://nic.schraudolph.org/bib2html/b2hd-nips92.html}{
Binary Information Gain Optimization} and
\href{http://nic.schraudolph.org/bib2html/b2hd-emma}{
Nonparametric Entropy Optimization}. LSTM learns to discriminate
different types of temporal sequences and group them according to a
variety of features.
}}