Modelling Time Series with Neural Networks

Volker Tresp
Summer 2017

Modelling of Time Series

• The next figure shows a time series (DAX)

• Other interesting time series: energy prices, energy consumption, gas consumption, copper prices, ...

Neural Networks for Time-Series Modelling

• Let z_t, t = 1, 2, . . . be the time-discrete time series of interest (example: DAX)

• Let x_t, t = 1, 2, . . . denote a second time series that contains information on z_t (example: Dow Jones)

• For simplicity, we assume that both z_t and x_t are scalar. The goal is the prediction of the next value of the time series

• We assume a system of the form

z_t = f(z_{t−1}, . . . , z_{t−T}, x_{t−1}, . . . , x_{t−T}) + ε_t

with i.i.d. random variables ε_t, t = 1, 2, . . . , which model unknown disturbances.

Neural Networks for Time-Series Modelling (cont’d)

• We approximate, using a neural network,

f(z_{t−1}, . . . , z_{t−T}, x_{t−1}, . . . , x_{t−T}) ≈ f_{w,V}(z_{t−1}, . . . , z_{t−T}, x_{t−1}, . . . , x_{t−T})

and obtain the cost function

cost(w, V) = Σ_{t=1}^{N} (z_t − f_{w,V}(z_{t−1}, . . . , z_{t−T}, x_{t−1}, . . . , x_{t−T}))^2

• The neural network can be trained as before with simple backpropagation if in training all z_t and all x_t are known!

• This is a NARX model: Nonlinear Auto-Regressive model with external inputs. Another name: TDNN (time-delay neural network). (A code sketch follows at the end of this slide.)

• Note the "convolutional" idea in TDNNs: the same weights are applied to a sliding window of the time series


• Language model: last T words as input. The task is to predict the next word in a
sentence. One-hot encoding.
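
• As a concrete illustration of the NARX/TDNN idea, here is a minimal Python sketch: lagged values of z and x are stacked into an input vector and a small feedforward network is fitted by minimizing the squared-error cost. The toy data and all names (N, T_lags, narx, ...) are made up for illustration; scikit-learn's MLPRegressor stands in for the neural network f_{w,V}.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

# Minimal NARX/TDNN sketch on synthetic data (no real DAX/Dow Jones series).
rng = np.random.default_rng(0)
N, T_lags = 500, 5                       # series length, number of lags T
x = np.sin(np.linspace(0, 20, N)) + 0.1 * rng.standard_normal(N)   # "external" series x_t
z = 0.8 * np.roll(x, 1) + 0.1 * rng.standard_normal(N)             # target series z_t

# Build the lagged input (z_{t-1},...,z_{t-T}, x_{t-1},...,x_{t-T}) for each t
rows = range(T_lags, N)
X = np.array([np.r_[z[t - T_lags:t][::-1], x[t - T_lags:t][::-1]] for t in rows])
y = z[T_lags:]

# f_{w,V} is approximated by a one-hidden-layer network; training minimizes the squared-error cost
narx = MLPRegressor(hidden_layer_sizes=(20,), max_iter=2000, random_state=0).fit(X, y)
print("one-step prediction for the last time step:", narx.predict(X[-1:]))
```
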
Multiple-Step Prediction

• Predicting more than one time step in the future is not trivial

• The future inputs are not available

• The model noise needs to be properly considered in multiple step prediction (for
example by a stochastic simulation); if possible one could also simulate future inputs
(multivariate prediction)

• Free-running prediction: use the predicted output as input for the next step (can be risky, since prediction errors accumulate); this is the opposite of teacher forcing, where the true previous value is fed in during training (a sketch follows below)
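
• A minimal sketch of stochastic multi-step prediction, assuming a one-step model and a noise level are already available (the one_step function and sigma_eps below are placeholders): the prediction is fed back as input and the disturbance ε_t is sampled at every step, so averaging over many simulated trajectories approximates the multi-step predictive distribution.

```python
import numpy as np

# Stochastic multi-step prediction: roll the one-step model forward, sampling the
# noise term at every step, and average over simulated trajectories.
rng = np.random.default_rng(0)

def one_step(lags):                       # placeholder for the trained f_{w,V}(z_{t-1},...,z_{t-T})
    return 0.9 * lags[0] - 0.2 * lags[1]

def multi_step_forecast(last_lags, horizon, sigma_eps=0.1, n_sims=1000):
    sims = np.empty((n_sims, horizon))
    for s in range(n_sims):
        lags = list(last_lags)            # most recent value first: [z_t, z_{t-1}, ...]
        for h in range(horizon):
            z_next = one_step(lags) + rng.normal(0.0, sigma_eps)   # simulate the disturbance
            sims[s, h] = z_next
            lags = [z_next] + lags[:-1]   # feed the simulated value back in as input
    return sims.mean(axis=0), sims.std(axis=0)    # predictive mean and spread per horizon

mean, std = multi_step_forecast(last_lags=[1.0, 0.8], horizon=5)
print(mean, std)
```
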

Recurrent Neural Network

• Recurrent Neural Networks are powerful methods for time series and sequence modelling

Generic Recurrent Neural Network Architecture

• Consider a feedforward neural network where there are connections between the hidden units

z_{t,h} = sig(z_{t−1}^T a_h + x_t^T v_h)

and, as before,

ŷ_t = sig(z_t^T w)

• Here, z_t = (z_{t,1}, z_{t,2}, . . . , z_{t,H})^T and x_t = (x_{t,0}, x_{t,1}, . . . , x_{t,M−1})^T

• In Recurrent Neural Networks (RNNs) the next state of the neurons in the hidden layer depends on their previous state; neither state is directly measured

• a_h, w, v_h are weight vectors

• Note that in most applications, one is interested in the output y_t (and not in z_{t,h})

• The next figure shows an example. Only some of the recurrent connections are shown (blue). The blue connections also model a time lag. Without recurrent connections (a_h = 0, ∀h), we obtain a regular feedforward network
• Note that a recurrent neural network has an internal memory (a sketch of one recurrent update step follows below)
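
• A minimal sketch of the recurrent update defined above, with random untrained weights and illustrative dimensions; it only demonstrates how the hidden state z_t acts as internal memory.

```python
import numpy as np

# One recurrent update step: z_t = sig(A z_{t-1} + V x_t),  y_hat_t = sig(w^T z_t)
rng = np.random.default_rng(0)
H, M = 4, 3                                   # number of hidden units, input dimension
A = 0.1 * rng.standard_normal((H, H))         # rows are the weight vectors a_h
V = 0.1 * rng.standard_normal((H, M))         # rows are the weight vectors v_h
w = 0.1 * rng.standard_normal(H)

sig = lambda u: 1.0 / (1.0 + np.exp(-u))

def rnn_step(z_prev, x_t):
    z_t = sig(A @ z_prev + V @ x_t)           # hidden state depends on its previous value and the input
    y_hat = sig(w @ z_t)                      # output read off the hidden state
    return z_t, y_hat

z = np.zeros(H)                               # internal memory, initialised to zero
for x_t in rng.standard_normal((10, M)):      # run the network over a short input sequence
    z, y_hat = rnn_step(z, x_t)
print(y_hat)
```
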
A Recurrent Neural Network Architecture unfolded in Time

• The same RNN but with a different intuition

• Consider that at each time-step a feedforward Neural Network predicts outputs based
on some inputs

• In addition, the hidden layer also receives input from the hidden layer of the previous
time step

• Without the nonlinearities in the transfer functions, this is a linear state-space model; thus an RNN is a nonlinear state-space model

Training of Recurrent Neural Network Architecture

• Backpropagation through time (BPTT): essentially backpropagation applied to the unfolded network; note that all that happened before time t influences ŷ_t, so the error needs to be propagated backwards in time, in principle until the beginning of the experiment! In reality, one typically truncates the gradient calculation (review in: Werbos (1990)); a truncated-BPTT sketch follows below

• Real-Time Recurrent Learning (RTRL) (Williams and Zipser (1989))

• Time-Dependent Recurrent Back-Propagation: learning with continuous time (Lagrangian approach) (Pearlmutter 1998)
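
• A sketch of truncated BPTT, assuming PyTorch is available: the sequence is processed in chunks and the hidden state is detached between chunks, so gradients are not propagated further back than one chunk. The toy sine series and all hyperparameters are illustrative.

```python
import torch
import torch.nn as nn

# Truncated backpropagation through time on a toy next-value prediction task.
torch.manual_seed(0)
T, H, chunk = 1000, 16, 50                      # series length, hidden size, truncation length
series = torch.sin(torch.linspace(0, 100, T + 1)).unsqueeze(-1)
inputs, targets = series[:-1], series[1:]       # predict the next value

rnn = nn.RNN(input_size=1, hidden_size=H)       # simple recurrent layer
readout = nn.Linear(H, 1)
opt = torch.optim.Adam(list(rnn.parameters()) + list(readout.parameters()), lr=1e-3)

h = torch.zeros(1, 1, H)                        # hidden state: (num_layers, batch, hidden)
for start in range(0, T, chunk):
    x = inputs[start:start + chunk].unsqueeze(1)    # shape (chunk, batch=1, 1)
    y = targets[start:start + chunk].unsqueeze(1)
    h = h.detach()                              # truncate: no gradient flows beyond this chunk
    out, h = rnn(x, h)
    loss = ((readout(out) - y) ** 2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```
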

Echo-State Network

• Recurrent Neural Networks are notoriously difficult to train

• A simple alternative is to initialize A and V randomly (according to some recipe) and only train w, e.g., with the ADALINE learning rule (a sketch follows below)

• This works surprisingly well and is done in the Echo-State Network (ESN)
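
• A minimal Echo-State Network sketch (one possible recipe, not the definitive one): A is random and rescaled to a spectral radius below 1, V is random, and only the readout w is trained, here by ridge regression instead of the iterative ADALINE rule.

```python
import numpy as np

# Echo-State Network: random, fixed recurrent and input weights; only the linear readout is trained.
rng = np.random.default_rng(0)
H, warmup = 100, 50
x = np.sin(np.linspace(0, 60, 1000))                 # toy input series
target = np.roll(x, -1)                              # predict the next value

A = rng.standard_normal((H, H))
A *= 0.9 / np.max(np.abs(np.linalg.eigvals(A)))      # rescale spectral radius to 0.9
V = rng.standard_normal(H)

# Drive the reservoir with the input and collect the hidden states
Z = np.zeros((len(x), H))
z = np.zeros(H)
for t, x_t in enumerate(x):
    z = np.tanh(A @ z + V * x_t)
    Z[t] = z

# Train only the readout weights w on the collected states (ridge regression)
ridge = 1e-6
Zs, ys = Z[warmup:-1], target[warmup:-1]
w = np.linalg.solve(Zs.T @ Zs + ridge * np.eye(H), Zs.T @ ys)
print("one-step training error:", np.mean((Zs @ w - ys) ** 2))
```
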

Iterative Prediction

• Assume a trained model where the prediction is: ŷ_t → (x_t, y_t) → ŷ_{t+1}, ...

• Thus we predict (e.g., the DAX of the next day) and then obtain a measurement of the next day

• An RNN would ignore the new measurement. What can be done?

• 1: In probabilistic models, the measurement can change the hidden state estimates
accordingly (HMM, Kalman filter, particle filter, ....)

• 2: We can use y_t as an input to the RNN (as in TDNN)

• 3: We add a (linear) noise model

Bidirectional RNNs

• The predictions in bidirectional RNNs depend on past and future inputs

• Useful for sequence labelling problems: handwriting recognition, speech recognition, bioinformatics, ...
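
• A minimal sketch of the bidirectional idea with random untrained weights: one recurrent pass runs forward in time, one runs backward, and the output at each step combines both hidden states, so it depends on past and future inputs.

```python
import numpy as np

# Bidirectional RNN: forward and backward recurrent passes, concatenated per time step.
rng = np.random.default_rng(0)
H, M, T = 4, 3, 6
A_f, V_f = 0.1 * rng.standard_normal((H, H)), 0.1 * rng.standard_normal((H, M))
A_b, V_b = 0.1 * rng.standard_normal((H, H)), 0.1 * rng.standard_normal((H, M))
w = 0.1 * rng.standard_normal(2 * H)
X = rng.standard_normal((T, M))                  # an input sequence of length T

def run(A, V, xs):
    z, states = np.zeros(H), []
    for x_t in xs:
        z = np.tanh(A @ z + V @ x_t)
        states.append(z)
    return states

forward = run(A_f, V_f, X)                       # depends on past inputs
backward = run(A_b, V_b, X[::-1])[::-1]          # depends on future inputs
outputs = [w @ np.concatenate([f, b]) for f, b in zip(forward, backward)]
print(outputs)
```
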

Long Short Term Memory (LSTM)

• As a recurrent structure the Long Short Term Memory (LSTM) approach has been
very successful

• Basic idea: at time T a newspaper announces that the Siemens stock is labelled as "buy". This information will influence the development of the stock over the next days. A standard RNN will not remember this information for very long. One solution is to define an extra input to represent that fact, which stays on as long as "buy" is valid. But this is handcrafted and does not exploit the flexibility of the RNN. A flexible construct which can hold the information is a long short term memory (LSTM) block.

• The LSTM has been used very successfully for reading handwritten text and is the basis for many applications involving sequential data (NLP, translation of text, ...)

LSTM in Detail

• The LSTM block replaces one hidden unit z_h, together with its input weights a_h and v_h. In general all H hidden units are replaced by H LSTM blocks. It produces one output z_h (in the figure it is called y)
• All inputs in the figure are weighted inputs
• Thus in the figure z would be the regular RNN-neuron output with a tanh transfer
function
• Three gates are used that control the information flow
• The input gate (one parameter) determines if z should be attenuated
• The forget gate (one parameter) determines whether the previously stored value should be added again, and with which weight
• The result is then passed through another tanh and modulated by the output gate (a sketch follows below)
• See http://www.wildml.com/2015/10/recurrent-neural-network-tutorial-part-4-implementing-a-grulstm-rnn-with-python-and-theano/
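
• A minimal sketch of one LSTM block with scalar gates and random untrained weights, following the description above (the weight names are illustrative):

```python
import numpy as np

# One LSTM block (single unit), purely to illustrate the information flow through the gates.
rng = np.random.default_rng(0)
M = 3                                            # input dimension
sig = lambda u: 1.0 / (1.0 + np.exp(-u))
v_z, v_i, v_f, v_o = (0.1 * rng.standard_normal(M) for _ in range(4))   # input weights per part
u_z, u_i, u_f, u_o = 0.1 * rng.standard_normal(4)                       # recurrent weights (scalars)

def lstm_step(c_prev, y_prev, x_t):
    z = np.tanh(v_z @ x_t + u_z * y_prev)        # candidate value (the "regular RNN-neuron output")
    i = sig(v_i @ x_t + u_i * y_prev)            # input gate: attenuates the candidate
    f = sig(v_f @ x_t + u_f * y_prev)            # forget gate: how much of the old cell value is kept
    o = sig(v_o @ x_t + u_o * y_prev)            # output gate
    c = f * c_prev + i * z                       # cell state: the block's long-term memory
    y = o * np.tanh(c)                           # another tanh, modulated by the output gate
    return c, y

c, y = 0.0, 0.0
for x_t in rng.standard_normal((10, M)):         # run the block over a short input sequence
    c, y = lstm_step(c, y, x_t)
print(c, y)
```
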

LSTM Applications

• Wiki: LSTM achieved the best known results in unsegmented connected handwriting
recognition, and in 2009 won the ICDAR handwriting competition. LSTM networks
have also been used for automatic speech recognition, and were a major component
of a network that in 2013 achieved a record 17.7% phoneme error rate on the classic
TIMIT natural speech dataset

• Applications: Robot control, Time series prediction, Speech recognition, Rhythm learning, Music composition, Grammar learning, Handwriting recognition, Human action recognition, Protein Homology Detection

Gated Recurrent Units (GRUs)

• Some researchers found LSTMs too complicated and proposed GRUs, which use fewer gates (only an update and a reset gate) and no separate cell state (a sketch follows below)
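
• For contrast with the LSTM block above, a minimal sketch of one GRU unit with random untrained weights: only an update and a reset gate, and no separate cell state.

```python
import numpy as np

# One GRU unit (scalar hidden state), purely to illustrate the reduced gating.
rng = np.random.default_rng(0)
M = 3
sig = lambda a: 1.0 / (1.0 + np.exp(-a))
v_h, v_u, v_r = (0.1 * rng.standard_normal(M) for _ in range(3))
w_h, w_u, w_r = 0.1 * rng.standard_normal(3)

def gru_step(h_prev, x_t):
    u = sig(v_u @ x_t + w_u * h_prev)            # update gate: how much of the old state to keep
    r = sig(v_r @ x_t + w_r * h_prev)            # reset gate: how much of the old state feeds the candidate
    h_cand = np.tanh(v_h @ x_t + w_h * (r * h_prev))
    return (1 - u) * h_cand + u * h_prev         # blend old state and candidate

h = 0.0
for x_t in rng.standard_normal((10, M)):
    h = gru_step(h, x_t)
print(h)
```
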

Encoder Decoder Architecture

• For example, used in machine translation
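
• A minimal structural sketch of the encoder-decoder idea, with random untrained weights and made-up toy tokens: the encoder RNN compresses the source sequence into its final hidden state, which initialises the decoder RNN that then emits target tokens one at a time (greedy decoding).

```python
import numpy as np

# Encoder-decoder sketch: encoder reads the source, decoder generates the target step by step.
rng = np.random.default_rng(0)
H, M = 8, 5                                      # hidden size, (one-hot) vocabulary size
A_enc, V_enc = 0.1 * rng.standard_normal((H, H)), 0.1 * rng.standard_normal((H, M))
A_dec, V_dec = 0.1 * rng.standard_normal((H, H)), 0.1 * rng.standard_normal((H, M))
W_out = 0.1 * rng.standard_normal((M, H))        # maps decoder state to scores over the vocabulary

source = np.eye(M)[[0, 2, 3, 1]]                 # a toy source "sentence" of one-hot vectors

h = np.zeros(H)                                  # encoder: compress the source into one state vector
for x_t in source:
    h = np.tanh(A_enc @ h + V_enc @ x_t)

y_prev, translation = np.eye(M)[0], []           # decoder: start token, then feed back its own outputs
for _ in range(4):
    h = np.tanh(A_dec @ h + V_dec @ y_prev)
    token = int(np.argmax(W_out @ h))            # greedy choice of the next target token
    translation.append(token)
    y_prev = np.eye(M)[token]
print(translation)
```
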
