[PDF][PDF] Event Fisher Vectors: Robust Encoding Visual Diversity of Visual Streams.
In this paper we focus on event recognition in visual image streams. More specifically, we
aim to construct a compact representation which encodes the diversity of the visual stream
from just a few observations. For this purpose, we introduce the Event Fisher Vector, a Fisher
Kernel based representation to describe a collection of images or the sequential frames of a
video. We explore different generative models beyond the Gaussian mixture model as
underlying probability distribution. First, the Student'st mixture model which captures the …
aim to construct a compact representation which encodes the diversity of the visual stream
from just a few observations. For this purpose, we introduce the Event Fisher Vector, a Fisher
Kernel based representation to describe a collection of images or the sequential frames of a
video. We explore different generative models beyond the Gaussian mixture model as
underlying probability distribution. First, the Student'st mixture model which captures the …
Abstract
In this paper we focus on event recognition in visual image streams. More specifically, we aim to construct a compact representation which encodes the diversity of the visual stream from just a few observations. For this purpose, we introduce the Event Fisher Vector, a Fisher Kernel based representation to describe a collection of images or the sequential frames of a video. We explore different generative models beyond the Gaussian mixture model as underlying probability distribution. First, the Student’st mixture model which captures the heavy tails of the small sample size of a collection of images. Second, Hidden Markov Models to explicitly capture the temporal ordering of the observations in a stream. For all our models we derive analytical approximations of the Fisher information matrix, which significantly improves recognition performance. We extensively evaluate the properties of our proposed method on three recent datasets for event recognition in photo collections and web videos, leading to an efficient compact image representation which achieves state-of-the-art performance on all these datasets.
ivi.fnwi.uva.nl
Showing the best result for this search. See all results