Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection

Zolfaghari, Mohammadreza; Oliveira, Gabriel Leivas; Sedaghat, Nima; Brox, Thomas

Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection

Mohammadreza Zolfaghari, Gabriel Leivas Oliveira, Nima Sedaghat, Thomas Brox

IEEE International Conference on Computer Vision (ICCV), 2017

Abstract: General human action recognition requires understanding of various visual cues. In this paper, we propose a network architecture that computes and integrates the most important visual cues for action recognition: pose, motion, and the raw images. For the integration, we introduce a Markov chain model which adds cues successively. The resulting approach is efficient and applicable to action classification as well as to spatial and temporal action localization. The two contributions clearly improve the performance over respective baselines. The overall approach achieves state-of-the-art action classification performance on HMDB51, J-HMDB and NTU RGB+D datasets. Moreover, it yields state-of-the-art spatio-temporal action localization results on UCF101 and J-HMDB.

Project page

Other associated files :

BibTex reference

@InProceedings{ZOSB17a,
  author       = "Mohammadreza Zolfaghari and Gabriel L. Oliveira and Nima Sedaghat and Thomas Brox",
  title        = "Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection",
  booktitle    = "IEEE International Conference on Computer Vision (ICCV)",
  month        = " ",
  year         = "2017",
  url          = "http://lmbweb.informatik.uni-freiburg.de/Publications/2017/ZOSB17a"
}

Other publications in the database

» Mohammadreza Zolfaghari
» Gabriel Leivas Oliveira
» Nima Sedaghat
» Thomas Brox

Chained Multi-stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection

See also

BibTex reference

Other publications in the database