FusionNet and AugmentedFlowNet: Selective Proxy Ground Truth for Training on Unlabeled Images
Technical Report , arXiv:1808.06389, 2018
Abstract: Recent work has shown that convolutional neural networks (CNNs) can be used to estimate optical flow with high quality and fast runtime. This makes them preferable for real-world applications. However, such networks require very large training datasets. Engineering the training data is difficult and/or laborious. This paper shows how to augment a network trained on an existing synthetic dataset with large amounts of additional unlabelled data. In particular, we introduce a selection mechanism to assemble from multiple estimates a joint optical flow field, which outperforms that of all input methods. The latter can be used as proxy-ground-truth to train a network on real-world data and to adapt it to specific domains of interest. Our experimental results show that the performance of networks improves considerably, both, in cross-domain and in domain-specific scenarios. As a consequence, we obtain state-of-the-art results on the KITTI benchmarks.
Paper
Images and movies
BibTex reference
@TechReport{MIB18, author = "O. Makansi and E. Ilg and T. Brox", title = "FusionNet and AugmentedFlowNet: Selective Proxy Ground Truth for Training on Unlabeled Images", institution = "arXiv:1808.06389", month = " ", year = "2018", url = "http://lmbweb.informatik.uni-freiburg.de/Publications/2018/MIB18" }