Synthetic Training Data for Deep Learning

Training data is one of the key ingredients of machine learning—most prominently, of supervised learning. While all our deep learning works feature data in one way or another, some of our publications focus on its creation and analysis.

What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

Nikolaus Mayer¹ Eddy Ilg¹ Philipp Fischer¹ Caner Hazirbas² Daniel Cremers² Alexey Dosovitskiy^1,3 Thomas Brox¹

¹University of Freiburg ²TU Munich ³Intel Deutschland GmbH

International Journal of Computer Vision (IJCV), 2018

Abstract

The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method in visual recognition is based on web data and manual annotation. Yet, for many computer vision problems, such as stereo or optical flow estimation, this approach is not feasible because humans cannot manually enter a pixel-accurate flow field. In this paper, we promote the use of synthetically generated data for the purpose of training deep networks on such tasks. We suggest multiple ways to generate such data and evaluate the influence of dataset properties on the performance and generalization properties of the resulting networks. We also demonstrate the benefit of learning schedules that use different types of data at selected stages of the training process.

Downloads

DOI

Nikolaus Mayer¹ Eddy Ilg¹ Philip Häusser² Philipp Fischer¹ Daniel Cremers² Alexey Dosovitskiy^1,3 Thomas Brox¹

¹University of Freiburg ²TU Munich ³Intel Deutschland GmbH

IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2016

Abstract

Recent work has shown that optical flow estimation can be formulated as a supervised learning task and can be successfully solved with convolutional networks. Training of the so-called FlowNet was enabled by a large synthetically generated dataset. The present paper extends the concept of optical flow estimation via convolutional networks to disparity and scene flow estimation. To this end, we propose three synthetic stereo video datasets with sufficient realism, variation, and size to successfully train large networks. Our datasets are the first large-scale datasets to enable training and evaluating scene flow methods. Besides the datasets, we present a convolutional network for real-time disparity estimation that provides state-of-the-art results. By combining a flow and disparity estimation network and training it jointly, we demonstrate the first scene flow estimation with a convolutional network.

Downloads

DOI Code Dataset