논문명 | FlowNet: Learning Optical Flow with Convolutional Networks |
---|---|
저자(소속) | Alexey Dosovitskiy (uni-freiburg.de) |
학회/년도 | ICCV 2015, 논문 |
키워드 | Dosovitskiy2015, |
데이터셋(센서)/모델 | KITTI, Sintel, Middlebury datasets, synthetic Flying Chairs dataset. |
관련연구 | DispNet, FlowNet 2 |
참고 | 홈페이지, Youtube, poster |
코드 | Code, Caffe |
optical flow estimation에 대한 [19-1981]의 연구 이후 많은 확장 논문들이 나왔다[29-1998, 5-2004, 34-2009]. Variational approaches have dominated optical flow estimation since the work of Horn and Schunck [19].Many improvements have been introduced[29, 5, 34].
최근 연구는 displacements & combinatorial matching를 여러 방법(variational approach)
으로 합치는 것이다[6-2011, 35-2013]. The recent focus was on large displacements,and combinatorial matching has been integrated into the variational approach [6, 35].
[35-DeepFlow]연구의 Deep Matching & DeepFlow아이디어를 본 논문에서 활용 하였다. The work of [35] termed Deep Matching and DeepFlow is related to our work in that feature information is aggregated from fine to coarse using sparse convolutions and max-pooling.
[30-EpicFlow]연구의 EpicFlow아이디어의 성공으로 성능이 좋아 졌따. The successive work of [30] termed EpicFlow has put even more emphasis on the quality of sparse matching as the matches from [35] are merely interpolated to denseflow fields while respecting image boundaries.
[35] P. Weinzaepfel, J. Revaud, Z. Harchaoui, and C. Schmid. DeepFlow: Large displacement optical flow with deep matching. In ICCV, Sydney, Australia, Dec. 2013
[30] J. Revaud, P. Weinzaepfel, Z. Harchaoui, and C. Schmid. EpicFlow: Edge-Preserving Interpolation of Correspondences for Optical Flow. In CVPR, Boston, United States,
June 2015
Several authors have applied machine learning techniques to optical flow before.
Sun et al. [32-2018] study statistics of optical flow and learn regularizers using Gaussian scale mixtures;
Rosenbaum et al. [31-2013] model local statistics of optical flow with Gaussian mixture models.
Black etal. [4-1997] compute principal components of a training set of flow fields.
To predict optical flow they then estimate coefficients of a linear combination of these ’basis flows’.
Other methods train classifiers to select among different inertial estimates [21-2015] or to obtain occlusion probabilities [27-2013].
[21] R. Kennedy and C. Taylor. Optical flow with geometric occlusion estimation and fusion of multiple frames. In EMMCVPR. 2015
There has been work on unsupervised learning of disparity or motion between frames of videos using neural network models.
These methods typically use multiplicative interactions to model relations between a pair of images.
Disparities and optical flow can then be inferred from the latent variables.
Taylor et al. [33-2010] approach the task with factored gated restricted Boltzmann machines.
Kondaand Memisevic [23-2013] use a special auto encoder called ‘synchrony autoencoder’.
While these approaches work well in a controlled setup and learn features useful for activity recognition in videos, they are not competitive with classical methods on realistic videos.
논문명 | FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks |
---|---|
저자(소속) | Eddy Ilg (uni-freiburg.de) |
학회/년도 | CVPPR 2017, 논문 |
키워드 | Eddy2017, |
데이터셋(센서)/모델 | |
관련연구 | DispNet, FlowNet 1 |
참고 | 홈페이지, Youtube, Review |
코드 | Code |