Visualization and Perception Lab

Moving Object Segmentation for Jittery Videos, by Clustering of Stabilized Latent Trajectories

Geethu Miriam Jacob and Sukhendu Das
Visualization and Perception Lab
Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India

Accepted in Image and Vision Computing (IMAVIS-2017), Elsevier

Abstract

Moving object segmentation in videos has always been a challenging task in the presence of large camera movements. Moreover when the camera motion is jittery, most of the existing motion segmentation approaches fail. In this work, we propose an optimization framework for the segmentation of the prominent moving object in jittery videos. A novel Optical Trajectory Descriptor Matrix (OTDM) built on point trajectories has been proposed for this purpose. An optimization function has been formulated for stabilizing the trajectories, followed by spectral clustering of the proposed latent trajectories. Latent trajectories are obtained by performing Probabilistic Latent Semantic Analysis (pLSA) on the OTDM (factorization of OTDM using KL divergence). This integrated framework yields accurate clustering of the trajectories from jittery videos. Foreground pixel labelling is obtained by utilizing the clustered trajectory coordinates for modelling the foreground and background, using a GraphCut based energy formulation. Experiments were performed on 16 real-world jittery videos. Also, the results have been generated for a standard segmentation dataset, SegTrackv2, with synthetic jitter incorporated. Jitter extracted from a real video is inserted into stable SegTrackv2 videos for analysis of performance. The proposed method, when compared to the state-of-the-art methods, was found to be superior.

Flowchart of the Algorithm

Segmentation Results on Jittery Videos

Demo 1 (Results on Natural Jittery Videos) :

Demo 2 (Results on Synthetic Jittery Videos) :

Downloadable Files

Data and Groundtruth of 16 Natural Jittery Videos

Matlab Code

More Visual Segmentation Results. Click the links below to download the comparative results of each video shot

Baby	Car	Cheery_Girl	Climb
Cycling1	Cycling2	Cycling3	Doll
Drone1	Drone2	Staircase1	Staircase2
Skating	Train	Walk1	Walk2

Jitter pattern JT4

Jitter pattern JRH (highest randomness added to parameters of JT1)

Click here to download all the comparative results

Quantitative evaluation of Segmentation using Intersection Over Union (IOU) score (higher, the better) for natural jittery videos
Video	Zhang et al. [1] (CVPR'13)	Papazoglou et al. [2] (ICCV'13)	Ochs et al. [3] (PAMI'14)	Proposed	Performance ratio
Walk1	0.401	0.135	0.02	0.714	1.78
Walk2	0.009	0.123	0.001	0.839	6.82
Cheery_Girl	0.144	0.201	0.09	0.801	3.99
Doll	0.139	0.926	0.819	0.928	1.002
Baby	0.116	0.671	0.007	0.693	1.03
Skating	0.033	0.248	0.318	0.716	2.25
Car	0.029	0.06	0.058	0.392	6.5
Cycling1	0.558	0.359	0.610	0.598	0.98
Cycling2	0.654	0.649	0.462	0.663	1.01
Climb	0.54	0.764	0.03	0.81	1.06
Drone1	0.715	0.755	0.658	0.769	1.01
Drone2	0.487	0.436	0.549	0.592	1.07
Train	0.211	0.37	0.535	0.857	1.6
Cycling3	0.701	0.342	0.605	0.732	1.04
Staircase1	0.726	0.296	0.713	0.819	1.12
Staircase2	0.875	0.889	0.801	0.934	1.05
Average	0.396	0.452	0.392	0.741	1.95

Performance ratio =^Proposed⁄_{max(CVPR'13,ICCV'13,PAMI'14)}

Quantitative evaluation of Segmentation using Intersection Over Union (IOU) score (higher, the better) for synthetic jitter JT1, JT2, JT3 and JT4.
Video	Zhang et al. [1]	Papazoglou et al. [2]	Ochs et al. [3]	Proposed
JT1	0.284	0.647	0.587	0.667
JT2	0.315	0.65	0.432	0.671
JT3	0.309	0.651	0.424	0.644
JT4	0.264	0.679	0.42	0.70

Quantitative evaluation of Segmentation using Intersection Over Union (IOU) score (higher, the better) for synthetic jitter JRL, JRM and JRH.
Video	Zhang et al. [1]	Papazoglou et al. [2]	Ochs et al. [3]	Proposed
JRL	0.586	0.637	0.327	0.665
JRM	0.551	0.575	0.525	0.60
JRH	0.543	0.585	0.479	0.65

References

[1] D. Zhang, O. Javed, and M. Shah, “Video object segmentation through spatially accurate and temporally dense extraction of primary object regions,” in CVPR, 2013

[2] A. Papazoglou and V. Ferrari, “Fast object segmentation in unconstrained video,” in ICCV, 2013.

[3] P. Ochs, J. Malik, and T. Brox, “Segmentation of moving objects by long term video analysis,” IEEE TPAMI , vol. 36, no. 6, pp. 1187–1200, 2014.