Abstract
Visual tracking remains a challenging task in computer vision. In this paper, we present a general-purpose framework for robust tracking. We propose to couple one-shot learning and online discriminative learning to address the fundamental stability-plasticity dilemma in tracking. A one-shot learner, trained offline on large-scale datasets, serves as a stable detector that does not suffer from model drift, while an online discriminative learner serves as the tracker, adapting to significant appearance changes. Based on the proposed framework, we design a baseline tracking model to verify its effectiveness. In practice, a deep Siamese network trained offline acts as the one-shot learner, which can re-detect the target in case of tracking drift or failure. A correlation classifier incorporating a translation model and a scale model acts as the online learner. Through the coupling of offline and online learning, the simple baseline tracker achieves a good balance between stability and adaptivity without time-consuming optimization. Experimental results on a large-scale benchmark dataset demonstrate the effectiveness of the proposed framework, within which the designed baseline tracker outperforms many state-of-the-art methods in both precision and robustness.
