not logged in. (login)


Motion-based Retrieval and Motion-based Data Mining


The aim of this project was to develop methods for indexing, retrieval, and data mining of motion trajectories in video databases. Computer vision techniques were devised for detection and tracking of moving objects, as well as estimation of statistical time-series models that describe each object's motion, that can be used in motion-based indexing and retrieval. Algorithms were developed that can discover clusters and other patterns in the extracted motion time-series data, and to identify common versus unusual motion patterns. The hope is that the products of this research effort can enable numerous applications that are valuable to society: homeland security; video-based analysis of human biomechanics for occupational safety, as well as dance and sports training; archive management and analysis for news, entertainment, and sports video; and video database management for non-intrusive monitoring of the motion patterns of handicapped, infirm, or elderly people to detect decline, danger, or to alert caregivers when needed.

This was a joint project between the Database and Image and Video Computing research groups.

This work was supported by the National Science Foundation under Grant No. NSF IIS-0308213, "Mining and Indexing Spatio-Temporal Patterns in Video Databases of Human Motion," September 2003 -- August 2007. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Principal Investigators

  • Stan Sclaroff
  • Margrit Betke
  • George Kollios

    Students Affiliated with the Research Project

  • Joni Alon, PhD 2006.
  • Vassilis Athitsos, PhD 2006
  • Johanna Brewer, MA 2004
  • Ching Chang, MA 2006
  • Murat Erdem, PhD student
  • Mikhail Gorman, MA 2006
  • Oleg Gusyatin, BA/MA 2004
  • Rui Li, PhD student
  • John Magee, PhD student
  • William Mullally, PhD student
  • Panagiotis Papapetrou, PhD student
  • Matthew Scott, BA 2004
  • Ashwin Thangali, PhD student
  • Tai-peng Tian, PhD student
  • Mikhail Urison, MA 2003
  • Benjamin Waber, MA/BA 2006
  • Jingbing Wang, PhD 2007
  • Zheng Wu, PhD student
  • Project Highlights

    In this project we developed new methods for extracting motion information from video data and indexing and mining this information in a novel and effective way. New systems and techniques have been developed that identify motion patterns in video streams. These patterns can be used to enhance human computer interaction and allow querying large databases of motion data more efficiently and effectively. We provided novel embedding based techniques for efficient indexing and classification of motion data that achieve orders of magnitude improvement on query and classification time compared to the previous best techniques. We also proposed new ways to model and query motion data. Furthermore, the proposed approaches opened up new areas of research that offer very promising directions for further investigation. The following is a list of some of the research highlights from research that was supported in part through this grant:

    Motion indexing, retrieval, analysis and mining

    • Prof. Kollios worked with Prof. Nikos Mamoulis and Prof. Yufei Tao and their students to design data mining algorithms for finding approximate periods in object trajectories and using these periods for efficient indexing and approximate query answering.
    • PhD student Ashwin Thangali worked with PI Sclaroff to develop a novel method for detecting periodic motion patterns in video databases. Samples from the video are taken along appropriately chosen directions in space-time. A windowed frequency analysis technique is used to determine the period for each of the sample paths. Dominant periodic motions in the shot are detected by clustering the sample paths having the same period.
    • PhD student Tai-peng Tian worked with PI Sclaroff to develop a method for matching novel video of human motion to an existing archive of 3D human motion capture data. The method automatically produces a similarity ranking of the 3D motion capture data, without the need to recover the camera calibration of 3D structure from motion for the query video.
    • Visiting PhD candidate Walter Nunziati, with PI Sclaroff and Prof. Alberto Del Bimbo of the U. of Florence developed trajectory-based indexing and retrieval methods for video databases.
    • PhD student Jonathan Alon, with PI's Kollios and Sclaroff, develop methods to model, match, index, and/or retrieve motion patterns in video databases; e.g., human activity patterns, gestures, moving objects. He produced new algorithms and a demonstratin system for learning sparse temporal models that focus on the most discriminative features for matching and classification of motion trajectories.
    • Co-PI George Kollios along with Dr. Michail Vlachos (IBM Research) and Prof. Dimitrios Gunopulos (UC Riverside) investigated techniques for analysis and retrieval of object trajectories extracted from video clips of human motion (among others).
    • PhD student U. Murat Erdem and PI Sclaroff developed methods for data mining, modeling, and then predicting human motion and traffic flow in a distributed collection of PTZ video cameras.

    Applications for individuals with disabilities and for physical rehabilitation

    • Graduate students Oleg Gusyatin and Mikhail Urison worked with Co-PI Margrit Betke on methods for finding similarities in human motion patterns created with pointer input devices for human-computer interaction. They proposed a method called SymbolDesign that can be used to design user-centered interfaces for pen-based input devices. It can also extend the functionality of pointer input devices such as the traditional computer mouse, or camera-based computer interface such as the Camera Mouse for individuals with limited movement.
    • Co-PI Margrit Betke and her graduate student Mikhail Gorman collaborated with rehabilitation specialist Prof. Elliot Saltzman from Boston University and his PhD student Amir Lahav to develop a camera-based human-computer interface called ``MusicMaker'' to provide a music-making rehabilitation tool. The proposed system uses a (1) video camera to record a subject performing therapeutic exercises, (2) detects and analyzes the subject's motion patterns, and (3) reacts to the motion patterns by producing music. The system can be adjusted to a patient's particular therapeutic needs and provides quantitative tools for monitoring the recovery process and assessing therapeutic outcomes.

    Nearest-neighbor indexing and retrieval methods

    • PhD students Vassilis Athitsos and Jonathan Alon, along with PI's Sclaroff and Kollios developed a strategy for approximation of expensive distance measures used in indexing and nearest neighbor classification problems. Using Lipshitz embeddings, 'vantage objects' are embeded in a Euclidean space in which similarities can be rapidly measured using a weighted Manhattan distance. Embedding construction is formulated as a machine learning task, where AdaBoost is used to combine many simple, 1D embeddings into a multidimensional embedding that preserves a significant amount of the proximity structure in the original space.
    • Dr. Vassilis Athitsos along with co-PI George Kollios and PhD students Panagiotis Papapetrou and Michalis Potamias proposed a new hash based method for indexing spaces with arbitrary distance measures. Hashing methods, such as Locality Sensitive Hashing (LSH), have been successfully applied for similarity indexing in vector spaces and string spaces under the Hamming distance. The key novelty of the proposed hashing technique is that it can be applied to spaces with arbitrary distance measures.

    Publically-Available Software

    Publically-Available Datasets




    Related Publications

    title year

    Rui Li, Tai-Peng Tian, Stan Sclaroff and Ming-Hsuan Yang , "3D Human Motion Tracking with a Coordinated Mixture of Factor Analyzers," International Journal of Computer Vision (IJCV), Vol. (preprint, DOI 10.1007/s11263-009-0283-4), 2009.
    2009

    Walter Nunziati, Stan Sclaroff and Alberto Del Bimbo. , "Matching trajectories between video sequences by exploiting a sparse projective invariant representation," IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), Vol. (preprint), 2009.
    2009

    Vassilis Athitsos, Jonathan Alon, Stan Sclaroff and George Kollios , "BoostMap: An embedding method for efficient nearest neighbor retrieval," IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), Vol. 30, No. 1, pp 89-104, 2008.
    2008

    Vassilis Athitsos,Marios Hadjieleftheriou,George Kollios and Stan Sclaroff , "Query Sensitive Embeddings," ACM Transactions on Database Systems (TODS), Vol. 32, No. 2, 2007.
    2007

    Mikhail Gorman, Amir Lahav, Elliot Saltzman and Margrit Betke , "A Camera-Based Music-Making Tool for Physical Rehabilitation," Computer Music Journal, Vol. 31, No. 2, pp 39-53, 2007.
    2007

    Rui Li, Tai-Peng Tian and Stan Sclaroff , "Simultaneous Learning of Nonlinear Manifold and Dynamical Models for High-dimensional Time Series," International Conference on Computer Vision (ICCV), 2007.
    2007

    Vassilis Athitsos, Alexandra Stefan, Quan Yuan and Stan Sclaroff , "ClassMap: Efficient Multiclass Recognition via Embeddings," International Conference on Computer Vision (ICCV), 2007.
    2007

    Feifei Li, Ching Chang, George Kollios and Azer Bestavros , "Characterizing and Exploiting Reference Locality in Data Stream Applications," Proc. IEEE Conf on Data Engineering (ICDE), pp 81, 2006.
    2006

    Rui Li,Ming-Hsuan Yang,Stan Sclaroff and Tai-Peng Tian , "Monocular Tracking of 3D Human Motion with a Coordinated Mixture of Factor Analyzers," Proceedings of the European Conference on Computer Vision (ECCV), pp 137-150, 2006.
    2006

    Margrit Betke, Oleg Gusyatin and Mikhail Urinson , "Symbol design: a user-centered method to design pen-based interfaces and extend the functionality of pointer input devices," Universal Access in the Information Society, Vol. 4, No. 3, pp 223-236, 2006.
    2006

    Jonathan Alon , "Spatiotemporal Gesture Segmentation," BU CS Technical Report, No. 2006-024, 2006.
    2006

    P. Papaterou , G. Kollios , S. Sclaroff and D. Gunopoulos , "Discovering Frequent Arrangements of Temporal Intervals," Proc. International Conf. on Data Mining (ICDM), pp 354-361, 2005.
    2005

    Ugur Murat Erdem and Stan Sclaroff , "Look there! Predicting where to look for motion in an active camera network," Proc. Int. Conf. on Advanced Video and Signal Based Surveillance (AVSS), pp 105-110, 2005.
    2005

    Jonathan Alon, Vassilis Athitsos and Stan Sclaroff , "Online and Offine Character Recognition Using Alignment to Prototypes," Proc. International Conf. on Document Analysis and Retrieval (ICDAR), Vol. 2, pp 839-843, 2005.
    2005

    Jonathan Alon, Vassilis Athitsos and Stan Sclaroff , "Fast and Accurate Gesture Spotting via Pruning and Subgesture Reasoning," Proc. IEEE Human Computer Interface Workshop, pp 189-198, 2005.
    2005

    Walter Nunziati, Stan Sclaroff and Alberto Del Bimbo , "Matching Trajectories Across Videos with Semi-Local Projective Invariant Features," Proc. International Conf. on Image and Video Retrieval, pp 318-327, 2005.
    2005

    Panagiotis Papapetrou, George Kollios, Stan Sclaroff and Dimitrios Gunopulos , "Discovering Frequent Arrangements of Temporal Intervals," Proc. IEEE Conf. on Data Mining (ICDM), pp 354-361, 2005.
    2005

    Walter Nunziati, Stan Sclaroff and Alberto Del Bimbo , "View registration using interesting segments of planar trajectories," Proc. Int. Conf. on Advanced Video and Signal Based Surveillance (AVSS), 2005.
    2005

    M. Gorman, M. Betke, E. Saltzman and A. Lahav , "MusicMaker -- A Camera-based Music Making Tool for Physical Rehabilitation," BU CS Technical Report, No. BUCS-2005-032, 2005.
    2005

    Stan Sclaroff, Margrit Betke, George Kollios, Jonathan Alon, Vassilis Athitsos, Rui Li, John Magee and Tai-peng Tian , "Tracking, Analysis, and Recognition of Human Gestures in Video," Proc. International Conf. on Document Analysis and Retrieval (ICDAR), pp 806-810, 2005.
    2005

    Marios Hadjieleftheriou, George Kollios, Petko Bakalov and Vassilis J. Tsotras , "Complex spatio-temporal pattern queries," Proc. Conf. on Very Large Data Bases (VLDB), pp 877 - 888, 2005.
    2005

    Vassilis Athitsos and Stan Sclaroff , "Boosting Nearest Neighbor Classifiers for Multiclass Recognition," Proceedings of IEEE Workshop on Learning in Computer Vision and Pattern Recognition, pp 45-55, 2005.
    2005

    Vassilis Athitsos, Jonathan Alon, Stan Sclaroff and George Kollios , "Filtering Methods for Similarity-Based Multimedia Retrieval," Proceedings of the Seventh International Workshop of the EU Network of Excellence DELOS on Audio-Visual Content and Information Visualization in Digital Libraries (AVIVDiLib), pp (10 pages), 2005.
    2005

    Vassilis Athitsos, Jonathan Alon and Stan Sclaroff , "Efficient Nearest Neighbor Classification Using a Cascade of Approximate Similarity Measures," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 486-493, 2005.
    2005

    Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios and Stan Sclaroff , "Query-Sensitive Embeddings," ACM International Conference on Management of Data (SIGMOD), pp 706-717, 2005.
    2005

    Michail Vlachos, George Kollios and Dimitrios Gunopulos , "Elastic Translation Invariant Matching of Trajectories," Machine Learning, Vol. 58, No. 2-1, pp 301-334, 2005.
    2005

    Jonathan Alon, Vassilis Athitsos, Quan Yuan, and Stan Sclaroff , "Simultaneous Localization and Recognition of Dynamic Hand Gestures," Proc. IEEE Workshop on Motion and Video Computing , pp 254-260, 2005.
    2005

    Tai-Peng Tian and Stan Sclaroff , "Handsignals Recognition From Video Using 3D Motion Capture Data," Proc. IEEE Workshop on Motion and Video Computing , pp 189-194, 2005.
    2005

    Ashwin Thangali and Stan Sclaroff , "Periodic Motion Detection and Estimation via Space-Time Sampling," Proc. IEEE Workshop on Motion and Video Computing , pp 176-182, 2005.
    2005

    Dan Buzan, Stan Sclaroff and George Kollios , "Extraction and Clustering of Motion Trajectories in Video," Proc. International Conf. on Pattern Recognition (ICPR), Vol. 2, pp 512-524, 2004.
    2004

    Vassilis Athitsos, Jonathan Alon, Stan Sclaroff and George Kollios , "BoostMap: A Method for Efficient Approximate Similarity Rankings," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. II, pp 268-275, 2004.
    2004

    Nikos Mamoulis, Huiping Cao, George Kollios, Marios Hadjieleftheriou,Yufei Tao and David W. Cheung , "Mining, indexing, and querying historical spatiotemporal data," Proc. ACM SIGKDD Conf. on Knowledge Discovery and Data Mining, pp 236 - 245 , 2004.
    2004

    Oleg Gusyatin, Mikhail Urinson and Margrit Betke , "A Method to Extend Functionality of Pointer Input Devices," User-Centered Interaction Paradigms for Universal Access in the Information Society, pp 426-439, 2004.
    2004

    M. La Cascia, L. Valenti and S. Sclaroff , " Fully automatic, real-time detection of facial gestures from generic video," Proc. Multimedia Signal Processing Workshop, pp 175-178, 2004.
    2004

    Dan Buzan , "Robust Tracking of Human Motion," Masters Report, 2003.
    2003

    Jonathan Alon, Stan Sclaroff, George Kollios and Vladimir Pavlovic , "Discovering Clusters in Motion Time-Series Data," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Vol. 1, pp 375-381, 2003.
    2003