BU CLA CS 835: Seminar in Image and Video Computing

Fall 1995

Seminar Readings


Readings:

Recent journal and conference papers (see schedule and references below).

Supplemental texts:

  • Machine Vision, by Jain, Kasturi, and Schunk. McGraw Hill, 1995.
  • Fundamentals of Digital Image Processing, by A. K. Jain. Prentice-Hall, 1989.
  • Pattern Classification and Scene Analysis, by Duda and Hart, John Wiley, 1973.

    All are available on reserve at the Science and Engineering Library

  • Seminar Outline:

    Regular readings enclosed in [], optional supplemental readings in {}.

    Sept
    M 11 Introduction to image and video databases [1,2] M 18 Color histograms and color indexing [3,4] {5} M 25 Texture representations & texture-based search [6,7] {8} Oct
    M 2 Eigen-representations for grayscale imagery [9,10] {11} T 10 Shape-based indexing methods [12,13] {14} M 16 Nonrigid shape, shape detection [15,16] {17} M 23 Segmentation [18,19] {20} M 30 Multiple models, combinating similarity measures [21,22] {23}
    Nov
    M 6 Video motion estimation and representation [24,25] {26} M 13 Camera motion and scene cut detection [27,28] {29} M 20 Salient stills, image mosaics [30,31] {32} M 27 Video interface, annotation [33,34,35]
    Dec
    M 4 Human expression, gait, and gesture [36,37,38] {39,40,41} M 11 final project in-class presentations

    References

    1
    M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, D. Dom, M. Gorkani, J. Hafner, D. Lee, D. Petkovic, D. Steele, and P. Yanker. "Query by image and video content: The QBIC system." IEEE Computer, pages 23--30, September 1995.

    2
    A. Pentland, R. Picard, and S. Sclaroff. "Photobook: Tools for content-based manipulation of image databases." International Journal of Computer Vision, to appear fall 1995.

    3
    M. Swain and D. Ballard. "Color indexing." International Journal of Computer Vision, 7(1):11--32, 1991.

    4
    J. Hafner, Harpreet Sawney, W. Equitz, M. Flickner, and W. Niblack. "Efficient color histogram indexing for quadratic form distance functions." IEEE Trans. Pattern Analysis and Machine Intelligence, 1(7):729--736, 1995.

    5
    S. Shafer and T. Kanade. Encyclopedia of Artificial Intelligence, volume I, chapter Color Vision, pages 124--131. John Wiley and Sons, 1991.

    6
    H. Tamura, S. Mori, and T. Yamawaki. "Textural features corresponding to visual perception." IEEE Transactions on Systems, Man, and Cybernetics, SMC-8(6):460--472, 1978.

    7
    R. Picard and M. Gorkani. "Finding perceptually dominant orientations in natural textures." Spatial Vision, 8(2):221--253, 1994.

    8
    W. Freeman and E. H. Adelson. "The Design and Use of Steerable Filters." IEEE Trans. Pattern Analysis and Machine Intelligence, 13(9):891--906, September 1991.

    9
    M. Turk and A. Pentland. "Eigenfaces for recognition." Journal of Cognitive Neuroscience, 3(1):71--86, 1991.

    10
    H. Murase and S. Nayar. "Visual Learning and Recognition of 3-D Objects from Appearance." International Journal of Computer Vision, 14(1):5--24, January 1995.

    11
    A. Pentland, B. Moghaddam, T. Starner, O. Oliyide, and M. Turk, "View-Based and Modular Eigenspaces for Face Recognition", in Proc. CVPR, 1994.Geometric Invariance in Computer Vision, MIT Press, 1992.

    12
    G. Taubin and D. B. Cooper, "Object Recognition Based on Moment (or Algebraic) Invariants", chapter in Geometric Invariance in Computer Vision, MIT Press, 1992.

    13
    W. Grosky and R. Mehrotra, "Index-Based Object Recognition in Pictorial Data Management", Computer Vision, Graphics, and Image Processing, 52(3):416-436, 1990.

    14
    R. Mehrotra and J. Gary, "Similar-Shape Retrieval in Shape Data Management", IEEE Computer, 57--62, September 1995.

    15
    S. Sclaroff, "Deformable Prototypes for Encoding Shape Categories in Image Databases", Boston University CS Dept TR95-017, 1995.

    16
    A. Evans, N. Thacker, and J. Mayhew, "The Use of Geometric Histograms for Model-based Object Recognition", Proc. British Machine Vision Conference, pp. 429--438, 1993.

    17
    D. Ballard and C. Brown, "The Hough Method for Curve Detection," section in Computer Vision, pp. 123--131, Prentice Hall, 1982.

    18
    S. Zhu, T. Lee, and A. Yuille, "Region Competition: Unifying Snakes, Region Growing, Energy/Bayes/MDL for Multiband Image Segmentation", in Proc. of CVPR, pp. 416--423.

    19
    D. Panjwani and G. Healey, "Markov Random Field Models for Unsupervised Segmentation of Textured Color Images", IEEE Trans. on Pattern Analysis and Machine Intelligence, 17(10):939--954, 1995.

    20
    R. Haralick and L. Shapiro, "Image Segmentation," chapter in Computer and Robot Vision, volume I, pp. 509--553, Addison Wesley, 1992.

    21
    T. Minka and R. Picard, "Interactive Learning Using a Society of Models," MIT Media Lab Perceptual Computing Group TR-349, 1995. Submitted to Pattern Recognition, special issue on pattern recognition in image databases.

    22
    S. Santini and R. Jain, "Similarity Matching," submitted to EEEE Trans. on Pattern Recognition and Machine Intelligence, special issue on image and video databases, 1995.

    23
    J.A. Benediktsson and P.H. Swain, "Consensus theoretic classification methods", in IEEE Trans. on Systems, Man and Cybernetics 22(4):688--704, 1992.

    24
    J.L. Barron, D.J. Fleet, and S.S. Beauchemin, "Performance of Optical Flow Techniques," in International Journal of Computer Vision, 12(1):43-77, 1994.

    25
    J.Y.A. Wang and E.H. Adelson, "Representing Moving Images with Layers," in IEEE Trans. on Image Processing 3(5):625-638, 1994.

    26
    S.W. Smoliar and H.J. Zhang, "Content-Based Video Indexing and Retrieval," in IEEE Multimedia 1(2):62-72, 1994.

    27
    A. Hampapur R. Jain and T. Weymouth, "Production Model Based Digital Video Segmentation," in Multimedia Tools and Applications, 1:9-46, 1995.

    28
    H.J. Zhang , A. Kankanhalli, and S.W. Smoliar, "Automatic Partitioning of Full-Motion Video," in Multimedia Systems, 1(1):10-28, 1993.

    29
    Y. Tonomura, A. Akutsu, Y. Taniguchi, and G. Suzuki, "Structured Video Computing," in IEEE Multimedia 1(3):34-43, 1994.

    30
    S. E. Chen, "QuickTime VR -- An Image-Based Approach to Virtual Environment Navigation," in Proc. ACM SIGGRAPH 1995 , pp. 29--38.

    31
    M. Irani, P. Anandan, and S. Hsu, "Mosaic Based Represenations of Video Sequences and Their Applications," in Proc. International Conference on Computer Vision 1995, pp. 605--611.

    32
    D. J. Heeger and A. D. Jepson, "Subspace Methods for Recovering Rigid Motion I: Algorithm and Implementation," in International Journal of Computer Vision, 7(2):95-117, 1992.

    33
    Marc Davis, "Media Streams," in Telektronikk, 89(4):59-71, 1993.

    34
    L. A. Rowe, J. S. Boreczky, and C. A. Eads, "Indexes for User Access to Large Video Databases," in Proc. Storage and Retrieval for Image and Video Databses II, SPIE, pp. 1-12, February 1994.

    35
    G. Ahanger, D. Benson, and T. D. C. Little, "Video Query Formulation," in Proc. Storage and Retrieval for Image and Video Databses II, SPIE, 2420:280-291, 1995.

    36
    M. Black and Y. Yacoob, "Tracking and Recognizing Rigid and Non-rigid Facial Motions using Local Parametric Models of Image Motion," in Proc. International Conf. on Computer Vision, pp. 374-381, 1995.

    37
    D. Terzopoulos and K. Waters, "Analysis and Synthesis of Facial Image Sequences Using Physical and Anatomical Models," in IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(6):569-579, 1993.

    38
    S. Niyogi and E. H. Adelson, "Analyzing and Recognizing Walking Figures in XYT," in Proc. IEEE Conf. on Vision and Pattern Recognition, pp. 469-474, 1994.

    39
    R. Polana and R. Nelson, " Low Level Recognition of Human Motion," in Proc. IEEE Workshop on Nonrigid and Articulate Motion, pp. 77-82, 1995.

    40
    I. Essa, T. Darrell, and A. Pentland, "Tracking Facial Motion," in Proc. IEEE Workshop on Nonrigid and Articulate Motion, pp. 36-42, 1995.

    41
    K. Rohr, "Towards Model-based recognition of human movements in image sequences," in Computer Vision, Graphics and Image Processing, 59(1):94-115, 1994.


    Stan Sclaroff
    Created: May 22, 1995
    Last Modified: Nov 10, 1995