skip to main content
research-article

Robust Spatiotemporal Matching of Electronic Slides to Presentation Videos

Published: 01 August 2011 Publication History
  • Get Citation Alerts
  • Abstract

    We describe a robust and efficient method for automatically matching and time-aligning electronic slides to videos of corresponding presentations. Matching electronic slides to videos provides new methods for indexing, searching, and browsing videos in distance-learning applications. However, robust automatic matching is challenging due to varied frame composition, slide distortion, camera movement, low-quality video capture, and arbitrary slides sequence. Our fully automatic approach combines image-based matching of slide to video frames with a temporal model for slide changes and camera events. To address these challenges, we begin by extracting scale-invariant feature-transformation (SIFT) keypoints from both slides and video frames, and matching them subject to a consistent projective transformation (homography) by using random sample consensus (RANSAC). We use the initial set of matches to construct a background model and a binary classifier for separating video frames showing slides from those without. We then introduce a new matching scheme for exploiting less distinctive SIFT keypoints that enables us to tackle more difficult images. Finally, we improve upon the matching based on visual information by using estimated matching probabilities as part of a hidden Markov model (HMM) that integrates temporal information and detected camera operations. Detailed quantitative experiments characterize each part of our approach and demonstrate an average accuracy of over 95% in 13 presentation videos.

    Cited By

    View all
    • (2024)SwapVid: Integrating Video Viewing and Document Exploration with Direct ManipulationProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642515(1-13)Online publication date: 11-May-2024
    • (2017)Behavior Discovery and Alignment of Articulated Object Classes from Unstructured VideoInternational Journal of Computer Vision10.1007/s11263-016-0939-9121:2(303-325)Online publication date: 1-Jan-2017
    • (2015)Structuring Lecture Videos by Automatic Projection Screen Localization and AnalysisIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2014.236113337:6(1233-1246)Online publication date: 1-Jun-2015
    • Show More Cited By

    Index Terms

    1. Robust Spatiotemporal Matching of Electronic Slides to Presentation Videos
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Image Processing
      IEEE Transactions on Image Processing  Volume 20, Issue 8
      August 2011
      309 pages

      Publisher

      IEEE Press

      Publication History

      Published: 01 August 2011

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)SwapVid: Integrating Video Viewing and Document Exploration with Direct ManipulationProceedings of the CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642515(1-13)Online publication date: 11-May-2024
      • (2017)Behavior Discovery and Alignment of Articulated Object Classes from Unstructured VideoInternational Journal of Computer Vision10.1007/s11263-016-0939-9121:2(303-325)Online publication date: 1-Jan-2017
      • (2015)Structuring Lecture Videos by Automatic Projection Screen Localization and AnalysisIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2014.236113337:6(1233-1246)Online publication date: 1-Jun-2015
      • (2014)Multi-modal Language Models for Lecture Video RetrievalProceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2654964(1081-1084)Online publication date: 3-Nov-2014
      • (2012)Client-side backprojection of presentation slides into educational videoProceedings of the 20th ACM international conference on Multimedia10.1145/2393347.2396368(1005-1008)Online publication date: 29-Oct-2012

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media

      -