Semantic analysis for automatic event recognition and segmentation of wedding ceremony videos

Wen-Huang Cheng*, Yung Yu Chuang, Yin Tzu Lin, Chi Chang Hsieh, Shao Yen Fang, Bing Yu Chen, Ja Ling Wu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Scopus citations


Wedding is one of the most important ceremonies in our lives. It symbolizes the birth and creation of a new family. In this paper, we present a system for automatically segmenting a wedding ceremony video Into a sequence of recognizable wedding events, e.g., the couple's wedding kiss. Our goal is to develop an automatic tool that helps users to efficiently organize, search, and retrieve his/her treasured wedding memories. Furthermore, the obtained event descriptions could benefit and complement the current research in semantic video understanding. Based on the knowledge of wedding customs, a set of audiovisual features, relating to the wedding contexts of speech/music types, applause activities, picture-taking activities, and leading roles, are exploited to build statistical models for each wedding event. Thirteen wedding events are then recognized by a hidden Markov model, which takes into account both the fitness of observed features and the temporal rationality of event ordering to improve the segmentation accuracy. We conducted experiments on a collection of wedding videos and the promising results demonstrate the effectiveness of our approach. Comparisons with conditional random fields show that the proposed approach is more effective in this application domain.

Original languageEnglish
Article number4633636
Pages (from-to)1639-1650
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Issue number11
StatePublished - 1 Nov 2008


  • Event detection
  • Home videos
  • Semantic content analysis
  • Video segmentation
  • Wedding ceremonies


Dive into the research topics of 'Semantic analysis for automatic event recognition and segmentation of wedding ceremony videos'. Together they form a unique fingerprint.

Cite this