MMAI_Proposal
MMAI final project proposal.
Title: “People Search beyond Text in Consumer Videos”
Team members
Introduction
This project extends previous work by Chien-Hsing Chiang (江建興) [1]. That work proposed a name propagation algorithm over a face similarity graph to improve people search in videos, and its results outperformed the traditional text-based approach used by current online services such as YouTube.

[Figure: System framework]
[Figure: Name propagation based on cluster similarities]

In this project, we aim to extend the approach from celebrity videos to consumer (family) videos, in which every individual is treated as a “celebrity”. In home videos, frame quality is much poorer, and facial variations in pose, expression, and lighting are even farther from ideal. Our previous work did not make use of temporal information in video processing. To improve performance, we plan to introduce automatic clothing segmentation [2] and face tracking in the “face detection” step, so that after “local clustering” we may be able to merge or split clusters for the later steps.

Dataset
We plan to use a small number (around 15) of HD family videos collected from YouTube, and to manually annotate names at the video level for initial tagging and at the face level for performance evaluation. We expect faces to be visible for a large enough percentage of each video, and several people to appear for roughly equal lengths of time, rather than the video being dominated by one or two people.

Expected Result
The expected result is a Web-based people search system similar to the demo system of our previous work. We will show, respectively, the system with 10 fully labeled videos and the system with 5 additional unlabeled videos.

References
Any comments are appreciated! :)