Coupled grouping and matching for sign and gesture recognition
摘要:
Matching an image sequence to a model is a core problem in gesture or sign recognition. In this paper, we consider such a matching problem, without requiring a perfect segmentation of the scene. Instead of requiring that low- and mid-level processes produce near-perfect segmentation, we take into account that such processes can only produce uncertain information and use an intermediate grouping module to generate multiple candidates. From the set of low-level image primitives, such as constant color region patches found in each image, a ranked set of salient, overlapping, groups of these primitives are formed, based on low-level cues such as region shape, proximity, or color. These groups corresponds to underlying object parts of interest, such as the hands. The sequence of these frame-wise group hypotheses are then matched to a model by casting it into a minimization problem. We show the coupling of these hypotheses with both non-statistical matching (match to sample-based modeling of signs) and statistical matching (match to HMM models) are possible. Our algorithm not only produces a matching score, but also selects the best group in each image frame, i.e. recognition and final segmentation of the scene are coupled. In addition, there is no need for tracking of features across sequences, which is known to be a hard task. We demonstrate our method using data from sign language recognition and gesture recognition, we compare our results with the ground truth hand groups, and achieved less than 5% performance loss for both two models. We also tested our algorithm on a sports video dataset that has moving background.
展开
关键词:
Practical, Theoretical or Mathematical/ gesture recognition hidden Markov models image colour analysis image matching image segmentation image sequences minimisation/ coupled grouping coupled matching gesture recognition image sequence matching near-perfect segmentation constant color region patches minimization problem nonstatistical matching sample-based modeling hidden Markov model sign language recognition sports video dataset/ B6135E Image recognition B0240J Markov processes B0260 Optimisation techniques C5260B Computer vision and image processing techniques C1250M Image recognition C1140J Markov processes C1180 Optimisation techniques
DOI:
10.1016/j.cviu.2008.09.005
被引量:
年份:
2009




























通过文献互助平台发起求助,成功后即可免费获取论文全文。
相似文献
参考文献
引证文献
辅助模式
引用
文献可以批量引用啦~
欢迎点我试用!