WU Wenchao, WANG Shilin, KURUOGLU Ercan Engin, MA Xiaoli, LI Shenghong, LI Jianhua, Lionel M. Ni. Optimization of Lip Contour Estimation[J]. Chinese Journal of Electronics, 2014, 23(2): 341-347.
Citation: WU Wenchao, WANG Shilin, KURUOGLU Ercan Engin, MA Xiaoli, LI Shenghong, LI Jianhua, Lionel M. Ni. Optimization of Lip Contour Estimation[J]. Chinese Journal of Electronics, 2014, 23(2): 341-347.

Optimization of Lip Contour Estimation

Funds:  This work is supported by the National Natural Science Foundation of China (No.61271319, No.60702043, No.61071152).
  • Received Date: 2013-08-01
  • Rev Recd Date: 2013-09-01
  • Publish Date: 2014-04-05
  • Developing algorithms based on lip contour estimation is a distinctive trend in lip segmentation which is the first step of visual speech recognition. In order to establish an optimized estimation of lip contour that is complex enough to describe the principal features of the lip but at the same time simple enough to be implemented, the selection of lip model, estimator as well as parameters for features of lips, including the horizontal length of snake of feature points and horizontal distance between these feature points will be optimized. Experimental result demonstrates that the optimized estimation method of lip contour provides more accurate and more stable results of lip segmentation.
  • loading
  • S.L. Wang, W.H. Lau, S.H. Leung, "Automatic lip contour extraction from color images", Pattern Recognition, Vol.37, No.12, pp.2375-2387, 2004.
    N.P. Erber, "Interaction of audition and vision in the recognition of oral speech stimuli", J. Speech Hear. Res. Vol.12, No.2, pp.423-425, 1969.
    M.T. Chan, "HMM-based audio-visual speech recognition integrating geometric and appearance-based visual features", IEEE Fourth Workshop on Multimedia Signal Processing, Cannes, France, pp.9-14, 2001.
    WU Wenchao, KURUOGLU Ercan Engin, WANG Shilin et al. "Automatic lip contour extraction using both pixel-based and parametric models", Chinese Journal of Electronics, Vol.22, No.1, pp.76-82, 2013.
    M.N. Kaynak, Q. Zhi, A.D. Cheok, K. Sengupta, K.C. Chung, "Audiovisual modeling for bimodal speech recognition", Proceedings of IEEE International Conference on Systems, Man, and Cybernetics, Tucson, AZ, USA, Vol.1, pp.181-186, 2001.
    Y. Zhang, S. Levinson, T. Huang, "Speaker independent audiovisual speech recognition", Proceedings of IEEE International Conference on Multimedia and Expo, NewYork, USA, Vol.2, pp.1073-1076, 2000.
    S.H. Leung, S.L. Wang, W.H. Lau, "Lip image segmentation using fuzzy clustering incorporating an elliptic shape function", IEEE Transactions on Image Processing, Vol.13, No.1, pp.51-62, 2004.
    M. Gordan, C. Kotropoulos, A. Georgakis, I. Pitas, "A new fuzzy c-means based segmentation strategy. applications to lip region identification", Proceedings of the 2002 IEEE-TTTC International Conference on Automation, Quality and Testing, Robotics, Romania, 2002.
    S.L. Wang, W.H. Lau, S.H. Leung, A.W.C. Liew, "Lip segmentation with the presence of beards", IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP' 04), Vol.3, pp.529-532, 2004.
    M. Sadeghi, J. Kittler, K. Messer, "Real time segmentation of lip pixels for lip tracker initialization", Lecture Notes in Computer Science, Vol.2124, pp.317-324, 2001.
    M. Sadeghi, J. Kittler, K. Messer, "Modelling and segmentation of lip area in face images", IEE Proceedings -Vision, Image and Signal Processing, Vol.149, pp.179-184, 2002.
    B. Goswami, W.J. Christmas, J. Kittler, "Statistical estimators for use in automatic lip segmentation", Proceedings of the 3rd European Conference on Visual Media Production (CVMP 2006), pp.79-86, 2006.
    I. Mpiperis, S. Malassiotis, M.G. Strintzis, "Expression compensation for face recognition using a polar geodesic representation", Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06), pp.224-231, 2006.
    I. Shdaifat, R. Grigat, D. Langmann, "Active shape lip modeling", Proceedings of the 2003 International Conference on Image Processing, Vol.3, pp.II-875-II-878, 2003.
    P.C. Yuen, J.H. Lai, Q.Y. Huang, "Mouth state estimation in mobile computing environment", Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), pp.705-710, 2004.
    K.S. Jang, S. Han, I. Lee, Y.W. Woo, "Lip localization based on active shape model and gaussian mixture model", Lecture Notes in Computer Science, Vol.4319, pp.1049-1058, 2006.
    M. Jiang, Z.H. Gan, G.M. He, W.Y. Gao, "Combining particle filter and active shape models for lip tracking", Proceedings of the Sixth World Congress on Intelligent Control and Automation (WCICA 2006), Vol.2, pp.9897-9901, 2006.
    Z. Hammal, N. Eveno, A. Caplier, P.Y. Coulon, "Parametric models for facial features segmentation", Signal Processing, Vol.86, pp.399-413, 2006.
    H. Seyedarabi, W.S. Lee, A. Aghagolzadeh, "Automatic lip tracking and action units classification using two-step active contours and probabilistic neural networks", Proceedings of the Canadian Conference on Electrical and Computer Engineering (CCECE '06), pp.2021-2024, 2006.
    B. Beaumesnil, F. Luthon, M. Chaumont, "Liptracking and mpeg4 animation with feedback control", Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2006), pp.II-677-II-680, 2006.
    L. Zhang, "Estimation of the mouth features using deformable templates", in Proceedings of International Conference on Image Processing, Santa Barbara, Vol.3, pp.328-331, 1997.
    M.U. Ramos-Sanchez, J. Matas, J. Kittler, "Statistical chromaticity-based lip tracking with b-splines", in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, pp.IV-2973-IV-2976, 1997.
    A.M. Martinez and R. Benavente, "The AR face database", CVC Technical Report #24, 1998.
    Steven M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory, Prentice Hall PTR, pp.219, 2005.
    M.O. Berger, R. Mohr, "Towards autonomy in active contour models", 10th International Conference on Pattern Recognition (ICPR'90), Atlantic City, pp.847-851, 1990.
    P. Delmas, N. Eveno, M. Lievin, "Towards robust lip tracking", 16th International Conference on Pattern Recognition (ICPR'02), Quebec, pp.528-531, 2002.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (395) PDF downloads(1646) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return