بهبود مانيفولد حالات و تصاوير مجازی با به‌کارگيری شبکه‌های عصبی دوسويه در بازشناسی چهره با يک تصوير از هر فرد

محورهای موضوعی : مهندسی برق و کامپیوتر

1 - دانشگاه صنعتی اميرکبير
2 - دانشگاه صنعتی امیرکبیر

تاریخ دریافت : 1394/09/07 تاریخ پذیرش : 1394/09/07 تاریخ انتشار : 1390/01/01

کلید واژه: بازشناسی چهره يک تصوير از هر فرد شبکه عصبی بازگشتی پويايی جاذب يادگيری مانيفولد تصاوير مجازی پالايش غير خطی,

چکیده مقاله :

در اين مقاله به‌منظور توسعه عملکرد مدل‌های شبکه عصبی در بازشناسی چهره با يک تصوير از هر فرد، يک ساختار شبکه عصبی دوسويه با الهام از نئوکورتکس مغز انسان ارائه شده است. در ساختار پيشنهادی همانند نئوکورتکس در ابتدا طی يک مرحله پردازش از پايين به بالا، يک تفسير زمخت از ورودی صورت می‌گيرد، سپس در مرحله بعد نتايج بازشناسی اوليه ضمن عبور از يک شبکه عصبی معکوس پالايش می‌شوند. از اين مدل جهت جداسازی غير خطی اطلاعات فرد از حالت و تخمين مانيفولدهای اطلاعات فرد و حالت استفاده شده است. به‌منظور افزايش تعداد نمونه‌های تعليم در شبکه طبقه‌بندی کننده با استفاده از مانيفولدهای تخمين زده شده، تصاوير مجازی چهره‌های نرمال موجود در پايگاه داده تست توليد شده است. با تعليم شبکه طبقه‌بندی کننده توسط تصاوير مجازی حاصل از تعليم شبکه دوسويه، درصد صحت بازشناسی 45/85٪ روی دادگان تست حاصل شده که در مقايسه با توليد تصاوير مجازی با استفاده از روش خوشه‌بندی بدون سرپرستی اطلاعات افراد و حالات دارای بهبود 82/1٪ می‌باشد.

چکیده انگلیسی:

In this article, for the purpose of improving neural network models applied in face recognition using single image per person, a bidirectional neural network inspired of neocortex functional model is presented. In the proposed model, recognition is not performed in a single stage, but via two bottom-up and top-down phases and the recognition results of first stage is used for model adaptation. We have applied this novel adapting model in combination with clustering person and pose information technique to separate person and pose information and to estimate corresponding manifolds. To increase the number of training samples in the classifier neural network, virtual views of frontal images in the test dataset are synthesized using estimated manifolds. Training classifier network via virtual images obtained from bidirectional network, gives an accuracy rate of 85.45% on the test dataset which shows 1.82% improvement in accuracy of face recognition compared to training classifier with virtual images obtained from clustering person and pose information network.

منابع و مأخذ:

[1] X. Zhang and Y. Gao, "Face recognition across pose: a review," Pattern Recognition, vol. 42, no. 11, pp. 2876-2896, Nov. 2009.
[2] T. Stonham, "Practical face recognition and verification with WISARD," Aspects of Face Processing, pp. 426-441, 1984.
[3] S. H. Lin, S. Y. Kung, and L. J. Lin, "Face recognition/detection by probabilistic decision - based neural network," IEEE Trans. Neural Network, vol. 8, no. 1, pp. 114-132, Jan. 1997.
[4] S. Lawrence, C. L. Giles, A. C. Tsoi, and A. D. Back, "Face recognition: a convolutional neural-network approach," IEEE Trans. Neural Network, vol. 8, no. 1, pp. 98-113, Jan.1997.
[5] Y. W. Wong, K. P. Seng, and L. M. Ang, "Radial basis function neural network with incremental learning for face recognition," IEEE Trans. Syst. Man Cybern. B, vol. 41, no. 4, pp. 940-949, Aug. 2011.
[6] F. Abdolali and S. Seyyedsalehi, "Face recognition from a single image per person using deep architecture neural networks," in Proc. 3rd Int. Conf. on Computer and Electrical Engineering, Chengdu, vol. 1, pp. 70-73, Nov. 2010.
[7] ن. داداشی، بازشناسی چهره با استفاده از يک تصوير از هر فرد، پايان‌نامه کارشناسی ارشد، دانشگاه صنعتی اميرکبير، دانشکده مهندسی پزشکی، دی 1387.
[8] ع. سيدصالحي، افزايش کارآيي بازشناخت الگوي شبکه‌هاي عصبي جلوسو از طريق توسعه روش‌هايي براي دوسويه‌کردن عملکرد آنها، گزارش طرح پژوهشي، دانشگاه صنعتي اميرکبير، دانشکده مهندسي پزشکي، 1383.
[9] A. Leila and S. Seyyedsalehi, "Modeling phones coarticulation effects in a neural network based speech recognition system," in Proc. INTERSPEECH-2004, vol. 1, pp. 1657-1660, Oct. 2004.
[10] I. Nejadgholi and S. Seyyedsalehi, "Nonlinear normailization of input patterns to speaker variability in speech recognition neural networks," Neural Computing & Applications, vol. 18, no. 1, pp. 45-55, Jan. 2009.
[11] M. M. Mesulam, "From sensation to cognition," Brain, vol. 121, no. 6, pp. 1013-1052, Jun. 1998.
[12] E. Koerner and G. Matsumoto, "Cortical architecture and self - referential control for brain - like computation, a new approach to understanding how the brain organizes computation," IEEE Eng. in Medicine and Biology Magazine, vol. 21, no. 5, pp. 121-133, Sep./Oct. 2002.
[13] E. Koerner, H. Tsujino, and T. Masutani, "A cortical - type modular neural network for hypothetical reasoning," Neural Networks, vol. 10, no. 5, pp. 791-814, Jul. 1997.
[14] E. Koerner and et al., "A model of computation in neocortical architecture," Neural Networks, vol. 12, no. 12, pp. 989-1005, Oct. 1999.
[15] J. Ghosen and Y. Bengio, "Bias learning, knowledge sharing," IEEE Trans. on Neural Networks, vol. 14, no. 4, pp. 748-765, Jul. 2003.
[16] J. Aghajanian and S. J. D. Prince, "Face pose estimation in uncontrolled environments," in Proc. British Machine Vision Conf., 2009.

اشتراک گذاری

آدرس مقاله

بهبود مانيفولد حالات و تصاوير مجازی با به‌کارگيری شبکه‌های عصبی دوسويه در بازشناسی چهره با يک تصوير از هر فرد

رایمگ

پیوندهای سایت

مراکز مرتبط

پشتیبانی

صفحات رسمی