تشخیص نواحی مزاحم بصری در تصاویر به وسیله روش نمایش تنک دومرحله‌ای و وزن‌دار نمونه آزمون

الموضوعات : electrical and computer engineering

1 - دانشگاه سمنان
2 - دانشگاه سمنان

تاريخ الإرسال : 18 الثلاثاء , جمادى الثانية, 1439 تاريخ التأكيد : 10 الإثنين , ذو القعدة, 1439 تاريخ الإصدار : 25 السبت , جمادى الثانية, 1440

الکلمات المفتاحية: سیستم بینایی انسانتوجه بصرینواحی برجستهمزاحمت بصریسیستم کدگذاری و بازنمایی تنک,

ملخص المقالة :

مخاطب یک تصویر مایل است که در کوتاه‌ترین زمان، پیام اصلی تصویر را دریافت کند. از این رو سیستم بینایی انسان توجه بصری را ناخودآگاه به سمت نواحی برجسته، با فرض وجود اطلاعات مفید در آنها هدایت می‌کند. عملاً این فرض همواره صادق نبوده و در مواردی، نواحی برجسته صرفاً موجب مزاحمت بصری می‌گردند. از این رو در کاربردهای مختلف نیاز به ساز و کاری جهت تشخیص این نواحی می‌باشد تا با حذف این نواحی، حواس مخاطب از سوژه اصلی تصویر پرت نشود. همچنین نادیده‌گرفتن این نواحی، کمک شایانی است به روش‌هایی که بر پایه تشخیص نواحی برجسته و مهم عمل می‌کنند. بدین منظور در این مقاله، بر اساس روش‌های منطبق بر چالش عدم توازن دسته‌ها، هر قطعه از تصاویر آموزشی با توجه به ماسک آنها به 9 دسته افراز می‌شود که شماره هر دسته متناسب با شدت مزاحمت است. سپس ویژگی‌های مبتنی بر قطعه استخراج و دسته هر قطعه بر اساس روش نمایش تنک دومرحله‌ای و وزن‌دار نمونه آزمون که بر مبنای سیستم کدگذاری و بازنمایی تنک است، تعیین می‌شود. به منظور ارزیابی دقیق روش پیشنهادی و مقایسه آن با سایر روش‌ها، 4 معیار ارزیابی با رویکردهای مختلف معرفی و پیشنهاد می‌شود. با ارزیابی و سنجش نتایج نشان داده می‌شود که روش پیشنهادی علی‌رغم زمان‌بر بودن، نسبت به کارهای پیشین دارای دقت بیشتری است.

المصادر:

[1] N. Senthilkumaran and J. Thimmiaraja, "Histogram equalization for image enhancement using MRI brain images," inProc. World Congress on Computing and Communication Technologies WCCCT'14, pp. 80-83, Trichirappalli, India, 27 Feb.- 1 Mar. 2014.
[2] V. A. Mateescu and I. V. Bajic, "Visual attention retargeting," IEEE MultiMedia, vol. 23, no. 1, pp. 82-91, Jan. 2016.
[3] T. V. Nguyen, et al., "Image re-attentionizing," IEEE Trans. on Multimedia, vol. 15, no. 8, pp. 1910-1919, Dec. 2013.
[4] R. Mechrez, E. Shechtman, and L. Zelnik-Manor, "Saliency driven image manipulation," in Proc. Conf. on Applications of Computer Vision, WACV'18, pp. 1368-1376, Lake Tahoe, NV, USA, 12-15 Mar. 2018.
[5] A. Borji and L. Itti, "State-of-the-art in visual attention modeling," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 35, no. 1, pp. 185-207, Jan. 2013.
[6] F. Sabouri and F. Yaghmaee, "Visual distractors detecting in images using TPTSSR," in Proc. 10th Iranian Conf. on Machine Vision and Image Processing, MVIP'17, pp. 87-92, Isfahan, Iran, 22-23 Nov. 2017.
[7] Y. Xu, D. Zhang, J. Yang, and J. Y. Yang, "A two-phase test sample sparse representation method for use with face recognition," IEEE Trans. on Circuits and Systems for Video Technology, vol. 21, no. 9, pp. 1255-1262, Sept. 2011.
[8] L. Itti, C. Koch, and E. Niebur, "A model of saliency-based visual attention for rapid scene analysis," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 20, no. 11, pp. 1254-1259, Nov. 1998.
[9] A. Oliva and A. Torralba, "Modeling the shape of the scene: a holistic representation of the spatial envelope," International J. of Computer Vision, vol. 42, no. 3, pp. 145-175, May-Jun. 2001.
[10] S. Goferman, L. Zelnik-Manor, and A. Tal, "Context-aware saliency detection," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 34, no. 10, pp. 1915-1926, Oct. 2012.
[11] A. Borji and L. Itti, "Exploiting local and global patch rarities for saliency detection," in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, CVPR'12, pp. 478-485, Providence, RI, USA, 16-21 Jun. 2012.
[12] R. Margolin, A. Tal, and L. Zelnik-Manor, "What makes a patch distinct?," in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1139-1146, Portland, OR, USA, 23-28 Jun. 2013.
[13] Q. Fan and C. Qi, "Saliency detection based on global and local shortterm sparse representation," Neurocomputing, vol. 175, no. ???, pp. 81-89, Oct. 2016.
[14] N. Dhavale and L. Itti, "Saliency-based multifoveated MPEG compression," in Proc. 7th Int. Symp. on Signal Processing and Its Applications, vol. 1, pp. 229-232, Paris, France, 4-4 Jul. 2003.
[15] C. Guo and L. Zhang, "A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression," IEEE Trans. on Image Processing, vol. 19, no. 1, pp. 185-198, Jan. 2010.
[16] G. Evangelopoulos, K. Rapantzikos, A. Potamianos, P. Maragos, A. Zlatintsi, and Y. Avrithis, "Movie summarization based on audiovisual saliency detection," in Proc. 15th IEEE Int. Conf. on Image Processing, ICIP'08, pp. 2528-2531, San Diego, CA, USA, 12-15 Oct. 2008.
[17] D. Simakov, Y. Caspi, E. Shechtman, and M. Irani, "Summarizing visual data using bidirectional similarity," Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, CVPR'08, 8 pp., Anchorage, AK, USA, 23-28 Jun. 2008.
[18] L. Zhang, Y. Shen, and H. Li, "VSI: a visual saliency-induced index for perceptual image quality assessment," IEEE Trans. on Image Processing, vol. 23, no. 10, pp. 4270-4281, Oct. 2014.
[19] R. Achanta and S. Susstrunk, "Saliency detection for content-aware image resizing," in Proc. 16th IEEE Int. Conf. on Image Processing, ICIP'09, pp. 1005-1008, Cairo, Egypt, 7-10 Nov. 2009.
[20] Y. Fang, et al., "Saliency-based stereoscopic image retargeting," Information Sciences, vol. 372, pp. 347-358, 1 Dec. 2016.
[21] O. Fried, E. Shechtman, D. B. Goldman, and A. Finkelstein, "Finding distractors in images," in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, CVPR'15, pp. 1703-1712, Boston, MA, USA, 7-12 Jun. 2015.
[22] R. Tibshirani, "Regression shrinkage and selection via the lasso," J. of the Royal Statistical Society, Series B (Methodological), vol. 58, no. 1, pp. 267-288, Jan. 1996.
[23] J. Wright, A. Y. Yang, A. Ganesh, S. S. Sastry, and Y. Ma, "Robust face recognition via sparse representation," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 31, no. 2, pp. 210-227, Feb. 2009.
[24] Z. Zhang, Y. Xu, J. Yang, X. Li, and D. Zhang, "A survey of sparse representation: algorithms and applications," IEEE Access, vol. 3, pp. 490-530, May 2015.
[25] K. Yu, T. Zhang, and Y. Gong, "Nonlinear learning using local coordinate coding," in Proc. of the 22nd International Conference on Neural Information Processing Systems, NIPS’09, pp. 2223-2231, Vancouver, BC, Canada, 7-10 Dec.. 2009.
[26] Z. Liu, J. Pu, M. Xu, and Y. Qiu, "Face recognition via weighted two phase test sample sparse representation," Neural Processing Letters, vol. 41, no. 1, pp. 43-53, Jan. 2015.
[27] H. Zou and T. Hastie, "Regularization and variable selection via the elastic net," J. of the Royal Statistical Society, Series B (Statistical Methodology), vol. 67, no. 2, pp. 301-320, Aug. 2005.
[28] T. Judd, K. Ehinger, F. Durand, and A. Torralba, "Learning to predict where humans look," in Proc. IEEE 12th Int. Conf. on Computer Vision, pp. 2106-2113, Kyoto, Japan, 29 Sept.- 2 Oct. 2009.
[29] E. Rosten, R. Porter, and T. Drummond, "Faster and better: a machine learning approach to corner detection," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 32, no. 1, pp. 105-119, Jan. 2010.
[30] C. L. Zitnick and P. Dollar, "Edge boxes: locating object proposals from edges," in Proc. European Conf. on Computer Vision, ECCV'14, pp. 391-405, Zurich, Switzerland, 6-12 Sept. 2014.
[31] M. M. Cheng, N. J. Mitra, X. Huang, P. H. Torr, and S. M. Hu, "Global contrast based salient region detection," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 37, no. 3, pp. 569-582, Mar. 2015.
[32] M. Wang, J. Konrad, P. Ishwar, K. Jing, and H. Rowley, "Image saliency: from intrinsic to extrinsic context," in Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition, CVPR'11, pp. 417-424, Colorado Springs, CO, USA, 20-25 Jun. 2011.
[33] K. K. Maninis, J. Pont-Tuset, P. Arbelaez, and L. Van Gool, "Convolutional oriented boundaries: from image segmentation to high-level tasks," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 40, no. 4, pp. 819 - 833, Apr. 2018..
[34] Y. Sun, A. K. Wong, and M. S. Kamel, "Classification of imbalanced data: a review," International J. of Pattern Recognition and Artificial Intelligence, vol. 23, no. 4, pp. 687-719, Nov. 2009.
[35] X. Li, et al., "DeepSaliency: multi-task deep neural network model for salient object detection," IEEE Trans. on Image Processing, vol. 25, no. 8, pp. 3919-3930, Oct. 2016.

شارک

عنوان URL للمقالة

تشخیص نواحی مزاحم بصری در تصاویر به وسیله روش نمایش تنک دومرحله‌ای و وزن‌دار نمونه آزمون

رایمگ

الروابط

المراكز ذات الصلة

دعامة

الصفحات الرسمية