شرح‌نگاری خودکار تصویر با روش چرخش بلاک اصلی

محورهای موضوعی : مهندسی برق و کامپیوتر

حازم الرکابی ¹ , نسرین صوفی ² , هادی صدوقی یزدی ³ , امیرحسین طاهری‌نیا ⁴

1 - دانشگاه فردوسي مشهد
2 - دانشگاه فردوسي مشهد
3 - دانشگاه فردوسی مشهد
4 - دانشگاه فردوسي مشهد

تاریخ دریافت : 1397/02/08 تاریخ پذیرش : 1397/08/22 تاریخ انتشار : 1398/05/02

کلید واژه: تجزیه نامنفی ماتریسچرخش بلاک اصلیk- نزدیک‌ترین همسایهشرح‌نگاری تصاویر,

چکیده مقاله :

سیستم‌های شرح‌نگاری خودکار تصاویر، وظیفه توصیف محتوای تصاویر به وسیله تخصیص برچسب به آنها را بر عهده دارند. هدف از انجام این تحقیق بهبود نتایج دقت و سرعت یک سیستم شرح‌نگار تصاویر است. اخیراً با توجه به رشد روزافزون تصاویر، فرایند شرح‌نگاری بر روی پایه‌های تصاویر به جای خودشان اجرا می‌گردد. یکی از این روش‌های جدید، پیاده‌سازی الگوریتم تجزیه نامنفی ماتریس (NMF) بر روی ویژگی‌های به دست آمده از تصاویر است. در روش پیشنهادی برای افزایش سرعت و کارایی بهتر سیستم شرح‌نگاری، برای اولین بار از روشی به نام چرخش بلاک اصلی برای حل NMF در شرح‌نگاری استفاده شده است. این روش با توانایی افزودن برخط کلاس جدیدی از داده‌ها به دانش خود و یادگیری دانش به صورت فشرده و علاوه بر آن، توانایی آموزش بر اساس داده‌های دریافتی بدون نیاز به پردازش مجدد توانسته از روش‌های پیشین ارائه‌شده برای حل NMF عملکرد بهتری را نشان دهد. در مرحله آموزش با روش چرخش بلاک اصلی ماتریس ضرایب و پایه تصاویر ورودی به دست می‌آیند. سپس در مرحله آزمون برای تصویر ورودی، توسط ویژگی‌های استخراج‌شده از تصویر و ضرایب به دست آمده از مرحله آموزش، ضریب تعلق تصویر آزمون به هر یک از کلاس‌های تصاویر آموزش محاسبه می‌گردد. سپس این ضریب در هنگام جستجو در میان تصاویر آموزش برای تخصیص برچسب به تصویر آزمون، دقت کار را افزایش می‌دهد. این جستجو توسط روش KNN بر روی پایه‌های تصاویر صورت می‌گیرد. برای آزمایش روش پیشنهادی از دو پایگاه داده K5Corel و داده‌های واقعی حیوانات (برگرفته از px 500) استفاده شده و نهایتاً با روش‌های موجود مقایسه شده که در پایگاه داده K5Corel به میزان دقت 20/50 و روی داده‌های واقعی به 89/62 رسیدیم که به‌طور قابل ملاحظه‌ای دقت افزایش یافته است.

چکیده انگلیسی:

Image annotation systems are responsible for describing the content of the images by assigning tags to them. The purpose of this research is to improve the accuracy and speed of image annotation system. Recently, with the growing of images, the image annotation process is based on the basics of images instead of themselves. One of these new methods is the implementation of the non-negative matrix algorithm (NMF) on the features of the images. In the proposed method, for the first time, in order to increase the speed and efficiency of the7 system, we use a method that called the block principal pivoting for the NMF solution. This method has ability to add online new class of data to its knowledge and knowledge learning in a compact form. Moreover, the ability to train based on received data without having to be re-processed. In the training phase, the matrix of the coefficients and the base of the input images are obtained using the Block Principal Pivoting method. Then, at the test phase for the input image, by extracted features of the image and the coefficients obtained from the training phase, the coefficient of belonging to the test image is calculated to each of the classes of training images. Then, this coefficient while searching among the teaching images for assigning the label to test image increases the accuracy of the algorithm. This search is done by the KNN method on the base of the images. To test the proposed method, we used two databases Corel5K and real animal data (derived from 500px) and, finally, compared with existing methods, which we found in the Corel5K database at a precision of 50.20 and real data was 62.89. Precision have been increased considerably.

منابع و مأخذ:

اشتراک گذاری

آدرس مقاله

شرح‌نگاری خودکار تصویر با روش چرخش بلاک اصلی

رایمگ

پیوندهای سایت

مراکز مرتبط

پشتیبانی

صفحات رسمی