کاربرد تحلیل گفتمان در خلاصه¬سازی خودکار متون علمی

محورهای موضوعی :

پگاه تاجر ¹ , عبدالرسول جوکار ² , سید مصطفی فخر احمد ³ , علیرضا خرمایی ⁴ , هاجر ستوده ⁵

1 - دانشجوی دکتری علم اطلاعات و دانش‌شناسی، دانشگاه شیراز
2 - دانشگاه شیراز
3 -
4 - دانشیار بخش زبان‌های خارجی و زبان‌شناسی، دانشگاه شیراز
5 - دانشیار بخش علم اطلاعات و دانش‌شناسی، دانشگاه شیراز

تاریخ دریافت : 1401/10/15 تاریخ پذیرش : 1401/10/15 تاریخ انتشار : 1401/10/15

کلید واژه: تحلیل گفتمان, ژانر علمی, خلاصه‌سازی خودکار متن, خلاصه‌سازی گفتمان‌مدار, خلاصه‌سازی استنادی, ناحیه‌بندی استدلالی,

چکیده مقاله :

هدف این پژوهش آشکارسازی کاربرد ویژه تحلیل گفتمان در خلاصه‌سازی خودکار متون علمی است و به این منظور از رویکرد مرور و تحلیل نوشتارها استفاده شده است. متن یک واحد زبانشناسی پیچیده است و ساختار گفتمانی و تئوری‌های سازماندهی متن را می‌توان شاخصی برای تفسیر متن و انتخاب جمله مناسب به منظور تولید خودکار خلاصه‌های خواناتر و منسجم‌تر در نظر گرفت. نشانه‌گذاری بلاغی اطلاعات، برای فرایندهای جدید دسترسی به اطلاعات علمی مفید می باشد. استراتژی‌های ویژه خلاصه‌سازی خودکار مقالات علمی عموما تلفیقی از روش‌های سطحی و معنایی هستند. یکی از استراتژی‌هایی که از تجزیه و تحلیل کلام بهره می برد عبارت است از ناحیه‌بندی استدلالی. بر اساس این استراتژی طرح‌های حاشیه‌نویسی گفتمانی ویژه متون علمی به وجود آمده‌اند که بر اساس آنها می توان ابعاد چهارگانه حل مسئله، تخصیص فکری، استدلال علمی و نگرش به آثار دیگران را در متون علمی معلوم کرد و بر غنای خلاصه‌های خودکار افزود. مشهورترین طرح حاشیه‌نویسی نواحی بلاغی مقالات علمی، طرح تویفل می‌باشد. از کاربردهای مهم و ویژه تحلیل گفتمان در خلاصه سازی خودکار ژانر علمی، تولید خلاصه‌های استنادی گفتمان‌مدار می‌باشد که بر اساس آن می‌توان انواع خلاصه های خودکار مبتنی بر پرسش کاربر و خلاصه‌های چندسندی‌ای که تضادها و شباهت‌های مدارک در خلاصه نهایی مشهود است را تولید نمود.

چکیده انگلیسی:

منابع و مأخذ:

البرزی ورکی ، پرویز (1386). "مبانی زبان شناسی متن". تهران: انتشارات امیر کبیر.

بشیری، حسن (1394). زبان شناسی و تحلیل گفتمان اخبار: رویکردی عملیاتی، فصلنامه نقد کتاب: اطلاع‌رسانی و ارتباطات، شماره 7و8. 2.

کلانتری، صمد و دیگران (1388). تحلیل گفتمان با تاکید بر گفتمان انتقادی به عنوان روش تحقیق کیفی. جامعه شناسی. شماره 1.4. 28-7.

Abu-Jbara, A., and Radev, D. (2011, June). Coherent citation-based summarization of scientific papers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1. Association for Computational Linguistics. pp. 500-509.

Baxendale, P. B. (1958). Machine-made index for technical literature—an experiment. IBM Journal of Research and Development, 2(4). pp. 354-361.

Bosma, W. E. (2008). Discourse oriented summarization. University of Twente.

Brown, G and Yule, G. (1983) Discourse Analysis, Cambridge University Press.

Cohan, A., and Goharian, N. (2015). Scientific article summarization using citation-context and article's discourse structure. Proceedings of the 2015 Conference on Empirical Methods in Nathral Language Processing. Lisbon, Portugal, 17-21 September. pp. 390-400.

Cohan, A., Soldaini, L., and Goharian, N. (2014). Towards citation-based summarization of biomedical literature. In Proceedings of the Text Analysis Conference (TAC’14).

DeJong, G. (1982). An overview of the FRUMP system. Strategies for natural language processing, 113. pp. 149-176.

Elkiss, A., Shen, S., Fader, A., Erkan, G., States, D., and Radev, D. (2008). Blind men and elephants: What do citation summaries tell us about a research article?. Journal of the American Society for Information Science and Technology, 59(1). pp. 51-62.

Fisas, B., Ronzano, F., and Saggion, H. (2015, June). On the discoursive structure of computer graphics research papers. In The 9th Linguistic Annotation Workshop held in conjuncion with NAACL 2015. pp. 42.
Fraser, B. (1999). What are discourse markers?, Journal of Pragmatics, 31. pp. 931-952.

Jones, K. S. (1993). What might be in a summary?. Information retrieval, 93. pp. 9-26.

Jones, K. S., and Endres-Niggemeyer, B. (1995). Automatic summarizing. Information Processing & Management, 31(5). pp. 625-630.

Kupiec, J., Pedersen, J., and Chen, F. (1995, July). A trainable document summarizer. In Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval. ACM. pp. 68-73.

Lloret, E., and Palomar, M. (2013). COMPENDIUM: a text summarisation tool for generating summaries of multiple purposes, domains, and genres. Natural Language Engineering, 19(2). pp.147-186.

Luhn, H. P. (1958). The automatic creation of literature abstracts. IBM Journal of research and development, 2(2). pp.159-165.

Mani, I. (2001). Automatic summarization (Vol. 3). John Benjamins Publishing.

Mann, W. C., and Thompson, S. A. (1988). Rhetorical structure theory: Toward a functional theory of text organization. Text-Interdisciplinary Journal for the Study of Discourse, 8(3), 243-281.

Qazvinian, V., and Radev, D. R. (2008, August). Scientific paper summarization using citation summary networks. In Proceedings of the 22nd International Conference on Computational Linguistics-Volume 1 Association for Computational Linguistics. pp. 689-696.

Qazvinian, V., and Radev, D. R. (2010, July). Identifying non-explicit citing sentences for citation-based summarization. In Proceedings of the 48th annual meeting of the association for computational linguistics. Association for Computational Linguistics. pp. 555-564.

Radev, D. R., Hovy, E., and McKeown, K. (2002). Introduction to the special issue on summarization. Computational linguistics, 28(4). pp. 399-408.

Rahman, N., and Borah, B. (2015, September). A survey on existing extractive techniques for query-based text summarization. In Advanced Computing and Communication (ISACC), 2015 International Symposium on IEEE. pp. 98-102.

Saggion, H., and Poibeau, T. (2013). Automatic text summarization: Past, present and future. In Multi-source, multilingual information extraction and summarization (pp. 3-21). Springer Berlin Heidelberg.

Schiffrin, D. (1987). Discourse Markers. Cambridge: Cambridge University Press.

Teufel, S. (1999). Argumentative Zoning: Information Extraction from Scientific Text. School of Cognitive Science, University of Edinburg, UK..

Teufel, S., and Moens, M. (2002). Summarizing scientific articles: experiments with relevance and rhetorical status. Computational linguistics, 28(4). pp.409-445.

Teufel, S., Siddharthan, A., and Batchelor, C. (2009, August). Towards discipline-independent argumentative zoning: evidence from chemistry and computational linguistics. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3-Volume 3 . Association for Computational Linguistics. pp. 1493-1502.

Yang, S. H. A. N. R. U. (2011). Investigating discourse markers in pedagogical settings: a literature review. ARECLS, 8. pp. 95-108.

Yeloglu, O., Milios, E., and Zincir-Heywood, N. (2011, March). Multi-document summarization of scientific corpora. In Proceedings of the 2011 ACM Symposium on Applied Computing .ACM. pp. 252-258.

اشتراک گذاری

آدرس مقاله

کاربرد تحلیل گفتمان در خلاصه¬سازی خودکار متون علمی

رایمگ

پیوندهای سایت

مراکز مرتبط

پشتیبانی

صفحات رسمی