Volume 12, Issue 2 (6-2020)                   itrc 2020, 12(2): 46-53 | Back to browse issues page

XML Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Soroosh Akef S, Hadi Bokaei M H, Sameti H. Thematic Similarity Multiple-Choice Question Answering with Doc2Vec: A Step Toward Metaphorical Language Processing. itrc 2020; 12 (2) :46-53
URL: http://journal.itrc.ac.ir/article-1-459-en.html
1- Languages and Linguistics Center Sharif University of Technology Tehran, Iran
2- Department of Information Technology Iran Telecommunication Research Center Tehran, Iran , mh.bokaei@itrc.ac.ir
3- Department of Computer Engineering Sharif University of Technology Tehran, Iran
Abstract:   (2017 Views)

This paper reports our improvement over the previous benchmark of the task of answering poetic verses' thematic similarity multiple-choice questions (MCQs). In this experiment, we have trained a Doc2Vec model on a corpus of Persian poems and proceeded to use the trained model to get the vector representations of the poetic verses. Subsequently, the poetic verse among the options with the highest cosine similarity to the stem verse was selected as the correct answer by the model. This model managed to answer 38% of the questions correctly, which was an improvement of 6% over the previous benchmark. Provided that a large-scale thematic similarity MCQ dataset is developed, the performance of a language representation model on this task could be considered as a novel benchmark to measure the capacity of a model to understand metaphorical language.

Full-Text [PDF 1056 kb]   (754 Downloads)    
Type of Study: Research | Subject: Information Technology

Add your comments about this article : Your username or Email:
CAPTCHA

Send email to the article author


Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.