AUB ScholarWorks

Few-shot Learning for Conversational Bots in Low-Resource Settings

Show simple item record

dc.contributor.author Naous, Tarek
dc.date.accessioned 2022-04-06T18:28:58Z
dc.date.available 2022-04-06T18:28:58Z
dc.date.issued 4/6/2022
dc.date.submitted 4/6/2022
dc.identifier.uri http://hdl.handle.net/10938/23363
dc.description.abstract Open-domain dialogue agents are systems that can converse with users on any topic of user’s choice. Having such types of agents has been a long standing objective in Artificial Intelligence as they can make the human-computer interaction experience much more engaging. Recent advances in English open-domain dialogue have leveraged state-of-the-art Large Language Models (LLMs) for Natural Language Generation (NLG). Such LLMs are massively pre-trained on unlabeled data in a self-supervised mode to learn abstract representations of the language. They also require large amounts of labeled open-domain dialogue data for fine-tuning to achieve the challenging task of dialogue response generation. In low-resource settings such as Arabic and its dialects, such pre-trained LLMs and large labeled dialogue datasets are often non-existent, hindering the development of open-domain chatbots for those languages. Such limited resource modeling problem is known as the few-shot learning problem. In this thesis, we address multiple aspects of the few-shot learning problem for open-domain Arabic conversational bots. The first contribution is a solution to overcome the unavailability of LLMs with large amounts of labeled dialogue data for Arabic MSA. To address the response generation problem, we propose a model that transfers knowledge from a pre-trained BERT encoder to an encoder-decoder model for dialogue response generation. The second contribution addresses a more extreme case of limited resources with Arabic dialects. To address the LLM and NLG challenges for Arabic dialects, we propose a three-stage learning framework based on warm-starting, self-supervised pre-training, and few-shot fine-tuning. The third contribution focuses on addressing the challenge of ensuring generated responses are relevant to user’s query for both English and Arabic. We propose a new decoding algorithm that considers increased samples in response generation then chooses the response with highest similarity to user’s query. The fourth contribution is in the development of new data resources for Arabic with one message-response dataset in Modern Standard Arabic (MSA) and three datasets for the most widely spoken Arabic dialects (Levantine, Egyptian, and Gulf). The experiment results showed success of the proposed methods and achieved state of the art performance for Arabic open-dialogue systems.
dc.language.iso en_US
dc.subject Natural Language Processing
dc.subject Machine Learning
dc.subject Conversational AI
dc.title Few-shot Learning for Conversational Bots in Low-Resource Settings
dc.type Thesis
dc.contributor.department Department of Electrical and Computer Engineering
dc.contributor.faculty Maroun Semaan Faculty of Engineering and Architecture
dc.contributor.institution American University of Beirut
dc.contributor.commembers Jabr, Rabih
dc.contributor.commembers El Hajj, Wassim
dc.contributor.commembers Shaban, Khaled
dc.contributor.degree Master of Engineering
dc.contributor.AUBidnumber 202124053


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AUB ScholarWorks


Browse

My Account