AUB ScholarWorks

Machine Learning Models and Resources for Task-Oriented Chatbots in Arabic

Show simple item record

dc.contributor.advisor Hajj, Hazem
dc.contributor.author El Droubi, Nour
dc.date.accessioned 2021-09-06T03:29:35Z
dc.date.available 2021-09-06T03:29:35Z
dc.date.issued 9/6/2021
dc.date.submitted 9/5/2021
dc.identifier.uri http://hdl.handle.net/10938/22990
dc.description.abstract Recent developments enabled chatbots to be an essential part of people’s daily lives from asking general questions about the weather to booking movie tickets. Chatbots can be classified into open-domain bots or task-oriented bots. Open domain chatbots can have engaging conversations in any domain. On the other hand, task-oriented chatbots, which are the focus of this thesis, aim at handling specific tasks such as booking movie tickets. While task-oriented chatbots have seen significant advances in English, task-oriented chatbots in Arabic remain limited in their capabilities mainly due to the scarcity of the available datasets and resources for training task-oriented dialogue systems in Arabic. To overcome these challenges, we have explored two state-of-the-art strategies for task-oriented bots: End-to-end models and pipeline models that consist of Natural Language Understanding (NLU) followed by the Dialogue Manager (DM) and Natural Language Generation (NLG). For end-to-end, we proposed the use of AraGPT2 and created a large multi-domain human-to-human conversational dataset in Arabic by translating a large-scale English dataset. Our end-to-end model achieved state-of-the-art results for Arabic and proved to be comparable in performance to what has been achieved by state-of-the-art English end-to-end models. For pipeline models, we addressed the NLU challenge by developing a multi-task model that can simultaneously perform intent classification and slot filling using AraBERT. To train the NLU model, we created a large dataset labeled for intents and slots by translating another large English dataset for training task-oriented bots. The developed NLU model was able to achieve comparable results with respect to the state-of-the-art results of pipeline models in English.
dc.language.iso en
dc.subject Machine Learning
dc.subject Chatbots
dc.subject Task-Oriented Chatbots
dc.title Machine Learning Models and Resources for Task-Oriented Chatbots in Arabic
dc.type Thesis
dc.contributor.department Department of Electrical and Computer Engineering
dc.contributor.faculty Maroun Semaan Faculty of Engineering and Architecture
dc.contributor.institution American University of Beirut
dc.contributor.commembers Saghir, Mazen
dc.contributor.commembers El Hajj, Wassim
dc.contributor.degree ME
dc.contributor.AUBidnumber 201500898


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AUB ScholarWorks


Browse

My Account