dc.contributor.advisor |
Hajj, Hazem |
dc.contributor.author |
El Droubi, Nour |
dc.date.accessioned |
2021-09-06T03:29:35Z |
dc.date.available |
2021-09-06T03:29:35Z |
dc.date.issued |
9/6/2021 |
dc.date.submitted |
9/5/2021 |
dc.identifier.uri |
http://hdl.handle.net/10938/22990 |
dc.description.abstract |
Recent developments enabled chatbots to be an essential part of people’s daily lives from asking general questions about the weather to booking movie tickets. Chatbots can be classified into open-domain bots or task-oriented bots. Open domain chatbots can have engaging conversations in any domain. On the other hand, task-oriented chatbots, which are the focus of this thesis, aim at handling specific tasks such as booking movie tickets. While task-oriented chatbots have seen significant advances in English, task-oriented chatbots in Arabic remain limited in their capabilities mainly due to the scarcity of the available datasets and resources for training task-oriented dialogue systems in Arabic. To overcome these challenges, we have explored two state-of-the-art strategies for task-oriented bots: End-to-end models and pipeline models that consist of Natural Language Understanding (NLU) followed by the Dialogue Manager (DM) and Natural Language Generation (NLG). For end-to-end, we proposed the use of AraGPT2 and created a large multi-domain human-to-human conversational dataset in Arabic by translating a large-scale English dataset. Our end-to-end model achieved state-of-the-art results for Arabic and proved to be comparable in performance to what has been achieved by state-of-the-art English end-to-end models. For pipeline models, we addressed the NLU challenge by developing a multi-task model that can simultaneously perform intent classification and slot filling using AraBERT. To train the NLU model, we created a large dataset labeled for intents and slots by translating another large English dataset for training task-oriented bots. The developed NLU model was able to achieve comparable results with respect to the state-of-the-art results of pipeline models in English. |
dc.language.iso |
en |
dc.subject |
Machine Learning |
dc.subject |
Chatbots |
dc.subject |
Task-Oriented Chatbots |
dc.title |
Machine Learning Models and Resources for Task-Oriented Chatbots in Arabic |
dc.type |
Thesis |
dc.contributor.department |
Department of Electrical and Computer Engineering |
dc.contributor.faculty |
Maroun Semaan Faculty of Engineering and Architecture |
dc.contributor.institution |
American University of Beirut |
dc.contributor.commembers |
Saghir, Mazen |
dc.contributor.commembers |
El Hajj, Wassim |
dc.contributor.degree |
ME |
dc.contributor.AUBidnumber |
201500898 |