Arabic named entity recognition via deep co-learning -

dc.contributor.authorHelwe, Chadi Talal,
dc.contributor.departmentFaculty of Arts and Sciences.
dc.contributor.departmentDepartment of Computer Science,
dc.contributor.institutionAmerican University of Beirut.
dc.date2017
dc.date.accessioned2017-12-11T16:30:58Z
dc.date.available2017-12-11T16:30:58Z
dc.date.issued2017
dc.date.submitted2017
dc.descriptionThesis. M.S. American University of Beirut. Department of Computer Science, 2017. T:6553
dc.descriptionAdvisor : Dr. Shady Elbassuoni, Assistant Professor, Computer Science ; Committee members : Dr. Wassim El Hajj, Associate Professor, Computer Science ; Dr. Hazem El Hajj, Associate Professor, Electrical and Computer Engineering.
dc.descriptionIncludes bibliographical references (leaves 44-47)
dc.description.abstractNamed entity recognition (NER) is the task of identifying named entities such as locations, persons, and organizations in a given piece of text. NER plays a signi cant role in many applications including information retrieval, question an-swering, machine translation, text clustering, and navigation systems. In this thesis, we tackled the problem of Arabic NER. Arabic is a very challenging lan-guage when it comes to natural language processing (NLP) in general. Arabic is both morphologically rich and highly ambiguous and has complex morpho-syntactic agreement rules and many irregular forms. To address all these issues, we proposed to use deep learning based on Arabic word embeddings that cap-ture syntactic and semantic relationships between words. Deep learning has been shown to perform signi cantly better than other approaches for various NLP tasks including NER. However, deep learning models also require a signi cantly large amount of training data, which is highly lacking in the case of Arabic. To be able to overcome this, we proposed a semi-supervised deep learning approach that uses both labeled and semi-labeled data, which we coin deep co-learning. We tested our approach using di erent established benchmarks and compared it to the state-of-the-art Arabic NER tools such as MadaMira and Farasa. Our deep co-learning approach signi cantly outperformed the compared to Arabic NER approaches as well as purely-supervised deep learning ones.
dc.format.extent1 online resource (x, 47 leaves) : illustrations
dc.identifier.otherb19214327
dc.identifier.urihttp://hdl.handle.net/10938/21007
dc.language.isoen
dc.relation.ispartofTheses, Dissertations, and Projects
dc.subject.classificationT:006653
dc.subject.lcshNatural language processing (Computer science)
dc.subject.lcshMachine learning.
dc.subject.lcshArabic language -- Morphology.
dc.subject.lcshData mining.
dc.subject.lcshText processing (Computer science)
dc.subject.lcshComputational linguistics.
dc.titleArabic named entity recognition via deep co-learning -
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
t-6653.pdf
Size:
2.41 MB
Format:
Adobe Portable Document Format