AUB ScholarWorks

Entity and relation extraction from Arabic text using morphological analysis.

Show simple item record

dc.contributor.author Makhlouta, Jad Elia.
dc.date.accessioned 2013-10-02T09:21:54Z
dc.date.available 2013-10-02T09:21:54Z
dc.date.issued 2012
dc.identifier.uri http://hdl.handle.net/10938/9483
dc.description Thesis (M.E.)--American University of Beirut, Department of Electrical and Computer Engineering, 2012.
dc.description Advisor : Dr. Fadi Zaraket, Assistant Professor, Electrical and Computer Engineering--Members of Committee : Dr. Hasan Artail, Professor, Electrical and Computer Engineering ; Dr. Ibrahim Abou Faycal, Associate Professor, Electrical and Computer Engineering.
dc.description Includes bibliographical references (leaves 60-68)
dc.description.abstract Natural Language Processing concerns the automatic understanding of natural languages, such as entity and relational extraction from text documents. Morphological analysis is key for Arabic natural language processing since Arabic is a morphologically rich language. Researchers used techniques that work well for the Latin based languages such as local grammars, statistical learning models, pattern matching, and rule- based techniques to automatically extract named entities from Arabic text. These techniques need to boost their results by using application specific corpora, parallel language corpora, and morphological stemming analysis. We present a method for extracting entities and events, from Arabic text using a hierarchy of finite state machines driven by morphological features such as part of speech and gloss tags. We also use graph transformation algorithms and cross-document analysis to enhance entity extraction and extract relations amongst entities. We evaluated our method on three natural language processing applications. (1) We automated the extraction of temporal entities. (2) We automated the extraction of narrators, narrator relations and corresponding biographies from several corpora of Islamic hadith and biography books. (3) We automated the extraction of genealogical family trees from Biblical texts. In all applications, our method reports high precision and recall and learns lemmas about phrases that improve results.
dc.format.extent ix, 68 leaves : ill. (some col.) ; 30 cm.
dc.language.iso eng
dc.relation.ispartof Theses, Dissertations, and Projects
dc.subject.classification ET:005682 AUBNO
dc.subject.lcsh Arabic language -- Morphology.
dc.subject.lcsh Natural language processing (Computer science)
dc.subject.lcsh Artificial intelligence.
dc.subject.lcsh Computational linguistics.
dc.subject.lcsh Data mining.
dc.title Entity and relation extraction from Arabic text using morphological analysis.
dc.type Thesis
dc.contributor.department American University of Beirut. Faculty of Engineering and Architecture. Department of Electrical and Computer Engineering.


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search AUB ScholarWorks


Browse

My Account