dc.contributor.author |
Makhlouta, Jad Elia. |
dc.date.accessioned |
2013-10-02T09:21:54Z |
dc.date.available |
2013-10-02T09:21:54Z |
dc.date.issued |
2012 |
dc.identifier.uri |
http://hdl.handle.net/10938/9483 |
dc.description |
Thesis (M.E.)--American University of Beirut, Department of Electrical and Computer Engineering, 2012. |
dc.description |
Advisor : Dr. Fadi Zaraket, Assistant Professor, Electrical and Computer Engineering--Members of Committee : Dr. Hasan Artail, Professor, Electrical and Computer Engineering ; Dr. Ibrahim Abou Faycal, Associate Professor, Electrical and Computer Engineering. |
dc.description |
Includes bibliographical references (leaves 60-68) |
dc.description.abstract |
Natural Language Processing concerns the automatic understanding of natural languages, such as entity and relational extraction from text documents. Morphological analysis is key for Arabic natural language processing since Arabic is a morphologically rich language. Researchers used techniques that work well for the Latin based languages such as local grammars, statistical learning models, pattern matching, and rule- based techniques to automatically extract named entities from Arabic text. These techniques need to boost their results by using application specific corpora, parallel language corpora, and morphological stemming analysis. We present a method for extracting entities and events, from Arabic text using a hierarchy of finite state machines driven by morphological features such as part of speech and gloss tags. We also use graph transformation algorithms and cross-document analysis to enhance entity extraction and extract relations amongst entities. We evaluated our method on three natural language processing applications. (1) We automated the extraction of temporal entities. (2) We automated the extraction of narrators, narrator relations and corresponding biographies from several corpora of Islamic hadith and biography books. (3) We automated the extraction of genealogical family trees from Biblical texts. In all applications, our method reports high precision and recall and learns lemmas about phrases that improve results. |
dc.format.extent |
ix, 68 leaves : ill. (some col.) ; 30 cm. |
dc.language.iso |
eng |
dc.relation.ispartof |
Theses, Dissertations, and Projects |
dc.subject.classification |
ET:005682 AUBNO |
dc.subject.lcsh |
Arabic language -- Morphology. |
dc.subject.lcsh |
Natural language processing (Computer science) |
dc.subject.lcsh |
Artificial intelligence. |
dc.subject.lcsh |
Computational linguistics. |
dc.subject.lcsh |
Data mining. |
dc.title |
Entity and relation extraction from Arabic text using morphological analysis. |
dc.type |
Thesis |
dc.contributor.department |
American University of Beirut. Faculty of Engineering and Architecture. Department of Electrical and Computer Engineering. |