KGFusionX: Linking, Combining, and Exploring Data Through Knowledge Graphs

dc.contributor.AUBidnumber201901223en_US
dc.contributor.advisorZablith, Fouad
dc.contributor.advisorAzad, Bijan
dc.contributor.authorYoussef, Shadi
dc.contributor.commembersTaleb, Sirine
dc.contributor.commembersNasr, Walid
dc.contributor.degreeMSBAen_US
dc.contributor.departmentSuliman S. Olayan School of Businessen_US
dc.contributor.facultySuliman S. Olayan School of Businessen_US
dc.date.accessioned2024-02-07T13:02:52Z
dc.date.available2024-02-07T13:02:52Z
dc.date.issued2024-02-07
dc.date.submitted2024-02-07
dc.description.abstractIn the realm of data exploration, the persistent challenges of data disconnection and inconsistency often hinder the efficiency of data analysts, especially in terms of data enrichment and aggregation. This thesis focuses on addressing the following research questions: How can we improve data integration and reuse of data in a clean and downloadable format to facilitate data analysis? Moreover, how can we contextually expand data on the fly to leverage its value and enhance data exploration? This work proposes KGFusionX, a knowledge graph centered framework that recognizes the time-intensive nature of data enrichment and integration. The study employs a backend implementation utilizing knowledge graphs to seamlessly connect disparate datasets. Several datasets from Lebanon covering different domains (e.g. health care, economy, education, and others) were converted and published as openly accessible knowledge graphs in a triple store repository (749,500 triples). This conversion allows efficient and fast aggregation of data because of the connections generated by knowledge graphs. Also, it is integrated with open linked data sources that serves as a resource to expand the data. The framework is showcased through an online platform built with Streamlit that allows users to select, combine, and download tabular data that can be used in other visualization exploration tools (e.g. PowerBI and Tableau). The approach was evaluated by data analysts and two use cases. Potential pickup of our platform was expressed by users who relied on the tool to analyze school and university challenges in rural areas, in addition to boosting tourism in Lebanon. The results demonstrated a significant improvement in data exploration efficiency, and better visuals with the knowledge graph-driven approach proving successful in overcoming the challenges posed by disconnection, inconsistency, and enrichment. This research primarily contributes to streamlining data exploration using the high potential of knowledge graphs to support data aggregation, data enrichment and visual data analysis.en_US
dc.identifier.urihttp://hdl.handle.net/10938/24322
dc.language.isoen_USen_US
dc.subjectKnowledge graphsen_US
dc.subjectData enrichment
dc.subjectData integration
dc.subject.lcshRDF (Document markup language)
dc.titleKGFusionX: Linking, Combining, and Exploring Data Through Knowledge Graphsen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
YoussefShadi_2024.pdf
Size:
2.27 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: