Open this publication in new window or tab >>2025 (English)Conference paper, Oral presentation with published abstract (Other academic)
Abstract [en]
Large Language Models (LLMs) (Brown et al., 2020) have rapidly advanced, revolutionized and substantially reshaped automated AI-driven knowledge discovery methodologies. However, the opaque embeddings of LLMs often lack semantic interpretability and integration with disparate enterprise knowledge sources. Organizations continue to experience fragmentation across information ecosystems arising from isolated data silos and the continual evolution of semantic standards. There is an urgent need for platforms that holistically bridge LLMs’ embeddings, enterprise data, and semantic interoperability. Recently, the Knowledge Graphs (KGs) (Zhong et al., 2023) and LLMs complement each other and are emerging as a critical pathway for intelligent, context-aware, domain-centric AI-powered applications development (e.g., conversational agents (CAs), recommendation systems) (TudoCar et al., 2020) in various domains, especially in healthcare. Existing data integration approaches (Elmore et al., 2015) frequently conceptualize KGs as static and isolated repositories, thereby overlooking their potential as dynamic, conversational platforms. This Treo talk highlights the importance of the proposed Knowledge Graph as a Platform (KGaaP) (Figure 1) as a solution, where various domain-centric applications can be developed, embedding LLMs and leveraging the federated semantic knowledge lake (FSKL) framework. This federated, semantic, and interoperable environment embeds domain-specific knowledge into LLMs workflows, enabling dynamic, multi-source conversational intelligence. Recently, most organizations have increasingly relied on LLMs for decision support. Yet, LLM outputs are prone to hallucinations and lack explainability when disconnected from disparate data sources, including structured, unstructured, and authoritative factual knowledge. To avoid hallucination, semantic technological landscape resources, including ontologies, KGs, RDF datasets, and linked open data vocabularies, remain underutilized and play a pivotal role in harmonizing, and consolidating data fragmentation across organizational silos and incompatibilities in schema design efficiently. Addressing these challenges requires a unifying framework to bridge AI reasoning with federated, semantically interoperable knowledge assets in a scalable and query-efficient manner. The KGaaP operationalizes a FSKL Framework by utilizing: i) Ontop (a virtual KGs system) and Federated Query Engines like, FedX to unify heterogeneous SPARQL endpoints services; ii) Semantic Interoperability Layers using domain ontologies (e.g., SNOMED CT, LOINC) for constructing federated ontology and SHACL validation; iii) LLM-KG Embedding Integration, where vector embeddings are aligned with KG entities, enabling hybrid retrieval (combination of symbolic and neural); iv) Conversational Agent Layer that supports RAG (Retrieval-Augmented Generation) pipelines, dynamically grounding LLM responses in federated KG queries with provenance metadata. This research leads new avenues for IS research, how KGs plus LLMs as active participants in digital transformation, advancing explainable AI, cross-organizational knowledge interoperability, and data governance. The KGaaP paradigm supports emerging IS demands for trustworthy and federated AI ecosystems.
National Category
Information Systems
Identifiers
urn:nbn:se:hh:diva-58641 (URN)
Conference
International Conference on Information Systems (ICIS) 2025, Nashville, USA, December 14-17, 2025
2026-03-272026-03-272026-05-20Bibliographically approved