Graph-Enhanced Medical Question-Answering System Integrating Knowledge Graphs and Large Language Models

Yiqin Huang; Qing Liu; Zhengru Xie; Baopeng Ding; Ziwei Wang; Fenglong Yang

doi:10.55578/joaims.260408.001

Authors

Yiqin Huang School of Medical Imaging, Fujian Medical University, Fuzhou, China Author
Qing Liu School of Medical Imaging, Fujian Medical University, Fuzhou, China Author
Zhengru Xie School of Medical Imaging, Fujian Medical University, Fuzhou, China Author
Baopeng Ding Shanxi Datong University, Engineering Research Center of Coal-Based Ecological Carbon Sequestration Technology of the Ministry of Education, Datong, China; Boskind Research (Shanxi) Technology Co., Ltd., and Shanxi Yanyun Herbal Medicine Technology Co., Ltd Author
Ziwei Wang Biomedical Sciences College & Shandong Medicinal Biotechnology Centre, Shandong First Medical University & Shandong Academy of Medical Sciences, Ji’nan, Shandong, China Author
Fenglong Yang Department of Bioinformatics, Fujian Key Laboratory of Medical Bioinformatics, School of Medical Technology and Engineering, Fujian Medical University, Fuzhou, China; The Fujian Key Laboratory of Medical Bioinformatics, Institute of Precision Medicine, Fujian Medical University, Fuzhou, China; Key Laboratory of Gastrointestinal Cancer (Fujian Medical University), Ministry of Education, Fuzhou, China Author

DOI:

https://doi.org/10.55578/joaims.260408.001

Keywords:

Knowledge graph (KG), Graph algorithms, Large language model (LLM), Intelligent question answering

Abstract

Objective: To address the challenges posed by the rapid growth of medical data and the fragmentation of knowledge, this study aims to construct a medical knowledge graph (KG) and provide efficient knowledge services for clinical practice.

Methods: A total of 44,157 entities and 291,170 relationships from an open-source database were integrated to build a local medical KG based on Neo4j. Graph algorithms including degree centrality, Louvain community detection, K-nearest neighbor, and Dijkstra’s algorithm were applied to analyze the data. The retrieval results from the KG were combined with the Spark Lite model from IFlytek to develop a dual-channel question-and-answer system.

Results: Highly related entities, such as acute urethritis and blood routine tests, were successfully identified. The analysis yielded 35 disease communities and 17 department communities. Highly similar disease pairs, such as “lung abscess” and “pulmonary bullae,” were discovered. Potential therapeutic pathways, such as “Erythromycin Ethylsuccinate Granules - Erythrasma,” were uncovered, revealing clinical associations among various entities. The system is accessible at http://yangbiolab.cn:8054/.

Conclusion: Graph algorithms effectively mine key patterns and potential associations within medical knowledge, with several findings aligning closely with clinical practice. The integrated system offers an intuitive platform for exploring medical knowledge.

References

[1] L. Wang, H. Hao, X. Yan, T.H. Zhou, K.H. Ryu, From biomedical knowledge graph construction to semantic querying: a comprehensive approach, Sci. Rep. 15 (2025), 8523.

[2] G. Zhao, W. Gu, W. Cai, Z. Zhao, X. Zhang, J. Liu, MLEE: a method for extracting object-level medical knowledge graph entities from Chinese clinical records, Front. Genet. 13 (2022), 900242.

[3] H.-C. Hsueh, S.-C. Chien, C.-W. Huang, H.-C. Yang, U. Iqbal, L.-F. Lin, et al., A novel multi-level refined (MLR) knowledge graph design and chatbot system for healthcare applications, PLoS ONE. 19 (2024), e0302620.

[4] H. Zhu, A graph neural network-enhanced knowledge graph framework for intelligent analysis of policing cases, Math. Biosci. Eng. 20 (2023), 11585–11604.

[5] X. Mo, J. Pang, B. Wan, R. Tang, H. Liu, S. Jiang, Multirelational graph contrastive learning with learnable graph augmentation, Neural Netw. 181 (2025), 106757.

[6] R. Johnson, M.M. Li, A. Noori, O. Queen, M. Zitnik, Graph artificial intelligence in medicine, Annu. Rev. Biomed. Data Sci. 7 (2024), 345–368.

[7] X. Zhou, S. Zhang, M. Agarwal, J. Akryod, S. Mosbach, M. Kraft, Marie and BERT—a knowledge graph embedding based question answering system for chemistry, ACS Omega. 8 (2023), 33039–33057.

[8] H. Park, J. Son, J. Min, J. Choi, Selective UMLS knowledge infusion for biomedical question answering, Sci. Rep. 13 (2023), 14214.

[9] C. Fu, X. Pan, J. Wu, J. Cai, Z. Huang, F.v. Harmelen, et al., KG4NH: a comprehensive knowledge graph for question answering in dietary nutrition and human health, IEEE J. Biomed. Health Inform. 29 (2025), 1793–1804.

[10] Y. Feng, L. Zhou, C. Ma, Y. Zheng, R. He, Y. Li, Knowledge graph-based thought: a knowledge graph-enhanced LLM framework for pan-cancer question answering, GigaScience. 14 (2025), giae082.

[11] A. Singhal, Introducing the knowledge graph: things, not strings, Official Google Blog, 2012.

[12] Q. Wang, Z. Mao, B. Wang, L. Guo, Knowledge graph embedding: a survey of approaches and applications, IEEE Trans. Knowl. Data Eng. 29 (2017), 2724–2743.

[13] M. Afshar, Y. Gao, D. Gupta, E. Croxford, D. Demner-Fushman, On the role of the UMLS in supporting diagnosis generation proposedby Large Language Models, J. Biomed. Inform. 157 (2024), 104707.

[14] K. Schatz, D. Korn, A. Tropsha, R. Chirkova, Workflow for domain- and task-sensitive curation of knowledge graphs, with use case of DRKG, 2022 IEEE International Conference on Big Data (Big Data), IEEE, Osaka, Japan, 2022, pp. 3692–3701.

[15] P. Chandak, K. Huang, M. Zitnik, Building a knowledge graph to enable precision medicine, Sci. Data. 10 (2023), 67.

[16] H. Wang, Q. Zu, M. Lu, R. Chen, Z. Yang, Y. Gao, et al., Application of medical knowledge graphs in cardiology and cardiovascular medicine: a brief literature review, Adv. Ther. 39 (2022), 4052–4060.

[17] Y. Lan, S. He, K. Liu, X. Zeng, S. Liu, J. Zhao, Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion, BMC Med. Inform. Decis. Mak. 21 (2021), 335.

[18] D.Y. Li, T.J. Hu, J.L. Li, Q. Qian, W.Y. Zhu, Construction and application of the Chinese unified medical language system, J. Intell. 30 (2011), 147–151.

[19] H.Y. Zan, H.Y. Dou, Y.X. Jia, T.F. Guan, B. Odbayar, K.L. Zhang, et al., Construction of Chinese medical knowledge graph based on multi-source texts, J. Zhengzhou Univ. (Nat. Sci. Ed.), 52 (2020), 45–51.

[20] H. Long, Y. Zhu, L. Jia, B. Gao, J. Liu, L. Liu, et al., An ontological framework for the formalization, organization and usage of TCM-knowledge, BMC Med. Inform. Decis. Mak. 19 (2019), 53.

[21] X. Xue, Y. Mei, B. Zhao, M. Zhang, Adaptive similarity feature construction for ontology matching via multilayer hybrid genetic programming, IEEE Trans. Evol. Comput. 30 (2026), 519–533.

[22] T. Cheng, T. Bi, W. Ji, C. Tian, Graph convolutional network for image restoration: a survey, Mathematics. 12 (2024), 2020.

[23] R. Zeng, C. Zeng, X. Wang, B. Li, X. Chu, Incentive mechanisms in federated learning and a game-theoretical approach, IEEE Netw. 36 (2022), 229–235.

[24] L. Ma, M. Huang, S. Yang, R. Wang, X. Wang, An adaptive localized decision variable analysis approach to large-scale multiobjective and many-objective optimization, IEEE Trans. Cybern. 52 (2022), 6684–6696.

[25] T. Winograd, Five lectures on artificial intelligence, Stanford University, Stanford, CA, USA, 1974.

[26] J. Kupiec, MURAX: a robust linguistic approach for question answering using an on-line encyclopedia, Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, New York, NY, United States, 1993, pp. 181–190.

[27] L. Dong, M. Lapata, Language to logical form with neural attention, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL, Berlin, Germany, 2016, pp. 33–43.

[28] R. Jia, P. Liang, Data recombination for neural semantic parsing, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, ACL, Berlin, Germany, 2016, pp. 12–22.

[29] R. Wang, M. Wang, J. Liu, W. Chen, M. Cochez, S. Decker, Leveraging knowledge graph embeddings for natural language question answering, In: G. Li, J. Yang, J. Gama, J. Natwichai, Y. Tong (Eds.), Database Systems for Advanced Applications, DASFAA 2019, Lecture Notes in Computer Science, Vol. 11446, Springer, Cham, 2019, pp. 659–675.

[30] M.Y. Cao, Q.Q. Li, Z.H. Yang, L. Wang, Y. Zhang, H.F. Lin, et al., A question answering system for primary liver cancer based on knowledge graph, J. Chin. Inf. Process. 33 (2019), 88–93.

[31] Y.Y. Chen, Construction and research of knowledge graph for stroke, syndrome, disease and medicine based on deep learning, Master’s Thesis, Gansu University of Chinese Medicine, Lanzhou, China, 2023.

[32] Z.M. Yi, K. Li, Z.Y. Xu, Z.M. Xu, F. Gao, A construction method and device for personalized medication assistance question answering system for Parkinson’s disease, Patent CN202410698116.0, Beijing, China, 2024.

[33] L.C. Freeman, Centrality in social networks conceptual clarification, Soc. Netw. 1 (1978–1979), 215–239.

[34] V. Blondel, J.-L. Guillaume, R. Lambiotte, Fast unfolding of communities in large networks: 15 years later, J. Stat. Mech. Theory Exp. 2024 (2024), 10R001.

[35] S. Lloyd, Least squares quantization in PCM, IEEE Trans. Inf. Theory. 28 (1982), 129–137.

[36] E.W. Dijkstra, A note on two problems in connexion with graphs, Numer. Math. 1 (1959), 269–271.

[37] H. Lu, M. Halappanavar, A. Kalyanaraman, Parallel heuristics for scalable community detection, Parallel Comput. 47 (2015), 19–37.

[38] D. Liben-Nowell, J. Kleinberg, The link-prediction problem for social networks, J. Am. Soc. Inf. Sci. Technol. 58 (2007), 1019–1031.

[39] B. Perozzi, R. Al-Rfou, S. Skiena, DeepWalk: online learning of social representations, Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, New York, NY, United States, 2014, pp. 701–710.

Graph-Enhanced Medical Question-Answering System Integrating Knowledge Graphs and Large Language Models

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

Latest publications

Information

Language