STEMM Institute Press
Science, Technology, Engineering, Management and Medicine
Tumor Event Extraction based on Semantic Features and Its Application in Auxiliary Diagnosis and Treatment
DOI: https://doi.org/10.62517/jbdc.202601210
Author(s)
Xiaoxue Li*, Bairu Jia, Yao Liu
Affiliation(s)
School of Information Engineering, Shaanxi University of International Trade & Commerce, Xi’an, Shaanxi, China * Corresponding Author
Abstract
To address the challenge of utilizing unstructured text in tumor electronic medical records for clinical decision-making, this study proposes a medical event extraction method based on fused semantic features and designs an oncology-assisted diagnosis and treatment visualization system. First, a pre-trained medical event extraction model is employed to automatically extract key event information from tumor records, including tumor type, lesion size, location, grade, and metastasis. Second, an auxiliary diagnosis and treatment system is developed, incorporating functions such as tumor grading, metastasis detection, personalized treatment plan formulation, prognosis evaluation, and data statistical analysis. Finally, a user interface is designed using Qt Designer to visualize the workflow of record input, model invocation, diagnostic recommendation generation, and evaluation. Practical applications demonstrate that this system effectively assists clinicians in rapidly extracting critical information from records and generating reasonable treatment plans, enhancing the efficiency and interpretability of tumor diagnosis and treatment while validating the feasibility of medical event extraction technology in clinical oncology assistance.
Keywords
Tumor Event Extraction; Semantic Feature Fusion; Auxiliary Diagnosis and Treatment; Electronic Medical Records; Visualization System.
References
[1] Bray F, Laversanne M, Sung H, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians, 2024, 74(3): 229-263. [2] Goyal N, Singh N .Named entity recognition and relationship extraction for biomedical text: A comprehensive survey, recent advancements, and future research directions. Neurocomputing, 2025, 618:129171. [3] Bilal M, Hamza A, Malik N, et al. NLP for Analyzing Electronic Health Records and Clinical Notes in Cancer Research: A Review. Journal of Pain and Symptom Management, 2025. [4] Kim M, Kim S, Park B .Leveraging BERT and Network-Based Attention Analysis for Identifying Treatment Milestones in EHRs. 2024 IEEE International Conference on Big Data (BigData), 2024:8707-8709. [5] Ji X R, Wei D J, Zhang J Z, Zhang S, Cao H. Research progress on information extraction methods for Chinese electronic medical records. Computer Engineering and Science, 2024, 46(2): 325-337. [6] An B, Zhang H, Ma L, Zhao Z. End-to-end Chinese clinical event extraction based on large language model. Scientific Reports, 2025, 15(1): 16943. [7] Fu Y, Ramachandran G K, Halwani A, et al. CACER: Clinical Concept Annotations for Cancer Events and Relations. Journal of the American Medical Informatics Association, 2024: ocae231. [8] Namasivayam G, et al. Use of large language models to extract cancer diagnosis, histology, grade, and staging from unstructured electronic health records. Journal of Clinical Oncology, 2025, 43(16_suppl): 11176. [9] Yao J, Goldner E, Hochheiser H,et al. Systemic Anticancer Therapy Timelines Extraction From Electronic Medical Records Text: Algorithm Development and Validation. JMIR Bioinformatics & Biotechnology, 2025, 6(1). [10]Gangopadhyay S, et al. Clinical Decision Support and Natural Language Processing in Medicine: Systematic Literature Review. JMIR Medical Informatics, 2024. [11]Ren W, Zhu J, Liu Z, et al. A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models. 2025. [12]Fu Y, Ramachandran G K, Halwani A, et al. CACER: Clinical Concept Annotations for Cancer Events and Relations. Journal of the American Medical Informatics Association, 2024: ocae231. [13]Yao J, Goldner E, Hochheiser H, et al. Systemic Anticancer Therapy Timelines Extraction From Electronic Medical Records Text: Algorithm Development and Validation. JMIR Bioinformatics & Biotechnology, 2025, 6(1). [14]Delourme S, Redjdal A, Bouaud J, et al. Leveraging Guideline-Based Clinical Decision Support Systems with Large Language Models: A Case Study with Breast Cancer. Methods of Information in Medicine, 2024, 63(03/04): 085-096. [15]Niu M, He Z, Xu X, et al. Liver cancer knowledge graph construction based on dynamic entity replacement and masking strategies RoBERTa-wwm-large-BiLSTM-CRF model with clinical Chinese EMRs. Frontiers in Artificial Intelligence, 2025, 8: 1663877.
Copyright @ 2020-2035 STEMM Institute Press All Rights Reserved