期刊:
Journal of Biomedical Informatics,2019年91:103114 ISSN:1532-0464
通讯作者:
Wang, Jianxin
作者机构:
[Fei, Zhihui; Li, Min; Yu, Ying; Liu, Liangliang; Wang, Jianxin] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China.;[Yu, Ying] Univ South China, Sch Comp Sci & Technol, Hengyang 421001, Peoples R China.;[Wu, Fang-Xiang] Univ Saskatchewan, Div Biomed Engn, Saskatoon, SK S7N 5A9, Canada.;[Wu, Fang-Xiang] Univ Saskatchewan, Dept Mech Engn, Saskatoon, SK S7N 5A9, Canada.
通讯机构:
[Wang, Jianxin] C;Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China.
关键词:
Codes (symbols);Embeddings;Multilayers;Semantics;Bidirectional recurrent neural networks;Character-enhanced;Clinical notes;Electronic health record;Hierarchical approach;ICD code;International classification of disease;Neural network model;Recurrent neural networks;Article;artificial neural network;attention;Chinese (language);hospital admission;human;International Classification of Diseases;linguistics;machine learning;medical record;multilayer attention bidirectional recurrent neural network;performance;prediction;priority journal;problem solving;short term memory;automation;China;electronic health record;information processing;machine learning;Automation;China;Datasets as Topic;Electronic Health Records;International Classification of Diseases;Machine Learning
摘要:
International Classification of Diseases (ICD) code is an important label of electronic health record. The automatic ICD code assignment based on the narrative of clinical documents is an essential task which has drawn much attention recently. When Chinese clinical notes are the input corpus, the nature of Chinese brings some issues that need to be considered, such as the accuracy of word segmentation and the representation of single Chinese characters which contain semantics. Taking the lengthy text of patient notes and the representation of Chinese words into account, we present a multilayer attention bidirectional recurrent neural network (MA-BiRNN) model to implement the assignment of disease codes. A hierarchical approach is used to represent the feature of discharge summaries without manual feature engineering. The combination of character level embedding and word level embedding can improve the representation of words. Attention mechanism is introduced into bidirectional long short term memory networks, which helps to solve the performance dropping problem when plain recurrent neural networks encounter long text sequences. The experiment is carried out on a real-world dataset containing 7732 admission records in Chinese and 1177 unique ICD-10 labels. The proposed model achieves 0.639 and 0.766 in F1-score on full-level code and block-level code, respectively. It outperforms the baseline neural network models and achieves the lowest Hamming loss value. Ablation analysis indicates that the multilevel attention mechanism plays a decisive role in the system for dealing with Chinese clinical notes.
期刊:
International Journal of Multimedia and Ubiquitous Engineering,2014年9(11):385-396 ISSN:1975-0080
通讯作者:
Wen, Zhou
作者机构:
[Ouyang C.; Wen Zhou; Yu Y.; Liu Zhiming; Yang Xiaohua] School of Computer Science and Technology, University of South China, Hunan Hengyang, 421001, China
通讯机构:
School of Computer Science and Technology, University of South China, Hunan Hengyang, China
摘要:
We studied the electronic communication of knowledge users collaborating on a community and found that their work and interactions were mediated by the use of tag. Drawing on these, we found social ta
摘要:
Using the genre perspective, we studied the electronic communication of knowledge users collaborating on a movie community and found that their work and interactions were mediated by the use of genres. Drawing on these findings, we develop the concept of genre repertoire to designate the set of genres enacted by groups, organizations, or communities to accomplish their work. Automatic discourse classification according to genre in social information sharing, transfer and knowledge communication provides a higher level of service quality. By investigating user behavior in movie community, the relationship between intertextuality of discourse genre and user behavior was studied. We denoted genre by using vector, and discourse genre intertextuality intensity is measured with vector distance. And for those discourse which genre is unknown, genre intertextuality is calculated using user behavior. The results show that user various behaviors stickiness in movie community and discourse genre intertextuality intensity have potential common features.
摘要:
传统的基于关键词的信息检索不能理解用户的需要,仅仅对关键词进行简单的匹配,其结果往往包含大量与用户实际需要毫不相干的信息,同时却丢失用户实际需要的信息,使得检索的效率很低.基于本体的语义检索技术的出现,弥补了基于关键词检索的不足,成为目前构建信息检索系统的应用热点.本文主要针对燃气管网的材料腐蚀信息,设计一个基于GIS(Geography Information System)的管网材料腐蚀信息语义检索系统,使用户检索管网空间数据和腐蚀数据时为其提供相关数据的语义信息,同时也使得检索结果更加符合用户需求.