Classification method of traditional Chinese medicine compound decoction duration based on multi-dimensional feature weighted fusion

Comput Methods Biomech Biomed Engin. 2024 Jan 9:1-15. doi: 10.1080/10255842.2024.2302225. Online ahead of print.

ABSTRACT

This paper extends a natural language processing (NLP) text classification method to the field of traditional Chinese medicine (TCM) compound decoction, in order to classify TCM compound decoction duration effectively and scientifically. Specifically, a TCM compound decoction duration classification method named TCM-TextCNN is proposed, which fuses multi-dimensional herb features and improves TextCNN. First, we use word vector technology to construct feature vectors of herb names and medicinal parts, aiming to describe the herb characteristics comprehensively. Second, because different herb features affect the decoction duration to different degrees, we use an improved Term Frequency-Inverse Word Frequency (TF-IWF) algorithm to weight the feature vectors of herb names and medicinal parts. These weighted feature vectors are then concatenated into a multi-dimensional herb feature vector, which gives a more comprehensive representation. Finally, the feature vector is fed into the improved TextCNN, which replaces max pooling with k-max pooling to reduce information loss. Three fully connected layers are added to generate higher-level feature representations, followed by softmax to obtain the final results. Experimental results on a dataset of TCM compound decoction duration demonstrate that TCM-TextCNN improves accuracy, recall, and F1 score by 5.31%, 5.63...
Source: Computer Methods in Biomechanics and Biomedical Engineering - Category: Biomedical Engineering - Source Type: research
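
The weighting, concatenation, and pooling steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the commonly used TF-IWF form (term frequency within a prescription, multiplied by the log of total corpus term occurrences over the occurrences of that term), whereas the paper uses an improved variant whose details the abstract does not give. The function names tf_iwf_weights, weighted_concat, and k_max_pooling are hypothetical, as are the placeholder herb tokens.

```python
import math
from collections import Counter


def tf_iwf_weights(prescriptions):
    """Return one {term: weight} dict per prescription (standard TF-IWF).

    Assumption: this is the textbook form, not the paper's improved variant.
    TF  = count of term in this prescription / terms in this prescription
    IWF = log(total term occurrences in corpus / occurrences of this term)
    """
    corpus_counts = Counter(t for p in prescriptions for t in p)
    total = sum(corpus_counts.values())
    weights = []
    for p in prescriptions:
        local = Counter(p)
        weights.append({
            t: (c / len(p)) * math.log(total / corpus_counts[t])
            for t, c in local.items()
        })
    return weights


def weighted_concat(name_vec, part_vec, name_w, part_w):
    """Scale each feature vector by its weight, then concatenate them."""
    return [name_w * x for x in name_vec] + [part_w * x for x in part_vec]


def k_max_pooling(feature_map, k):
    """Keep the k largest activations, preserving their original order."""
    if k >= len(feature_map):
        return list(feature_map)
    top = sorted(range(len(feature_map)),
                 key=lambda i: feature_map[i], reverse=True)[:k]
    return [feature_map[i] for i in sorted(top)]


# Hypothetical placeholder tokens standing in for herb names.
corpus = [["herb_a", "herb_b"], ["herb_a", "herb_c"]]
w = tf_iwf_weights(corpus)

# A term that is rarer in the corpus receives a larger weight.
print(w[0]["herb_b"] > w[0]["herb_a"])

# k-max pooling keeps k values per feature map instead of only one.
print(k_max_pooling([1, 5, 2, 4, 3], 2))
```

Unlike max pooling, which would reduce the feature map [1, 5, 2, 4, 3] to the single value 5, k-max pooling with k = 2 retains 5 and 4 in their original order, which is the sense in which it reduces information loss.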