This is the project of the paper "A Non-Invasive Interpretable NAFLD Diagnostic Method Combining TCM Tongue Diagnosis", this paper can be get:https://arxiv.org/abs/2309.02959.
Non-alcoholic fatty liver disease (NAFLD) is a clinicopathological syndrome characterized by hepatic steatosis resulting from the exclusion of alcohol and other identifiable liver-damaging factors. It has emerged as a leading cause of chronic liver disease worldwide. Currently, the conventional methods for NAFLD detection are expensive and not suitable for users to perform daily diagnostics. To address this issue, this study proposes a non-invasive and interpretable NAFLD diagnostic method. This method involves extracting patients' non-invasive physiological indicators (e.g., Height, Weight, BMI, etc.) and tongue features, which are subsequently integrated using a fusion network referred to as SelectorNet. SelectorNet can autonomously learn to process different features, providing attention scores for different attributes from the network for each case. The experimental results show that the proposed method achieves an accuracy of 77.22% using only non-invasive data, and it also provides compelling interpretability matrices. This study contributes to the early diagnosis of NAFLD and the intelligent advancement of TCM tongue diagnosis.
This method requires only the tongue im- age along with readily obtainable physiological indicators to diagnose NAFLD. Apart from the calculable indicators, the required user-provided indicators are only Gender, Age, Height, Weight, Waist Circumference, and Hip Circumference. Initially, the patient’s tongue image is automatically seg- mented. Subsequently, the segmented tongue image undergoes two modes of feature extraction: one involves custom-defined disease relevance feature extraction methods based on prior research experience, while the other employs a ResNet34 pre-trained on ImageNet21k to generate features, thereby preventing the loss of latent disease-related features. Finally, the custom-extracted tongue image features are concatenated with the patient’s physiological indicators as input to Selec- torNet. Meanwhile, the features derived from ResNet34 are fused in the final stage of SelectorNet to yield the ultimate diagnostic result.
Due to concerns regarding patient privacy, we cannot disclose patient information and tongue images publicly.
1.Tongue Segmentation
This paper uses TongueSAM as the tongue segmentation tool, and you can obtain the model and pre-trained model from here.
2.Feature Exaction
The pre-trained ResNet34 used in the paper is sourced from here, and their project provides both the model and pre-trained models.
The tongue coat and tongue body segmentation method used in the article is ./split_body_coat.py, and an example of the segmentation is shown in the following figure.
For the color feature extraction and morphological feature extraction of tongue images, basic image processing algorithms are employed, treating the images as arrays for calculation. Further elaboration on these methods is not provided here.
For the texture features, Contrast (CON), Angular Second Moment (ASM), Entropy (ENT), and Mean (MEAN), this study employs approaches based on the skimage library. We utilized a pre-trained YOLOX model to extract the remaining texture features. For privacy reasons, we are unable to disclose the training dataset used, but we have made the pre-trained model publicly available here. The YOLOX we use from here. Please load the ./yolox.pth as instructed and use our class file ./tongue_classes.txt.
3.SelectorNet
The model architecture of SelectorNet can be found in ./SelectorNet.py, and the methods compared in Experiment A of the paper are sourced from here. The hyperparameter settings are provided in ./best_params.yml.
This work was supported by the Key Project of National Key R&D Project (No.2017YFC1703303); Industry-University- Research Cooperation Project of Fujian Science and Tech- nology Planning (No.2022H6012); Industry-University-Rese arch Cooperation Project of Ningde City and Xiamen Uni- versity (No.2020C001); Natural Science Foundation of Fujian Province of China (No.2020J01435, No.2021J011169).
This project is licensed under the MIT LICENSE.
If you want to reference this paper, please refer to the following format:
@misc{cao2023noninvasive,
title={A Non-Invasive Interpretable NAFLD Diagnostic Method Combining TCM Tongue Features},
author={Shan Cao and Qunsheng Ruan and Qingfeng Wu},
year={2023},
eprint={2309.02959},
archivePrefix={arXiv},
primaryClass={eess.IV}
}




