中国组织工程研究 ›› 2026, Vol. 30 ›› Issue (6): 1431-1438.doi: 10.12307/2026.569

• 组织构建实验造模 experimental modeling in tissue construction • 上一篇    下一篇

基于纹理特征检索增强的颅脑组织分割模型构建

李金茜1,汪  潮1,窦壮壮1,靳晓珂2,阮世捷1,李  佳1   

  1. 天津科技大学,1人工智能学院,2生物工程学院,天津市  300000

  • 收稿日期:2024-11-28 接受日期:2025-01-25 出版日期:2026-02-28 发布日期:2025-07-15
  • 通讯作者: 阮世捷,教授,天津科技大学,人工智能学院,天津市 300000 通讯作者:李佳,讲师,天津科技大学,人工智能学院,天津市 300000
  • 作者简介:李金茜,1999年生,硕士,主要从事医学图像分割方面的研究。
  • 基金资助:
    天津科学技术局科学技术普及研发项目(22KPXMRC00210)

Construction of craniocerebral tissue segmentation model based on texture feature retrieval enhancement

Li Jinqian1, Wang Chao1, Dou Zhuangzhuang1, Jin Xiaoke2, Ruan Shijie1, Li Jia1   

  1. 1School of Artificial Intelligence, 2School of Bioengineering, Tianjin University of Science and Technology, Tianjin 300000, China
  • Received:2024-11-28 Accepted:2025-01-25 Online:2026-02-28 Published:2025-07-15
  • Contact: Ruan Shijie, Professor, School of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300000, China Co-corresponding author: Li Jia, Lecturer, School of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300000, China
  • About author:Li Jinqian, MS, School of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300000, China
  • Supported by:
    Science and Technology Popularization, research and development Project of Tianjin Science and Technology Bureaus, No. 22KPXMRC00210

摘要:


文题释义:
MedSAM模型:全称Segment Anything in Medical Images,是一种基于深度学习的通用医学影像分割模型,该模型旨在通过实现通用医学图像分割,为医生提供快速、准确的诊断支持。MedSAM模型能够处理各种不同的医学影像和复杂情况,包括不同的成像方式(CT、MRI、X射线片、超声等)和多种疾病类型。
纹理特征:是一种全局特征,刻画了图像中重复出现的局部模式与它们的排列规则,纹理体现了物体表面的具有缓慢变化或周期性变化的表面结构组织排列属性,通过纹理特征,可以对图像的质地如粗糙度、光滑性、随机性和规范性等进行量化描述。

背景:快速、准确的脑组织医学影像分割对于颅脑损伤生物力学三维建模和诊断具有重大意义。目前,基于人工智能的基础模型在大规模数据集上具有卓越的泛化能力,但由于颅脑组织的特异性和复杂性,其在颅脑组织分割应用上有一定的局限性。同时,颅脑组织样本数据的稀缺也使基础模型难以通过微调得到精确的分割结果。
目的:构建基于纹理特征检索增强的颅脑组织分割模型,以提高少量样本条件下的分割准确度。
方法:选取医学图像任意分割模型(MedSAM)作为基础构架,将纹理特征与深度学习相融合构建基于纹理特征检索增强的颅脑组织分割模型(DP-MedSAM),选取Dice系数(Dice Coefficient)和平均交并比评估图像分割结果的效能;消融实验通过与原始MedSAM模型进行比较,系统地评估关键组件对模型性能的影响;对比MedSAM、医学二维图像分割增强模型(SAM-Med2D)和DP-MedSAM在下颚骨、左视神经、左腮腺中的敏感度。
结果与结论:①通过在HaN-Seg数据集上验证点位提示个数对分割结果的影响,实验结果说明增加3个点位,Dice系数最佳;②DP-MedSAM
在2个数据集(HaN-Seg和公共领域计算解剖数据集)上相较于MedSAM和SAM-Med2D都显示出了性能提升,尤其是在公共领域计算解剖数据集上,平均交并比,DP-MedSAM比MedSAM高出6.59%,比SAM-Med2D高出37.35%;Dice系数,DP-MedSAM分别比MedSAM和SAM-Med2D高出4.34%和25.32%;③消融实验结果显示,移除DP-MedSAM模型中的纹理特征提取模块,仅依赖于原始图像特征,在测试集上结果显著下降;再移除了模型的向量缓存库及其检索增强功能,使得模型无法利用外部知识库进行相似性检索,模型性能进一步下降;④DP-MedSAM模型在数据资源有限的情况下,各项评估指标均优于其他2个模型;DP-MedSAM模型在处理简单样本和中等难度样本时表现出色,与其他2个模型相比具有明显的优势,表明模型具有良好的泛化能力;处理困难样本的细微结构对模型的分割能力提出了更高的要求,DP-MedSAM模型的性能略有下降,但仍优于其他2个模型;⑤此次研究提出了一种创新性的颅脑组织分割模型DP-MedSAM,通过引入目标区域纹理特征提取,提高了基础模型捕捉医学图像中的局部细节和全局结构信息方面的表现;通过向量相似性检索技术,DP-MedSAM能够从预先构建的向量数据库中检索出与当前目标区域最为相似的特征向量,从而为分割过程提供了更为精确的引导信息。

https://orcid.org/0009-0007-7027-3572(李金茜)

中国组织工程研究杂志出版内容重点:干细胞;骨髓干细胞;造血干细胞;脂肪干细胞;肿瘤干细胞;胚胎干细胞;脐带脐血干细胞;干细胞诱导;干细胞分化;组织工程

关键词: 颅脑组织, 纹理特征, 医学分割, 基础模型, 检索增强

Abstract: BACKGROUND: Rapid and accurate segmentation of brain tissue in medical images is of great significance for three-dimensional biomechanical modeling and diagnosis of craniocerebral injuries. Currently, artificial intelligence (AI)-based baseline models exhibit excellent generalization capabilities on large-scale datasets. However, due to the specificity and complexity of craniocerebral tissues, these models have certain limitations in their application to craniocerebral tissue segmentation. Additionally, the scarcity of craniocerebral tissue samples makes it difficult for baseline models to achieve precise segmentation results through fine-tuning.
OBJECTIVE: To construct a craniocerebral tissue segmentation model based on texture feature retrieval enhancement to improve segmentation accuracy under a small number of samples.
METHODS: Segment Anything in Medical Images (MedSAM) model was selected as the basic framework, and texture features were combined with deep learning to build a brain tissue segmentation model based on texture feature retrieval enhancement (DP-MedSAM). Dice Coefficient and mean intersection over union (MIoU) were selected to evaluate the efficiency of image segmentation results. In comparison with the original MedSAM model, the ablation experiment systematically evaluated the influence of key components on the model performance. The sensitivities of MedSAM, the Segment Anything Model (SAM) for medical image segmentation (SAM-Med2D) and DP-MedSAM in the mandible, left optic nerve, and left parotid gland were compared.
RESULTS AND CONCLUSION: (1) By verifying the impact of the number of point prompts on segmentation results on the HaN-Seg dataset, the experimental results indicated that the optimal Dice score was achieved with the addition of three points. (2) DP-MedSAM demonstrated performance improvements compared with MedSAM and SAM-Med2D on two datasets (HaN and Public Domain Database for Computational Anatomy). Especially on the Public Domain Database for Computational Anatomy dataset, in terms of the MIoU metric, DP-MedSAM outperformed MedSAM by 6.59% and SAM-Med2D by 37.35%; in terms of the Dice metric, DP-MedSAM outperformed MedSAM and SAM-Med2D by 4.34% and 25.32%, respectively. (3) The ablation experiment results showed that removing the texture feature extraction module in the DP-MedSAM model, relying solely on original image features, led to a significant decrease in results on the test set. Furthermore, removing the vector cache database and its retrieval enhancement function from the model, which deprived the ability of the model to perform similarity retrieval using an external knowledge base, further reduced model performance. (4) Under conditions of limited data resources, the DP-MedSAM model outperformed the other two models in all evaluation metrics. The DP-MedSAM model performed excellently when processing simple and moderately difficult samples, demonstrating a clear advantage over the other two models and indicating good generalization ability. Processing the fine structures of difficult samples placed higher demands on the model’s segmentation capabilities. Although the performance of the DP-MedSAM model declined slightly, it still outperformed the other two models. (5) This study proposes an innovative craniocerebral tissue segmentation model, DP-MedSAM, which improves the baseline model’s performance in capturing local details and global structural information in medical images by introducing target region texture feature extraction. Through vector similarity retrieval technology, DP-MedSAM can retrieve the feature vector most similar to the current target region from a pre-constructed vector database, providing more precise guiding information for the segmentation process.

Key words: craniocerebral, textural features, medical segmentation, base model, retrieval enhancement

中图分类号: