VALSE

VALSE 首页 活动通知 查看内容

VALSE 论文速览 第205期:监督式3D模型在医疗图像分析任务中的应用 ...

2025-1-13 19:10| 发布者: 程一-计算所| 查看: 39| 评论: 0

摘要: 为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速 ...

为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速览选取了来自约翰霍普金斯大学的医疗图像分析 (Medical Image Analysis)的工作。该工作由约翰霍普金斯大学AlanYuille周纵苇博士指导,论文一作李文璇同学录制。


论文题目:

How well do supervised models transfer to 3d image segmentation?

作者列表:

李文璇 (约翰霍普金斯大学),Alan Yuille (约翰霍普金斯大学),周纵苇 (约翰霍普金斯大学)

B站观看网址:

https://www.bilibili.com/video/BV1BDxNeeEuB/?spm_id_from=333.999.0.0&vd_source=2044ac986b5caa4c8b5f00b525441df3


复制链接到浏览器打开或点击阅读原文即可跳转至观看页面。


论文摘要:

The pre-training and fine-tuning paradigm has become prominent in transfer learning. For example, if the model is pre-trained on ImageNet and then fine-tuned to PASCAL, it can significantly outperform that trained on PASCAL from scratch. While ImageNet pre-training has shown enormous success, it is formed in 2D and the learned features are for classification tasks; when transferring to more diverse tasks, like 3D image segmentation, its performance is inevitably compromised due to the deviation from the original ImageNet context. A significant challenge lies in the lack of large, annotated 3D datasets rivaling the scale of ImageNet for model pre-training. To overcome this challenge, we make two contributions. Firstly, we construct AbdomenAtlas 1.1 that comprises 9,262 three-dimensional computed tomography (CT) volumes with high-quality, per-voxel annotations of 25 anatomical structures and pseudo annotations of seven tumor types. Secondly, we develop a suite of models that are pre-trained on our AbdomenAtlas 1.1 for transfer learning. Our preliminary analyses indicate that the model trained only with 21 CT volumes, 672 masks, and 40 GPU hours has a transfer learning ability similar to the model trained with 5,050 (unlabeled) CT volumes and 1,152 GPU hours. More importantly, the transfer learning ability of supervised models can further scale up with larger annotated datasets, achieving significantly better performance than preexisting pre-trained models, irrespective of their pre-training methodologies or data sources. We hope this study can facilitate collective efforts in constructing larger 3D vision datasets and more releases of supervised pre-trained models.


参考文献:

[1] Li, Wenxuan, Alan Yuille, and Zongwei Zhou. "How well do supervised models transfer to 3d image segmentation?." In The Twelfth International Conference on Learning Representations. 2024.


论文链接:

[https://www.cs.jhu.edu/~alanlab/Pubs23/li2023transitioning.pdf]


代码链接:

[https://github.com/MrGiovanni/SuPreM]


视频讲者简介:

Wenxuan Li, a student of Computer Science from Johns Hopkins University, is mentored by Professor Alan Yuille and Dr. Zongwei Zhou. Her research focuses on utilizing large-scale datasets and annotations to enhance medical image analysis. She has published several papers in top conferences such as ICLR and RSNA. She has also served as a leader for organizing several challenges, including ISBI and MICCAI.


个人主页:

https://github.com/MrGiovanni/SuPreM/tree/main



特别鸣谢本次论文速览主要组织者:

月度轮值AC:武宇 (武汉大学)


活动参与方式

1、VALSE每周举行的Webinar活动依托B站直播平台进行,欢迎在B站搜索VALSE_Webinar关注我们!

直播地址:

https://live.bilibili.com/22300737;

历史视频观看地址:

https://space.bilibili.com/562085182/ 


2、VALSE Webinar活动通常每周三晚上20:00进行,但偶尔会因为讲者时区问题略有调整,为方便您参加活动,请关注VALSE微信公众号:valse_wechat 或加入VALSE QQ T群,群号:863867505);


*注:申请加入VALSE QQ群时需验证姓名、单位和身份缺一不可。入群后,请实名,姓名身份单位。身份:学校及科研单位人员T;企业研发I;博士D;硕士M。


3、VALSE微信公众号一般会在每周四发布下一周Webinar报告的通知。


4您也可以通过访问VALSE主页:http://valser.org/ 直接查看Webinar活动信息。Webinar报告的PPT(经讲者允许后),会在VALSE官网每期报告通知的最下方更新。

小黑屋|手机版|Archiver|Vision And Learning SEminar

GMT+8, 2025-1-31 05:12 , Processed in 0.014070 second(s), 14 queries .

Powered by Discuz! X3.4

Copyright © 2001-2020, Tencent Cloud.

返回顶部