为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速览选取了来自厦门大学的长尾学习 (Long-tailed learning)的工作,该论文由李梦柯老师、卢杨老师、张晓明老师、王菡子老师共同指导,论文一作金焱同学录制。 论文题目:Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation 作者列表: 金焱 (厦门大学),李梦柯 (深圳光明实验室),卢杨 (厦门大学),张晓明 (香港浸会大学),王菡子 (厦门大学) B站观看网址: 论文摘要: Deep neural networks have made huge progress in the last few decades. However, as the real-world data often exhibits a long-tailed distribution, vanilla deep models tend to be heavily biased toward the majority classes. To address this problem, state-of-the-art methods usually adopt a mixture of experts (MoE) to focus on different parts of the long-tailed distribution. Experts in these methods are with the same model depth, which neglects the fact that different classes may have different preferences to be fit by models with different depths. To this end, we propose a novel MoE-based method called Self-Heterogeneous Integration with Knowledge Excavation (SHIKE). We first propose Depth-wise Knowledge Fusion (DKF) to fuse features between different shallow parts and the deep part in one network for each expert, which makes experts more diverse in terms of representation. Based on DKF, we further propose Dynamic Knowledge Transfer (DKT) to reduce the influence of the hardest negative class that has a non-negligible impact on the tail classes in our MoE framework. As a result, the classification accuracy of long-tailed data can be significantly improved, especially for the tail classes. SHIKE achieves the state-of-the-art performance of 56.3%, 60.3%, 75.4%, and 41.9% on CIFAR100-LT (IF100), ImageNet-LT, iNaturalist 2018, and Places-LT, respectively. 论文信息: [1] Yan Jin, Mengke Li, Yang Lu, Yiu-ming Cheung, and Hanzi Wang, “Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation,” in Proceedings of IEEE/ CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, June 18-22, 2023. 论文链接: [https://arxiv.org/abs/2304.01279] 代码链接: [https://github.com/jinyan-06/SHIKE] 视频讲者简介: 金焱,厦门大学信息学院硕士生。主要研究方向为深度长尾学习。 特别鸣谢本次论文速览主要组织者: 月度轮值AC:叶茫 (武汉大学) 活动参与方式 1、VALSE每周举行的Webinar活动依托B站直播平台进行,欢迎在B站搜索VALSE_Webinar关注我们! 直播地址: https://live.bilibili.com/22300737; 历史视频观看地址: https://space.bilibili.com/562085182/ 2、VALSE Webinar活动通常每周三晚上20:00进行,但偶尔会因为讲者时区问题略有调整,为方便您参加活动,请关注VALSE微信公众号:valse_wechat 或加入VALSE QQ S群,群号:317920537); *注:申请加入VALSE QQ群时需验证姓名、单位和身份,缺一不可。入群后,请实名,姓名身份单位。身份:学校及科研单位人员T;企业研发I;博士D;硕士M。 3、VALSE微信公众号一般会在每周四发布下一周Webinar报告的通知。 4、您也可以通过访问VALSE主页:http://valser.org/ 直接查看Webinar活动信息。Webinar报告的PPT(经讲者允许后),会在VALSE官网每期报告通知的最下方更新。 |
小黑屋|手机版|Archiver|Vision And Learning SEminar
GMT+8, 2024-11-23 16:18 , Processed in 0.014470 second(s), 14 queries .
Powered by Discuz! X3.4
Copyright © 2001-2020, Tencent Cloud.