VALSE

VALSE Paper Digest, Issue 138: HumanBench: Towards General Human-centric Models

2023-10-20 12:07 | Posted by: 程一-计算所

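To help practitioners in vision and learning keep up with the latest developments and technical advances in the field, VALSE has launched the Paper Digest (《论文速览》) column. Each week it releases recorded videos for one or two papers from top conferences and journals, with each video giving a detailed walkthrough of a single piece of recent work. This issue features work from the University of Sydney on developing general human-centric models. The work was supervised by Researcher Lei Bai and Professor Wanli Ouyang, and the video was recorded by the paper's first author, Shixiang Tang.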


Paper title: HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining

Authors:

Shixiang Tang (The University of Sydney), Cheng Chen (SenseTime), Qingsong Xie (SenseTime), Meilin Chen (Zhejiang University), Yizhou Wang (Zhejiang University), Yuanzheng Ci (The University of Sydney), Lei Bai (The University of Sydney), Feng Zhu (SenseTime), Haiyang Yang (SenseTime), Li Yi (SenseTime), Rui Zhao (SenseTime), Wanli Ouyang (The University of Sydney)


Watch on Bilibili:

https://www.bilibili.com/video/BV128411k7WD/



Abstract:

Human-centric perceptions include a variety of vision tasks, which have widespread industrial applications, including surveillance, autonomous driving, and the metaverse. It is desirable to have a general pretrain model for versatile human-centric downstream tasks. This paper forges ahead along this path from the aspects of both benchmark and pretraining methods. Specifically, we propose a HumanBench based on existing datasets to comprehensively evaluate on the common ground the generalization abilities of different pretraining methods on 19 datasets from 6 diverse downstream tasks, including person ReID, pose estimation, human parsing, pedestrian attribute recognition, pedestrian detection, and crowd counting. To learn both coarse-grained and fine-grained knowledge in human bodies, we further propose a Projector AssisTed Hierarchical pretraining method (PATH) to learn diverse knowledge at different granularity levels. Comprehensive evaluations on HumanBench show that our PATH achieves new state-of-the-art results on 17 downstream datasets and on-par results on the other 2 datasets. 


Paper information:

[1] Shixiang Tang, Cheng Chen, Qingsong Xie, Meilin Chen, Yizhou Wang, Yuanzheng Ci, Lei Bai, Feng Zhu, Haiyang Yang, Li Yi, Rui Zhao, Wanli Ouyang, HumanBench: Towards General Human-centric Perception with Projector Assisted Pretraining, accepted at CVPR 2023.


Paper link:

[https://arxiv.org/abs/2303.05675]


Code link:

[https://github.com/OpenGVLab/HumanBench]


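About the speaker:

Shixiang Tang is a PhD student at the School of Information Science and Engineering, The University of Sydney. Main research interests: unsupervised learning and the development of human-centric generalist models.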



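Special thanks to the main organizers of this Paper Digest issue:

Monthly rotating AC: Jie Qin (秦杰), Nanjing University of Aeronautics and Astronautics

Quarterly rotating AC: Mang Ye (叶茫), Wuhan University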


How to participate

1. VALSE's weekly Webinar is streamed live on Bilibili. Search for VALSE_Webinar on Bilibili to follow us!

Live stream:

https://live.bilibili.com/22300737

Past videos:

https://space.bilibili.com/562085182/


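2. VALSE Webinars usually take place on Wednesdays at 20:00, though the time is occasionally adjusted to accommodate a speaker's time zone. To make it easier to attend, follow the VALSE WeChat official account (valse_wechat) or join the VALSE QQ S group (group number: 317920537).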


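*Note: When applying to join the VALSE QQ group, you must provide your name, affiliation, and role; all three are required. After joining, please set your display name to your real name, role, and affiliation. Role codes: T for university and research institute staff; I for industry R&D; D for PhD students; M for master's students.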


3. The VALSE WeChat official account usually posts the announcement for the following week's Webinar talk every Thursday.


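4. You can also visit the VALSE homepage at http://valser.org/ to view Webinar information directly. Slides from Webinar talks (when the speaker permits) are posted at the bottom of each talk announcement on the VALSE website.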
