VALSE

VALSE 首页 活动通知 查看内容

VALSE 论文速览 第161期:用于场景文本识别的自监督字符到字符蒸馏网络 ...

2024-1-9 19:37| 发布者: 程一-计算所| 查看: 125| 评论: 0

摘要: 为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速 ...

为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速览选取了来自上海交通大学的场景文本识别 (Scene Text Recognition)的工作。该工作由沈为副教授和杨小康教授指导,论文一作官同坤同学录制。


论文题目:

Self-Supervised Character-to-Character Distillation for Text Recognition

作者列表:

官同坤 (上海交通大学)、沈为 (上海交通大学)、杨学 (上海交通大学)、冯琦 (上海交通大学)、姜泽坤 (上海交通大学)、杨小康 (上海交通大学)


B站观看网址:

https://www.bilibili.com/video/BV1zQ4y1j7YT/



论文摘要:

After many years of supervised text recognition research, one question in the community is how to explore the potential of unlabeled real images by self-supervised learning. This talk will present our recent work in self-supervised text recognition from a different perspective: taking characters rather than sequences as the basic items for representation learning. We will also discuss how to keep their feature consistency learning under flexible augmentations.


参考文献:

[1] Tongkun Guan, Wei Shen, Xue Yang, Qi Feng, Zekun Jiang, Xiaokang Yang, “Self-Supervised Character-to-Character Distillation for Text Recognition,” in Proceeding of IEEE International Conference on Computer Vision (ICCV 2023), Paris, France, October 2023.


论文链接:

[https://openaccess.thecvf.com/content/ICCV2023/papers/Guan_Self-Supervised_Character-to-Character_Distillation_for_Text_Recognition_ICCV_2023_paper.pdf]


代码链接:

[https://github.com/TongkunGuan/CCD]


视频讲者简介:

Tongkun Guan is a PhD student at the Artificial Intelligence Institute, Shanghai Jiao Tong University. He received the M.S. degree in the Department of Automation from Shanghai Jiao Tong University, Shanghai, China, in 2023, and received the B.S. degree in Electrical Engineering and Automation from Hunan University, Changsha, China, in 2020. His research interests mainly include computer vision, text detection, and text recognition.


个人主页:

https://tongkunguan.github.io/



特别鸣谢本次论文速览主要组织者:

月度轮值AC:李爽 (北京理工大学)

小黑屋|手机版|Archiver|Vision And Learning SEminar

GMT+8, 2024-5-14 19:57 , Processed in 0.015511 second(s), 14 queries .

Powered by Discuz! X3.4

Copyright © 2001-2020, Tencent Cloud.

返回顶部