VALSE

VALSE 首页 活动通知 查看内容

VALSE 论文速览 第222期:In-Context Matting

2025-9-9 12:07| 发布者: 程一-计算所| 查看: 19| 评论: 0

摘要: 为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。这项研究提出了 ...

为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。这项研究提出了一种新颖的图像抠图任务设置——“in-context matting” (上下文抠图)。该设置通过给定一张特定前景的参考图像和引导先验,在一批同类前景的目标图像上实现自动alpha matte估计,无需额外辅助输入,平衡了抠图性能与自动化之间的矛盾。


论文题目:

In-Context Matting

作者列表:

He Guo (Huazhong University of Science and Technology), Zixuan Ye (Huazhong University of Science and Technology), Zhiguo Cao (Huazhong University of Science and Technology), Hao Lu (Huazhong University of Science and Technology)


B站观看网址:

https://www.bilibili.com/video/BV1zio7YhEt2/


论文摘要:

We introduce in-context matting, a novel task setting of image matting. Given a reference image of a certain foreground and guided priors such as points, scribbles, and masks, in-context matting enables automatic alpha estimation on a batch of target images of the same foreground category, without additional auxiliary input.

This setting marries good performance in auxiliary input-based matting and ease of use in automatic matting, which finds a good trade-off between customization and automation.

To overcome the key challenge of accurate foreground matching, we introduce IconMatting, an in-context matting model built upon a pre-trained text-to-image diffusion model. Conditioned on inter- and intra-similarity matching, IconMatting can make full use of reference context to generate accurate target alpha mattes. To benchmark the task, we also introduce a novel testing dataset ICM-57, covering 57 groups of real-world images.

Quantitative and qualitative results on the ICM-57 testing set show that IconMatting rivals the accuracy of trimap-based matting while retaining the automation level akin to automatic matting.

 

参考文献:

[1] He Guo, Zixuan Ye, Zhiguo Cao, Hao Lu, “In-Context Matting,” in Proceeding of IEEE Computer Vision and Pattern Recognition (CVPR 2024), Seattle, WA, June 2024.


论文链接:

https://openaccess.thecvf.com/content/CVPR2024/papers/Guo_In-Context_Matting_CVPR_2024_paper.pdf


视频讲者简介:

He Guo is a master student at Huazhong University of Science and Technology. He received his B.S. degree from the School of Artificial Intelligence and Automation at Huazhong University of Science and Technology. His research interests include computer vision. Recently, his research is focused on the application of generative models.

 

个人主页:

https://github.com/guohe369



特别鸣谢本次论文速览主要组织者:

月度轮值AC:杨旭 (西安电子科技大学)

小黑屋|手机版|Archiver|Vision And Learning SEminar

GMT+8, 2025-10-14 13:31 , Processed in 0.013140 second(s), 14 queries .

Powered by Discuz! X3.4

Copyright © 2001-2020, Tencent Cloud.

返回顶部