为了使得视觉与学习领域相关从业者快速及时地了解领域的最新发展动态和前沿技术进展,VALSE最新推出了《论文速览》栏目,将在每周发布一至两篇顶会顶刊论文的录制视频,对单个前沿工作进行细致讲解。本期VALSE论文速览分析了艺术图像编辑领域中现有工作面临复杂语义难以提取以及跨域伪影的问题,提出了使用无语义信息作为引导,通过区域搬运的策略来实现自监督图像生成。 论文题目: Learning to Manipulate Artistic Images 作者列表: 郭炜、张瑜琦、马德、郑乾 B站观看网址: https://www.bilibili.com/video/BV1fSxNeNEcC/?spm_id_from=333.999.0.0&vd_source=2044ac986b5caa4c8b5f00b525441df3 复制链接到浏览器打开或点击阅读原文即可跳转至观看页面。 论文摘要: Recent advancement in computer vision has significantly lowered the barriers to artistic creation. Exemplar-based image translation methods have attracted much attention due to flexibility and controllability. However, these methods hold assumptions regarding semantics or require semantic information as the input, while accurate semantics is not easy to obtain in artistic images. Besides, these methods suffer from cross-domain artifacts due to training data prior and generate imprecise structure due to feature compression in the spatial domain. In this paper, we propose an arbitrary Style Image Manipulation Network (SIM-Net), which leverages semantic-free information as guidance and a region transportation strategy in a self-supervised manner for image generation. Our method balances computational efficiency and high resolution to a certain extent. Moreover, our method facilitates zero-shot style image manipulation. Both qualitative and quantitative experiments demonstrate the superiority of our method over state-of-the-art methods. 参考文献: [1] Guo W, Zhang Y, Ma D, et al. Learning to Manipulate Artistic Images[J]. arXiv preprint arXiv:2401.13976, 2024. 论文链接: [https://arxiv.org/abs/2401.13976] 代码链接: [https://github.com/SnailForce/SIM-Net] 视频讲者简介: Wei Guo is currently a second-year Ph.D. at the College of Computer Science, Zhejiang University. He obtained his bachelor's and master's degrees from Northwestern Polytechnical University in 2019 and 2022, respectively. His research interests span multiple directions in computer vision, primarily including image detection and image generation. He is currently focused on learning from 2D vision and textual data and applying it to tasks in 3D vision. He has completed an internship at ByteDance. 特别鸣谢本次论文速览主要组织者: 月度轮值AC:于茜 (北京航空航天大学) 活动参与方式 1、VALSE每周举行的Webinar活动依托B站直播平台进行,欢迎在B站搜索VALSE_Webinar关注我们! 直播地址: https://live.bilibili.com/22300737; 历史视频观看地址: https://space.bilibili.com/562085182/ 2、VALSE Webinar活动通常每周三晚上20:00进行,但偶尔会因为讲者时区问题略有调整,为方便您参加活动,请关注VALSE微信公众号:valse_wechat 或加入VALSE QQ T群,群号:863867505); *注:申请加入VALSE QQ群时需验证姓名、单位和身份,缺一不可。入群后,请实名,姓名身份单位。身份:学校及科研单位人员T;企业研发I;博士D;硕士M。 3、VALSE微信公众号一般会在每周四发布下一周Webinar报告的通知。 4、您也可以通过访问VALSE主页:http://valser.org/ 直接查看Webinar活动信息。Webinar报告的PPT(经讲者允许后),会在VALSE官网每期报告通知的最下方更新。 |
小黑屋|手机版|Archiver|Vision And Learning SEminar
GMT+8, 2025-1-31 20:42 , Processed in 0.012795 second(s), 14 queries .
Powered by Discuz! X3.4
Copyright © 2001-2020, Tencent Cloud.