
Instruction-based image editing improves the controllability and flexibility of image manipulation through natural-language commands, without requiring elaborate descriptions or regional masks. However, human instructions are sometimes too brief for current methods to capture and follow. Multimodal large language models (MLLMs) show promising capabilities in cross-modal understanding and visual-aware response generation via LMs. We investigate how MLLMs can facilitate edit instructions and present MLLM-Guided Image Editing (MGIE). MGIE learns to derive expressive instructions and provides explicit guidance. The editing model jointly captures this visual imagination and performs the manipulation through end-to-end training. We evaluate various aspects of Photoshop-style modification, global photo optimization, and local editing. Extensive experimental results demonstrate that expressive instructions are crucial to instruction-based image editing, and that MGIE yields notable improvements in automatic metrics and human evaluation while maintaining competitive inference efficiency.
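Special notice about [ICLR’24] MGIE
The [ICLR’24] MGIE information provided by 鸟瑞导航 is sourced from the web, and the accuracy and completeness of external links are not guaranteed. The destinations of those external links are not under the control of 鸟瑞导航. When this page was indexed on September 10, 2025 at 7:04 PM, its content was lawful and compliant. If the content later violates any rules, please report it to the site administrator and it will be removed; 鸟瑞导航 assumes no liability.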