toplogo
Войти
аналитика - Computer Vision - # Image Generation with Diffusion Transformers

OminiControl: A Parameter-Efficient Framework for Integrating Image Conditions into Pre-trained Diffusion Transformer Models


Основные понятия
OminiControl is a novel, parameter-efficient framework that enables diverse image control for diffusion transformer models by leveraging a unified token processing approach and multi-modal attention, outperforming existing methods in both spatially aligned and non-spatially aligned tasks.
Аннотация
edit_icon

Настроить сводку

edit_icon

Переписать с помощью ИИ

edit_icon

Создать цитаты

translate_icon

Перевести источник

visual_icon

Создать интеллект-карту

visit_icon

Перейти к источнику

Tan, Z., Liu, S., Yang, X., Xue, Q., & Wang, X. (2024). OminiControl: Minimal and Universal Control for Diffusion Transformer. arXiv preprint arXiv:2411.15098.
This paper introduces OminiControl, a novel framework designed to address the limitations of existing image conditioning methods for diffusion models, particularly in terms of parameter efficiency and the ability to handle both spatially aligned and non-spatially aligned tasks within a unified architecture.

Ключевые выводы из

by Zhenxiong Ta... в arxiv.org 11-25-2024

https://arxiv.org/pdf/2411.15098.pdf
OminiControl: Minimal and Universal Control for Diffusion Transformer

Дополнительные вопросы

0
star