toplogo
Đăng nhập
thông tin chi tiết - Computer Vision - # Image Generation with Diffusion Transformers

OminiControl: A Parameter-Efficient Framework for Integrating Image Conditions into Pre-trained Diffusion Transformer Models


Khái niệm cốt lõi
OminiControl is a novel, parameter-efficient framework that enables diverse image control for diffusion transformer models by leveraging a unified token processing approach and multi-modal attention, outperforming existing methods in both spatially aligned and non-spatially aligned tasks.
Tóm tắt
edit_icon

Tùy Chỉnh Tóm Tắt

edit_icon

Viết Lại Với AI

edit_icon

Tạo Trích Dẫn

translate_icon

Dịch Nguồn

visual_icon

Tạo sơ đồ tư duy

visit_icon

Xem Nguồn

Tan, Z., Liu, S., Yang, X., Xue, Q., & Wang, X. (2024). OminiControl: Minimal and Universal Control for Diffusion Transformer. arXiv preprint arXiv:2411.15098.
This paper introduces OminiControl, a novel framework designed to address the limitations of existing image conditioning methods for diffusion models, particularly in terms of parameter efficiency and the ability to handle both spatially aligned and non-spatially aligned tasks within a unified architecture.

Thông tin chi tiết chính được chắt lọc từ

by Zhenxiong Ta... lúc arxiv.org 11-25-2024

https://arxiv.org/pdf/2411.15098.pdf
OminiControl: Minimal and Universal Control for Diffusion Transformer

Yêu cầu sâu hơn

0
star