cyberagent/opencole-typographylmm-llava-v1.5-7b-lora Image-Text-to-Text • Updated May 9, 2024 • 33 • 6
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Paper • 2412.19712 • Published 7 days ago • 14
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Paper • 2412.18605 • Published 10 days ago • 17
SpotLight: Shadow-Guided Object Relighting via Diffusion Paper • 2411.18665 • Published Nov 27, 2024 • 3
MotiF: Making Text Count in Image Animation with Motion Focal Loss Paper • 2412.16153 • Published 14 days ago • 6
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4, 2024 • 2
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation Paper • 2412.14283 • Published 16 days ago • 3
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion Paper • 2412.14462 • Published 15 days ago • 15
FashionComposer: Compositional Fashion Image Generation Paper • 2412.14168 • Published 16 days ago • 16