seonukkim
's Collections
For Content Creator
updated
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Paper
•
2305.06131
•
Published
•
2
Perpetual Humanoid Control for Real-time Simulated Avatars
Paper
•
2305.06456
•
Published
•
1
Drag Your GAN: Interactive Point-based Manipulation on the Generative
Image Manifold
Paper
•
2305.10973
•
Published
•
32
LDM3D: Latent Diffusion Model for 3D
Paper
•
2305.10853
•
Published
•
10
OpenShape: Scaling Up 3D Shape Representation Towards Open-World
Understanding
Paper
•
2305.10764
•
Published
•
6
Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D
Diffusion Probabilistic Models
Paper
•
2305.11870
•
Published
•
3
StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity
3D Avatar Generation
Paper
•
2305.19012
•
Published
•
4
AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation
Paper
•
2305.19245
•
Published
•
2
AniFaceDrawing: Anime Portrait Exploration during Your Sketching
Paper
•
2306.07476
•
Published
•
18
AvatarBooth: High-Quality and Customizable 3D Human Avatar Generation
Paper
•
2306.09864
•
Published
•
14
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based
Image Editing
Paper
•
2306.14435
•
Published
•
20
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D
and 3D Diffusion Priors
Paper
•
2306.17843
•
Published
•
43
SDXL: Improving Latent Diffusion Models for High-Resolution Image
Synthesis
Paper
•
2307.01952
•
Published
•
82
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models
Paper
•
2307.02421
•
Published
•
34
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
without Specific Tuning
Paper
•
2307.04725
•
Published
•
64
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based
Image Manipulation
Paper
•
2308.00906
•
Published
•
13
ConceptLab: Creative Generation using Diffusion Prior Constraints
Paper
•
2308.02669
•
Published
•
23
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image
Diffusion Models
Paper
•
2308.06721
•
Published
•
29
ControlMat: A Controlled Generative Approach to Material Capture
Paper
•
2309.01700
•
Published
•
13
Deep Geometrized Cartoon Line Inbetweening
Paper
•
2309.16643
•
Published
•
24
Matryoshka Diffusion Models
Paper
•
2310.15111
•
Published
•
40
FaceStudio: Put Your Face Everywhere in Seconds
Paper
•
2312.02663
•
Published
•
30
X-Adapter: Adding Universal Compatibility of Plugins for Upgraded
Diffusion Model
Paper
•
2312.02238
•
Published
•
25
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators
Paper
•
2312.03793
•
Published
•
17
VecFusion: Vector Font Generation with Diffusion
Paper
•
2312.10540
•
Published
•
21
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Paper
•
2402.13929
•
Published
•
28
Genie: Generative Interactive Environments
Paper
•
2402.15391
•
Published
•
71
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with
Audio2Video Diffusion Model under Weak Conditions
Paper
•
2402.17485
•
Published
•
189
Sora: A Review on Background, Technology, Limitations, and Opportunities
of Large Vision Models
Paper
•
2402.17177
•
Published
•
88
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in
Text-to-Image Generation
Paper
•
2402.17245
•
Published
•
10
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Paper
•
2403.05135
•
Published
•
42
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
Paper
•
2403.12706
•
Published
•
17
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of
Text-to-Image Models
Paper
•
2403.13535
•
Published
•
22
ThemeStation: Generating Theme-Aware 3D Assets from Few Exemplars
Paper
•
2403.15383
•
Published
•
13
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
Paper
•
2403.16627
•
Published
•
20
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Paper
•
2403.17694
•
Published
•
10
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image
Generation
Paper
•
2404.02733
•
Published
•
20
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
Matching
Paper
•
2404.03653
•
Published
•
33
ControlNet++: Improving Conditional Controls with Efficient Consistency
Feedback
Paper
•
2404.07987
•
Published
•
47
AniClipart: Clipart Animation with Text-to-Video Priors
Paper
•
2404.12347
•
Published
•
12
Dynamic Typography: Bringing Words to Life
Paper
•
2404.11614
•
Published
•
44
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image
Synthesis
Paper
•
2404.13686
•
Published
•
27
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Paper
•
2404.16022
•
Published
•
20
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper
•
2404.19427
•
Published
•
71
Compositional Text-to-Image Generation with Dense Blob Representations
Paper
•
2405.08246
•
Published
•
12
Toon3D: Seeing Cartoons from a New Perspective
Paper
•
2405.10320
•
Published
•
19
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Paper
•
2405.11473
•
Published
•
53
3DitScene: Editing Any Scene via Language-guided Disentangled Gaussian
Splatting
Paper
•
2405.18424
•
Published
•
7
I4VGen: Image as Stepping Stone for Text-to-Video Generation
Paper
•
2406.02230
•
Published
•
16
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
•
2406.04333
•
Published
•
36
Step-aware Preference Optimization: Aligning Preference with Denoising
Performance at Each Step
Paper
•
2406.04314
•
Published
•
27
Commonsense-T2I Challenge: Can Text-to-Image Generation Models
Understand Commonsense?
Paper
•
2406.07546
•
Published
•
8
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN
Inversion and High Quality Image Editing
Paper
•
2406.10601
•
Published
•
65
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper
•
2406.13393
•
Published
•
5
Magic Insert: Style-Aware Drag-and-Drop
Paper
•
2407.02489
•
Published
•
20
Paper
•
2407.14358
•
Published
•
23
OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any
Person
Paper
•
2407.16224
•
Published
•
25
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning
using Instruct Prompts
Paper
•
2408.03209
•
Published
•
21
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper
•
2408.04619
•
Published
•
155
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from
User's Casual Sketches
Paper
•
2408.04567
•
Published
•
24
ControlNeXt: Powerful and Efficient Control for Image and Video
Generation
Paper
•
2408.06070
•
Published
•
52
UniPortrait: A Unified Framework for Identity-Preserving Single- and
Multi-Human Image Personalization
Paper
•
2408.05939
•
Published
•
13
ZePo: Zero-Shot Portrait Stylization with Faster Sampling
Paper
•
2408.05492
•
Published
•
7
CustomCrafter: Customized Video Generation with Preserving Motion and
Concept Composition Abilities
Paper
•
2408.13239
•
Published
•
11
CSGO: Content-Style Composition in Text-to-Image Generation
Paper
•
2408.16766
•
Published
•
17
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper
•
2409.02097
•
Published
•
31
IFAdapter: Instance Feature Control for Grounded Text-to-Image
Generation
Paper
•
2409.08240
•
Published
•
18
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper
•
2409.08857
•
Published
•
30
DrawingSpinUp: 3D Animation from Single Character Drawings
Paper
•
2409.08615
•
Published
•
14
Click2Mask: Local Editing with Dynamic Mask Generation
Paper
•
2409.08272
•
Published
•
4
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
Paper
•
2409.11355
•
Published
•
28
LVCD: Reference-based Lineart Video Colorization with Diffusion Models
Paper
•
2409.12960
•
Published
•
22
StoryMaker: Towards Holistic Consistent Characters in Text-to-image
Generation
Paper
•
2409.12576
•
Published
•
15
MIMO: Controllable Character Video Synthesis with Spatial Decomposed
Modeling
Paper
•
2409.16160
•
Published
•
32
Improvements to SDXL in NovelAI Diffusion V3
Paper
•
2409.15997
•
Published
•
11
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through
Data, Reward, and Conditional Guidance Design
Paper
•
2410.05677
•
Published
•
14
Story-Adapter: A Training-free Iterative Framework for Long Story
Visualization
Paper
•
2410.06244
•
Published
•
19
TextToon: Real-Time Text Toonify Head Avatar from Single Video
Paper
•
2410.07160
•
Published
•
8
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image
Generation
Paper
•
2410.08159
•
Published
•
23
Animate-X: Universal Character Image Animation with Enhanced Motion
Representation
Paper
•
2410.10306
•
Published
•
52
MagicTailor: Component-Controllable Personalization in Text-to-Image
Diffusion Models
Paper
•
2410.13370
•
Published
•
35