Rich feature hierarchies for accurate object detection and semantic segmentation Paper • 1311.2524 • Published Nov 11, 2013 • 1
DeepPose: Human Pose Estimation via Deep Neural Networks Paper • 1312.4659 • Published Dec 17, 2013 • 1
ImageNet Large Scale Visual Recognition Challenge Paper • 1409.0575 • Published Sep 1, 2014 • 8
Very Deep Convolutional Networks for Large-Scale Image Recognition Paper • 1409.1556 • Published Sep 4, 2014 • 1
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs Paper • 1412.7062 • Published Dec 22, 2014 • 1
U-Net: Convolutional Networks for Biomedical Image Segmentation Paper • 1505.04597 • Published May 18, 2015 • 8
You Only Look Once: Unified, Real-Time Object Detection Paper • 1506.02640 • Published Jun 8, 2015 • 1
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size Paper • 1602.07360 • Published Feb 24, 2016 • 1
Xception: Deep Learning with Depthwise Separable Convolutions Paper • 1610.02357 • Published Oct 7, 2016 • 1
Image-to-Image Translation with Conditional Adversarial Networks Paper • 1611.07004 • Published Nov 21, 2016 • 1
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks Paper • 1703.10593 • Published Mar 30, 2017 • 1
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications Paper • 1704.04861 • Published Apr 17, 2017 • 1
A Style-Based Generator Architecture for Generative Adversarial Networks Paper • 1812.04948 • Published Dec 12, 2018 • 2
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Paper • 1812.08008 • Published Dec 18, 2018 • 1
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks Paper • 1905.11946 • Published May 28, 2019 • 3
Deep High-Resolution Representation Learning for Visual Recognition Paper • 1908.07919 • Published Aug 20, 2019 • 2
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper • 2010.11929 • Published Oct 22, 2020 • 6
Learning Transferable Visual Models From Natural Language Supervision Paper • 2103.00020 • Published Feb 26, 2021 • 11
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows Paper • 2103.14030 • Published Mar 25, 2021 • 4
Emerging Properties in Self-Supervised Vision Transformers Paper • 2104.14294 • Published Apr 29, 2021 • 3
Elucidating the Design Space of Diffusion-Based Generative Models Paper • 2206.00364 • Published Jun 1, 2022 • 14
Score-Based Generative Modeling through Stochastic Differential Equations Paper • 2011.13456 • Published Nov 26, 2020 • 2
Mixture of Diffusers for scene composition and high resolution image generation Paper • 2302.02412 • Published Feb 5, 2023 • 1
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4 • 2
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching Paper • 2402.14167 • Published Feb 21 • 10
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers Paper • 2211.01324 • Published Nov 2, 2022 • 3
Prompt-to-Prompt Image Editing with Cross Attention Control Paper • 2208.01626 • Published Aug 2, 2022 • 2
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14 • 76
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 60
Common Diffusion Noise Schedules and Sample Steps are Flawed Paper • 2305.08891 • Published May 15, 2023 • 8
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Paper • 2312.06573 • Published Dec 11, 2023 • 1
Imagine yourself: Tuning-Free Personalized Image Generation Paper • 2409.13346 • Published Sep 20 • 68
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion Paper • 2208.01618 • Published Aug 2, 2022 • 1
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models Paper • 2409.10695 • Published Sep 16 • 2
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Paper • 1511.06434 • Published Nov 19, 2015 • 1
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network Paper • 1609.04802 • Published Sep 15, 2016 • 1
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold Paper • 2305.10973 • Published May 18, 2023 • 33
Autoregressive Image Generation without Vector Quantization Paper • 2406.11838 • Published Jun 17 • 3
CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models Paper • 2406.09368 • Published Jun 13 • 1