Oryx Collection Oryx: One Multi-Modal LLM for On-Demand Spatial-Temporal Understanding β’ 5 items β’ Updated 10 days ago β’ 8
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 β’ 11 items β’ Updated 3 days ago β’ 273
Scene123: One Prompt to 3D Scene Generation via Video-Assisted and Consistency-Enhanced MAE Paper β’ 2408.05477 β’ Published Aug 10 β’ 1
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Paper β’ 2409.07452 β’ Published 17 days ago β’ 18
Synthetic Dataset Creation Spaces Collection Spaces focused on generating synthetic datasets β’ 5 items β’ Updated 9 days ago β’ 4
Dataset Creation Tools and Utilities Collection Spaces and utilities for creating datasets and getting them on the Hub β’ 3 items β’ Updated 9 days ago β’ 7
view article Article All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes By onekq β’ 16 days ago β’ 3
π Awesome 3D AIGC Demos Collection Representative 3D AIGC Demos. #Image-to-3D #Text-to-3D β’ 22 items β’ Updated Aug 20 β’ 4
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper β’ 2408.06292 β’ Published Aug 12 β’ 114
Building and better understanding vision-language models: insights and future directions Paper β’ 2408.12637 β’ Published Aug 22 β’ 110
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation Paper β’ 2408.14819 β’ Published Aug 27 β’ 19
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper β’ 2406.18790 β’ Published Jun 26 β’ 33
Text-to-Image History Collection How Text-to-Image evolved on HF and inspired the Community β’ 50 items β’ Updated 2 days ago β’ 11
view article Article Sentiment Classification with Fully Homomorphic Encryption using Concrete ML Nov 17, 2022 β’ 3
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models May 24 β’ 21
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 β’ 62
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Paper β’ 2404.17569 β’ Published Apr 26 β’ 12
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. β’ 21 items β’ Updated Apr 26 β’ 23
β UI is a good thing π β Collection cool spaces with a cool UI, what could be better? β’ 5 items β’ Updated Jun 18 β’ 13
view article Article SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model By xingxm β’ Apr 19 β’ 5
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing Paper β’ 2404.09990 β’ Published Apr 15 β’ 12
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 161
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated May 6 β’ 88
Multimodal Models Collection Multimodal models with leading performance. β’ 14 items β’ Updated 4 days ago β’ 12
HyperGraph Datasets Collection Collection of HyperGraph Datasets β’ 17 items β’ Updated Apr 4 β’ 7
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper β’ 2403.13806 β’ Published Mar 20 β’ 18
Latent Consistency Model Demos Collection Latent Consistency Models for Stable Diffusion β’ 8 items β’ Updated Nov 12, 2023 β’ 25
VLMs for 3D reconstructions and their evaluation Collection List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality β’ 11 items β’ Updated Dec 5, 2023 β’ 2
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) β’ 150 items β’ Updated 11 days ago β’ 31
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey Paper β’ 2403.01528 β’ Published Mar 3 β’ 1
TnT-LLM: Text Mining at Scale with Large Language Models Paper β’ 2403.12173 β’ Published Mar 18 β’ 19
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 264 items β’ Updated Jun 22 β’ 397
Pretrained Text-Generation Models Below 250M Parameters Collection Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. β’ 8 items β’ Updated Aug 10 β’ 7
Soft Prompts Collection Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. β’ 4 items β’ Updated Mar 22 β’ 2
based Collection These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. β’ 14 items β’ Updated May 14 β’ 8
FiT: Flexible Vision Transformer for Diffusion Model Paper β’ 2402.12376 β’ Published Feb 19 β’ 48
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper β’ 2402.11753 β’ Published Feb 19 β’ 5
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. β’ 11 items β’ Updated Apr 3 β’ 103
OWL-series π¦ Collection Models and applications of OWL-ViT and OWLv2. β’ 13 items β’ Updated Mar 11 β’ 5
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper β’ 2311.07069 β’ Published Nov 13, 2023 β’ 43
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper β’ 2311.06772 β’ Published Nov 12, 2023 β’ 34
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper β’ 2310.17994 β’ Published Oct 27, 2023 β’ 8
LP-MusicCaps: LLM-Based Pseudo Music Captioning Paper β’ 2307.16372 β’ Published Jul 31, 2023 β’ 37
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper β’ 2306.10012 β’ Published Jun 16, 2023 β’ 35