Molmo Collection Artifacts for open multimodal language models. β’ 5 items β’ Updated 2 days ago β’ 166
MagpieLM Collection Aligning LMs with Fully Open Recipe (data+training configs+logs) β’ 9 items β’ Updated 5 days ago β’ 13
view article Article Meet Yi-Coder: A Small but Mighty LLM for Code By lorinma β’ 23 days ago β’ 11
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper β’ 2408.16725 β’ Published 29 days ago β’ 50
ChatGPT-Mini Collection A collection of fine-tuned GPT-2 models each designed to deploy a ChatGPT-like model at home. These models can also be deployed on an old computer. β’ 8 items β’ Updated Nov 16, 2023 β’ 4
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 27 items β’ Updated 9 days ago β’ 467
π» Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos β’ 14 items β’ Updated Aug 20 β’ 41
view article Article Train custom AI models with the trainer API and adapt them to π€ By not-lain β’ Jun 29 β’ 33
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. β’ 39 items β’ Updated 10 days ago β’ 339
Qwen2-Audio Collection Audio-language model series based on Qwen2 β’ 4 items β’ Updated 10 days ago β’ 41
view article Article Samantha Mistral Instruct 7b - Comprehensive Bulleted Notes By cognitivetech β’ Mar 28 β’ 6
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne β’ Jul 29 β’ 204
Chocolatineπ« Collection DPO fine-tuned models Family, high performance β’ 11 items β’ Updated 15 days ago β’ 2
Brazil XL Collection Bringing Brazilian culture to the latent space β’ 12 items β’ Updated Jul 25 β’ 1
xLAM models Collection xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM β’ 9 items β’ Updated 4 days ago β’ 40
π Stable Diffusion LoRAs Collection Awesome LoRAs found on the hub - using only π΅ β’ 7 items β’ Updated Jul 23 β’ 16
Llama 3 Merges Collection Here is a collection of merged models based on Llama-3 variants to showcase the seamless compatibility of MergeKit with Llama-3 architecture. β’ 6 items β’ Updated 7 days ago β’ 4
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages May 24 β’ 24
view article Article The Great LLM Showdown: Amy's Quest for the Perfect LLM By wolfram β’ Jul 9 β’ 12
ImageGen Collection Feeling good but are relatively niche in Text2Image. β’ 4 items β’ Updated 21 days ago β’ 2
Recent highlights Collection Some recent models worth checking out β’ 15 items β’ Updated 10 days ago β’ 24
Magpie-Pro Datasets (Llama-3) Collection Dataset built with Meta Llama 3 70B. Models are fine-tuned from Llama 3 8B. β’ 6 items β’ Updated 8 days ago β’ 16
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper β’ 2406.08464 β’ Published Jun 12 β’ 61
abliterated-v3 Collection Latest gen of the abliterated models I've produced β’ 17 items β’ Updated Jun 3 β’ 90
π¦ 3D creation workflow Collection Going from a text prompt to a nice 3D model β’ 3 items β’ Updated 22 days ago β’ 29
β UI is a good thing π β Collection cool spaces with a cool UI, what could be better? β’ 5 items β’ Updated Jun 18 β’ 13
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ By merve β’ Aug 25, 2023 β’ 18
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct Paper β’ 2405.14906 β’ Published May 23 β’ 21
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram β’ Apr 24 β’ 56