TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings Paper • 2406.15586 • Published 5 days ago • 2
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Paper • 2406.14563 • Published 6 days ago • 26
nabla^2DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials Paper • 2406.14347 • Published 6 days ago • 93
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains Paper • 2406.12045 • Published 9 days ago • 4
Performant Models Collection My top performing models based off my own testing, user feedback and benches. This are the models I would recommend you using. • 8 items • Updated 11 days ago • 18
Merging Improves Self-Critique Against Jailbreak Attacks Paper • 2406.07188 • Published 15 days ago • 3
Multimodal Models 🔀 Collection A collection of multimodal models developed by the Komorebi AI team • 2 items • Updated 8 days ago • 2
Aligning to Thousands of Preferences via System Message Generalization Paper • 2405.17977 • Published 29 days ago • 6
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell • Apr 28 • 35
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • May 7 • 7
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29 • 67
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 22 days ago • 63
Configurable Safety Tuning ⚙️ Collection CST allows for configurable inference-time control of LLM safety levels, so users can dictate model behavior based on the system prompt • 7 items • Updated 13 days ago • 2
Configurable Safety Tuning of Language Models with Synthetic Preference Data Paper • 2404.00495 • Published Mar 30 • 2
Quantized Models (GGUF, IQ, Imatrix) Collection Various quantizations of models in the GGUF format. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 81 items • Updated 3 days ago • 39
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs Paper • 2402.08005 • Published Feb 12 • 1
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models Paper • 2402.03749 • Published Feb 6 • 9
🛰️🌍 Geospatial Datasets Collection A curated collections of diverse geospatial and satellite imagery datasets. • 54 items • Updated Mar 6 • 11
Exotic Frankenmerges 🥨 Collection Merges of models of different architectures and sizes that end up working surprisingly well • 1 item • Updated 13 days ago • 1
Upscaled Models ⏫ Collection A collection of my frankenmerges, upscaling several models. All of them have the corresponding GGUF variants. • 4 items • Updated 13 days ago • 2
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated 14 days ago • 187
Distilled Self-Critique of LLMs with Synthetic Data: a Bayesian Perspective Paper • 2312.01957 • Published Dec 4, 2023 • 1
Optimised Translation Models 🌍 Collection A collection of optimised and quantised multilingual translation models • 6 items • Updated Nov 7, 2023 • 3
Fast Adaptation with Bradley-Terry Preference Models in Text-To-Image Classification and Generation Paper • 2308.07929 • Published Jul 15, 2023 • 1
Personalizing Text-to-Image Generation via Aesthetic Gradients Paper • 2209.12330 • Published Sep 25, 2022 • 1
LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition Paper • 2307.13269 • Published Jul 25, 2023 • 30