K - a diwank Collection

chat-ui

jondurbin/py-dpo-v0.1

Viewer • Updated Jan 11, 2024 • 9.47k • 63 • 46

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12, 2024 • 918 • 958 • 127

jondurbin/cinematika-v0.1

Viewer • Updated Apr 11, 2024 • 47.1k • 194 • 52

ParisNeo/lollms_aware_dataset

Viewer • Updated Oct 27, 2023 • 464 • 76 • 5

grimulkan/LimaRP-augmented

Viewer • Updated Jan 24, 2024 • 804 • 57 • 29

TIGER-Lab/MathInstruct

Viewer • Updated May 15, 2024 • 262k • 2.94k • 257

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 184 • 32

b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 2.5k • 427

migtissera/Synthia-v1.3

Viewer • Updated Nov 16, 2023 • 119k • 224 • 99

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8, 2024 • 385 • 3.34k • 16

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25, 2024 • 899 • 4

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25, 2024 • 1.17k • 3

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25, 2024 • 50k • 893 • 4

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25, 2024 • 702 • 3

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19, 2024 • 100 • 110 • 5

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25, 2024 • 1.05k • 5

cognitivecomputations/samantha-data

Updated Mar 29, 2024 • 414 • 126

roborovski/synthetic-tool-calls

Viewer • Updated Mar 5, 2024 • 6.01k • 41 • 1

roborovski/glaive-tool-usage-dpo

Viewer • Updated Feb 29, 2024 • 42k • 34 • 2

kalomaze/StackMix-v0.1

Viewer • Updated Feb 28, 2024 • 30 • 46 • 2

roborovski/glaive-function-calling-v2-conversation

Viewer • Updated Feb 19, 2024 • 113k • 31 • 2

mlabonne/truthy-dpo-v0.1

Viewer • Updated Feb 18, 2024 • 1.02k • 40 • 1

ai4bharat/indic-align

Viewer • Updated Jul 25, 2024 • 97.4M • 938 • 11

coseal/CodeUltraFeedback_binarized

Viewer • Updated Mar 18, 2024 • 9.5k • 782 • 15

coseal/CodeUltraFeedback

Viewer • Updated Mar 15, 2024 • 10k • 64 • 25

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 16

ai4bharat/sangraha

Viewer • Updated Oct 21, 2024 • 268M • 5.27k • 33

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3, 2024 • 10

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 64

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 186

Self-Instruct: Aligning Language Model with Self Generated Instructions

Paper • 2212.10560 • Published Dec 20, 2022 • 9

HuggingFaceH4/self-instruct-seed

Viewer • Updated Jan 31, 2023 • 175 • 39 • 27

ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 7

Dynamic Planning with a LLM

Paper • 2308.06391 • Published Aug 11, 2023 • 2

FreedomIntelligence/SocraticChat

Viewer • Updated Oct 12, 2023 • 50.7k • 51 • 6

Large Language Model as a User Simulator

Paper • 2308.11534 • Published Aug 21, 2023 • 2

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22

mlabonne/alpagasus

Viewer • Updated Aug 3, 2023 • 9.23k • 45 • 8

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

THUDM/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 324 • 204

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Paper • 2310.01557 • Published Oct 2, 2023 • 12

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 33

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Paper • 2304.11477 • Published Apr 22, 2023 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 75

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Paper • 2308.00436 • Published Aug 1, 2023 • 22

Running

533

📢

UGI Leaderboard

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Paper • 2310.16049 • Published Oct 24, 2023 • 4

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 19

allenai/UNcommonsense

Viewer • Updated Jan 19, 2024 • 18.3k • 63 • 10

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 10

Flows: Building Blocks of Reasoning and Collaborating AI

Paper • 2308.01285 • Published Aug 2, 2023 • 2

aiflows/CCFlows

Updated Dec 10, 2023 • 2

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 4

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

Paper • 2305.03268 • Published May 5, 2023 • 2

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 4

ALERT: Adapting Language Models to Reasoning Tasks

Paper • 2212.08286 • Published Dec 16, 2022 • 2

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7, 2024 • 14

Vivacem/MMIQC

Viewer • Updated Jan 20, 2024 • 2.29M • 87 • 14

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7, 2024 • 22

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6, 2024 • 18

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Paper • 2402.10466 • Published Feb 16, 2024 • 17

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Paper • 2402.02285 • Published Feb 3, 2024 • 1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 23

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 16

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27, 2024 • 18

Aman279/Locomo

Viewer • Updated Mar 7, 2024 • 35 • 5 • 1

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 47

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21, 2024 • 47

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Paper • 2402.16288 • Published Feb 26, 2024 • 1

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4, 2024 • 161k • 268 • 54

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 303 • 282

totally-not-an-llm/sharegpt-hyperfiltered-3k

Viewer • Updated Jul 13, 2023 • 3.24k • 86 • 14

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 22.5k • 570

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 7.76k • 127

dmayhem93/self-critiquing-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 41 • 1

dmayhem93/self-critiquing-critique-and-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 34 • 1

morzecrew/RefinedPersonaChat

Viewer • Updated Aug 7, 2023 • 207k • 49 • 2

beratcmn/rephrased-instruction-turkish-poems

Viewer • Updated Dec 16, 2023 • 4.96k • 42 • 4

Birchlabs/openai-prm800k-stepwise-critic

Viewer • Updated Jun 3, 2023 • 1.09M • 213 • 43

theblackcat102/evol-codealpaca-v1

Viewer • Updated Mar 10, 2024 • 111k • 1.12k • 156

meta-math/GSM8K_Backward

Viewer • Updated Nov 10, 2023 • 1.27k • 43 • 16

meta-math/MetaMathQA-40K

Viewer • Updated Nov 10, 2023 • 40k • 215 • 23

glaiveai/glaive-code-assistant-v2

Viewer • Updated Apr 4, 2024 • 215k • 46 • 44

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5, 2024 • 5

PROC2PDDL: Open-Domain Planning Representations from Texts

Paper • 2403.00092 • Published Feb 29, 2024 • 1

btan2/cappy-large

Text Classification • Updated Dec 7, 2023 • 51 • 20

VMware/open-instruct

Viewer • Updated Jul 12, 2023 • 143k • 189 • 44

QizhiPei/BioT5_finetune_dataset

Viewer • Updated Sep 2, 2024 • 33 • 217 • 6

Tensoic/gooftagoo

Viewer • Updated Mar 16, 2024 • 16.2k • 81 • 9

GenVRadmin/Aryabhatta-Orca-Maths-Hindi

Viewer • Updated Mar 18, 2024 • 200k • 37 • 3

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Paper • 2310.00280 • Published Sep 30, 2023 • 3

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 36

wangwilliamyang/wikihow

Updated Jan 18, 2024 • 8

argilla/distilabel-capybara-kto-15k-binarized

Viewer • Updated Mar 19, 2024 • 15.1k • 49 • 5

argilla/ultrafeedback-binarized-preferences-cleaned-kto

Viewer • Updated Mar 19, 2024 • 231k • 89 • 9

argilla/distilabel-intel-orca-kto

Viewer • Updated Mar 19, 2024 • 23.1k • 63 • 7

argilla/kto-mix-15k

Viewer • Updated Apr 19, 2024 • 15.3k • 68 • 13

KnutJaegersberg/dolphin_orca_clustered

Updated Sep 14, 2023 • 40 • 1

GAIR/autoj-scenario-classifier

Text Generation • Updated Oct 9, 2023 • 16 • 5

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 71

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 183

Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search

Paper • 2402.11827 • Published Feb 19, 2024 • 1

Grounding Language Model with Chunking-Free In-Context Retrieval

Paper • 2402.09760 • Published Feb 15, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19, 2024 • 16

BAAI/OPI

Preview • Updated Nov 6, 2024 • 235 • 8

internlm/Agent-FLAN

Preview • Updated Mar 20, 2024 • 74 • 68

kaist-ai/selfee-train

Viewer • Updated May 31, 2023 • 178k • 47 • 9

fabiochiu/medium-articles

Preview • Updated Jul 17, 2022 • 225 • 23

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20, 2024 • 13

voidful/MuSiQue

Preview • Updated May 20, 2023 • 7 • 4

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24, 2024 • 796k • 445

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 5.73k • 79

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 42

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 212 • 73

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 65

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16, 2024 • 254M • 463 • 6

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 32

Locutusque/OpenCerebrum-dpo

Viewer • Updated Mar 26, 2024 • 21.1k • 40 • 6

Doctor-Shotgun/theory-of-mind-dpo

Viewer • Updated Mar 14, 2024 • 539 • 39 • 16

Locutusque/arc-cot-dpo

Viewer • Updated Mar 26, 2024 • 957 • 58 • 6

fblgit/simple-math-DPO

Viewer • Updated Aug 1, 2024 • 800k • 93 • 16

KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35

Viewer • Updated Nov 18, 2023 • 943 • 32 • 14

zerolink/zsql-postgres-dpo

Viewer • Updated Feb 2, 2024 • 259k • 62 • 6

Lakera/gandalf_ignore_instructions

Viewer • Updated Oct 2, 2023 • 1k • 208 • 27

mrm8488/unnatural-instructions-full

Viewer • Updated Dec 21, 2022 • 66k • 90 • 16

NilanE/SmallParallelDocs-Ja_En-6k

Viewer • Updated Mar 5, 2024 • 6.32k • 53 • 2

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 24

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11, 2024 • 50 • 118

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21, 2024 • 21 • 208

CarperAI/openai_summarize_comparisons

Viewer • Updated Feb 27, 2023 • 260k • 1.94k • 40

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 127 • 204

ivanleomk/gpt4-chain-of-density

Preview • Updated Nov 12, 2023 • 75 • 6

AIRI-NLP/cnli_memory_extracted

Viewer • Updated Mar 22, 2024 • 8.23k • 40 • 1

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 27

openbmb/UltraInteract_sft

Viewer • Updated Apr 5, 2024 • 289k • 23.8k • 119

openbmb/UltraInteract_pair

Viewer • Updated Apr 5, 2024 • 220k • 403 • 106

openbmb/Eurus-70b-nca

Text Generation • Updated Apr 12, 2024 • 319 • 11

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 1

ai2lumos/lumos_multimodal_ground_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 45 • 1

ai2lumos/lumos_multimodal_plan_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 47 • 2

ai2lumos/lumos_complex_qa_plan_onetime

Viewer • Updated Mar 19, 2024 • 19.4k • 41 • 3

ai2lumos/lumos_complex_qa_ground_onetime

Viewer • Updated Mar 19, 2024 • 19.2k • 50 • 3

ai2lumos/lumos_complex_qa_ground_iterative

Viewer • Updated Mar 19, 2024 • 19.1k • 54 • 2

ai2lumos/lumos_unified_plan_iterative

Viewer • Updated Mar 19, 2024 • 55.4k • 35 • 2

ai2lumos/lumos_complex_qa_plan_iterative

Viewer • Updated Mar 18, 2024 • 19k • 46 • 6

ai2lumos/lumos_unified_ground_iterative

Viewer • Updated Mar 19, 2024 • 55.5k • 44 • 2

ai2lumos/lumos_web_agent_ground_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 38 • 2

ai2lumos/lumos_web_agent_plan_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 37 • 4

ai2lumos/lumos_maths_ground_iterative

Viewer • Updated Mar 18, 2024 • 19.5k • 78 • 3

ai2lumos/lumos_maths_ground_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 84 • 1

ai2lumos/lumos_maths_plan_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 44 • 2

Symbol-LLM/Symbol-LLM-7B-Instruct

Text Generation • Updated Jun 23, 2024 • 24 • 13

MoritzLaurer/deberta-v3-large-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 11, 2024 • 58k • • 86

MoritzLaurer/bge-m3-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 22, 2024 • 928k • 43

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

Pavithree/eli5

Viewer • Updated Apr 23, 2022 • 229k • 196 • 2

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23, 2024 • 1.95k • 155 • 20

paraloq/json_data_extraction

Viewer • Updated Mar 25, 2024 • 484 • 87 • 19

livecodebench/execution

Viewer • Updated Mar 12, 2024 • 479 • 157 • 4

iamtarun/python_code_instructions_18k_alpaca

Viewer • Updated Jul 27, 2023 • 18.6k • 1.87k • 262

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 25

manishiitg/CogStack-QA

Viewer • Updated Feb 9, 2024 • 24.7k • 34 • 1

manishiitg/CogStack-Tasks

Viewer • Updated Feb 9, 2024 • 4.69k • 40 • 1

manishiitg/CogStack-Conv

Viewer • Updated Feb 9, 2024 • 2.35k • 36 • 1

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19, 2024 • 16

abacusai/SystemChat-1.1

Viewer • Updated Apr 11, 2024 • 20.2k • 44 • 30

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 104

Anthropic/persuasion

Viewer • Updated Apr 9, 2024 • 3.94k • 392 • 179

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 87

M4-ai/prm_dpo_pairs

Viewer • Updated Jul 1, 2024 • 93.9k • 42 • 7

OpenLLM-France/Claire-Dialogue-French-0.1

Viewer • Updated Dec 5, 2023 • 37k • 112 • 41

amaydle/npc-dialogue

Viewer • Updated Mar 25, 2023 • 1.92k • 84 • 15

facebook/empathetic_dialogues

Updated Jan 18, 2024 • 1.16k • 95

Salesforce/dialogstudio

Updated Jul 21, 2024 • 381 • 218

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 60

microsoft/Taskbench

Viewer • Updated Aug 21, 2024 • 17.3k • 355 • 23

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 82

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 24

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 1.04k • 265

allenai/persona-bias

Updated Feb 5, 2024 • 39 • 11

PleIAs/YouTube-Commons

Updated Jun 26, 2024 • 613 • 324

FreedomIntelligence/evol-instruct-hindi

Viewer • Updated Aug 6, 2023 • 59k • 6 • 2

FreedomIntelligence/OVM-process

Viewer • Updated Apr 1, 2024 • 7.47k • 35 • 1

nuprl/CanItEdit

Viewer • Updated Mar 19, 2024 • 105 • 539 • 12

totally-not-an-llm/EverythingLM-data-V3

Viewer • Updated Sep 11, 2023 • 1.07k • 39 • 31

RUCAIBox/Story-Generation

Updated Mar 3, 2023 • 40 • 12

fabraz/writingPromptAug

Viewer • Updated Oct 14, 2023 • 24.1k • 87 • 2

jerryjalapeno/nart-100k-synthetic

Viewer • Updated Jul 16, 2023 • 99.1k • 74 • 40

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 214k • 34

euclaise/ReMask-3B

Text Generation • Updated Aug 10, 2024 • 109 • 15

google/Synthetic-Persona-Chat

Viewer • Updated Mar 1, 2024 • 10.9k • 1.89k • 79

google/cvss

Updated Feb 10, 2024 • 103 • 13

neural-bridge/rag-dataset-12000

Viewer • Updated Feb 5, 2024 • 12k • 1.53k • 124

HannahRoseKirk/prism-alignment

Viewer • Updated Apr 25, 2024 • 77.9k • 1.07k • 78

Gigax/NPC-LLM-3_8B

Text Generation • Updated May 14, 2024 • 299 • 24

nuprl/MultiPL-T

Viewer • Updated Aug 20, 2024 • 215k • 72 • 7

cognitivecomputations/SystemChat-1.2

Viewer • Updated Apr 30, 2024 • 52 • 57 • 6

mlabonne/arena-preferences

Viewer • Updated Apr 27, 2024 • 2.69k • 51 • 9

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12, 2024 • 12

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15, 2024 • 3

yutaozhu94/INTERS

Preview • Updated Feb 19, 2024 • 750 • 12

THUDM/CogAgent

Updated Dec 18, 2023 • 16

urchade/gliner_large-v2.1

Token Classification • Updated Apr 10, 2024 • 27.1k • 29

shachardon/ShareLM

Viewer • Updated Aug 6, 2024 • 331k • 175 • 29

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4, 2024 • 442k • 897 • 163

lightblue/tagengo-gpt4

Viewer • Updated Jun 2, 2024 • 78.1k • 50 • 61

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 2.25k • 30

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 5.47k • 62

glaiveai/glaive-code-assistant-v3

Viewer • Updated May 20, 2024 • 950k • 424 • 46

davanstrien/cosmochat

Viewer • Updated May 10, 2024 • 199 • 50 • 11

davanstrien/cosmopedia_chat

Viewer • Updated Mar 8, 2024 • 1.19k • 36 • 7

MemGPT/MSC-Self-Instruct

Viewer • Updated Nov 2, 2023 • 500 • 44 • 11

MemGPT/qa_data

Viewer • Updated Feb 6, 2024 • 18.6k • 8 • 1

google/imageinwords

Updated May 25, 2024 • 134 • 117

grammarly/coedit

Viewer • Updated Oct 21, 2023 • 70.8k • 898 • 67

bea2019st/wi_locness

Updated Jan 18, 2024 • 170 • 14

GEM/FairytaleQA

Viewer • Updated Oct 25, 2022 • 10.6k • 274 • 8

grammarly/medit

Viewer • Updated Oct 1, 2024 • 113k • 85 • 13

MemGPT/MemGPT-DPO-Dataset

Viewer • Updated Apr 18, 2024 • 42.3k • 45 • 9

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 869 • 138

princeton-nlp/QuRating-GPT3.5-Judgments

Viewer • Updated Mar 29, 2024 • 250k • 38 • 5

princeton-nlp/AutoCompressor-Llama-2-7b-6k

Updated Nov 22, 2023 • 504 • 2

H-D-T/Select-Stack

Viewer • Updated Sep 2, 2024 • 1.46M • 37 • 16

EleutherAI/lichess-puzzles

Viewer • Updated May 9, 2024 • 1.48M • 52 • 20

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 97 • 68

community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24, 2024 • 1.46M • 1.47k • 54

TIGER-Lab/MMLU-Pro

Viewer • Updated Nov 27, 2024 • 12.1k • 35.7k • 303

ylacombe/expresso

Viewer • Updated Apr 30, 2024 • 11.6k • 209 • 32

microsoft/MeetingBank-QA-Summary

Viewer • Updated May 16, 2024 • 862 • 50 • 12

microsoft/MeetingBank-LLMCompressed

Viewer • Updated May 16, 2024 • 5.17k • 113 • 15

nvidia/ChatRAG-Bench

Viewer • Updated May 24, 2024 • 34.6k • 1.3k • 103

xingyaoww/code-act

Viewer • Updated Feb 5, 2024 • 78.4k • 106 • 51

kaist-ai/Multifaceted-Collection-ORPO

Viewer • Updated Jul 1, 2024 • 64.6k • 44 • 10

Alibaba-NLP/gte-Qwen2-7B-instruct

hwjiang/Real3D

Image-to-3D • Updated Jun 14, 2024 • 24 • 15

nvidia/Aegis-AI-Content-Safety-Dataset-1.0

Viewer • Updated Jun 28, 2024 • 12k • 505 • 47

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated Oct 30, 2024 • 1.52k • 120

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 28

facebook/multi-token-prediction

Updated Jun 18, 2024 • 352

TIGER-Lab/M-BEIR

Viewer • Updated Aug 7, 2024 • 2.86M • 614 • 15

tomg-group-umd/pixelprose

Viewer • Updated Jun 23, 2024 • 15.6M • 418 • 134

mit-han-lab/ShareGPT4V

Preview • Updated Feb 22, 2024 • 42 • 3

mit-han-lab/litepose

Updated Jun 5, 2024 • 1

mit-han-lab/Llama-3-8B-Instruct-QServe-g128

Text Generation • Updated May 6, 2024 • 16 • 1

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12, 2024 • 2.15k • 80

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • Updated 24 days ago • 1.61k • 79

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated 15 days ago • 2.21k • 405

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated 15 days ago • 319 • 60

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Sep 25, 2024 • 28.7k • 1.38k

microsoft/Florence-2-large

Image-Text-to-Text • Updated 24 days ago • 397k • 1.32k

llava-hf/LLaVA-NeXT-Video-7B-DPO-hf

Video-Text-to-Text • Updated Nov 22, 2024 • 430 • 9

arcee-ai/BAAI-Infinity-Instruct-System

Viewer • Updated Jun 24, 2024 • 2.36M • 112 • 15

hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17, 2024 • 181k • 54

hpcai-tech/OpenSora-STDiT-v3

Updated Jun 17, 2024 • 83.6k • 42

liuqi6777/RankGPT-msmarco-100k-clean

Viewer • Updated Feb 6, 2024 • 87.3k • 36 • 1

failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5

Text Generation • Updated May 30, 2024 • 3.45k • 40

ResplendentAI/NSFW_RP_Format_DPO

Viewer • Updated Mar 17, 2024 • 400 • 76 • 62

microsoft/msr_text_compression

Updated Jan 18, 2024 • 85 • 8

microsoft/msr_sqa

Updated Jan 18, 2024 • 85 • 4

microsoft/crd3

Updated Jan 18, 2024 • 210 • 23

nvidia/domain-classifier

Updated 23 days ago • 44.8k • 67

jhu-clsp/FollowIR-train

Viewer • Updated Mar 25, 2024 • 1.78k • 69 • 5

vicgalle/Phudge-3

Text Classification • Updated May 30, 2024 • 26 • 3

TWO/sutra-mlt256-v2

Updated May 24, 2024 • 9

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Paper • 2406.19251 • Published Jun 27, 2024 • 8

aiana94/xMINDlarge

Viewer • Updated Oct 25, 2024 • 4.12M • 255 • 4

OpenCo7/UpVoteWeb

Viewer • Updated Jul 17, 2024 • 557M • 257 • 93

davanstrien/magpie-preference

Viewer • Updated 10 days ago • 503 • 506 • 12

FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 2.99k • 195

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6, 2024 • 8.79k • 72 • 13

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 80.6k • 185

dell-research-harvard/newswire

Viewer • Updated Jul 2, 2024 • 1.44M • 264 • 70

alexshengzhili/SciGraphQA-295K-train

Viewer • Updated Aug 8, 2023 • 296k • 117 • 11

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 67.8k • 1.21k

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 9

laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29, 2024 • 331k • 10 • 8

laion/datacomp-hq

Viewer • Updated Mar 13, 2024 • 20.7M • 23 • 11

laion/Subjects-for-curricular

Viewer • Updated Dec 20, 2023 • 3.99M • 38 • 5

laion/strategic_game_maze

Viewer • Updated Oct 20, 2023 • 345M • 4.87k • 11

mlabonne/llmtwin

Viewer • Updated Aug 27, 2024 • 3.34k • 112 • 8

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 42

dunzhang/stella_en_400M_v5

dunzhang/stella_en_1.5B_v5

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20, 2024 • 66 • 44

agentsea/wave-ui-25k

Viewer • Updated Jul 3, 2024 • 25k • 161 • 18

TencentARC/StoryStream

Preview • Updated Jul 17, 2024 • 272 • 25

apple/DCLM-7B

Updated Jul 26, 2024 • 940 • 828

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 51.6k • 271

HuggingFaceTB/bisac-topics

Viewer • Updated Apr 3, 2024 • 5.5k • 6 • 2

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15, 2024 • 7

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated Nov 6, 2024 • 3.45M • 271

TencentARC/SEED-Story

Text-to-Image • Updated Aug 26, 2024 • 30 • 26

xlangai/BRIGHT

Viewer • Updated Nov 18, 2024 • 1.35M • 1.51k • 19

glaiveai/RAG-v1

Viewer • Updated Jun 25, 2024 • 51.4k • 178 • 68

QuietImpostor/Claude-3-Opus-Claude-3.5-Sonnnet-9k

Viewer • Updated Jun 30, 2024 • 9.94k • 70 • 19

PawanKrd/gpt-4o-200k

Viewer • Updated Jun 29, 2024 • 200k • 69 • 24

kalomaze/Opus_Instruct_3k

Viewer • Updated Jul 19, 2024 • 2.95k • 59 • 24

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

fireworks-ai/llama-3-firefunction-v2

Text Generation • Updated Jun 18, 2024 • 211 • 139

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

cognitivecomputations/SystemChat-2.0

Preview • Updated May 31, 2024 • 73 • 54

CollectiveCognition/chats-data-2023-10-16

Viewer • Updated Oct 16, 2023 • 200 • 41 • 21

Izazk/Sequence-of-action-prediction-mind2web

Viewer • Updated Feb 22, 2024 • 68.9k • 43 • 3

BigAction/mind2web_clean

Viewer • Updated Apr 25, 2024 • 199 • 51 • 4

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 354 • 94

magicgh/MT-Mind2Web

Viewer • Updated Feb 23, 2024 • 259 • 59 • 2

TencentARC/PhotoMaker-V2

Text-to-Image • Updated Jul 22, 2024 • 19.1k • 126

KevSun/Personality_LM

Text Classification • Updated Jul 29, 2024 • 2.64k • 17

Running

241

♾️📚

Infinite Dataset Hub

Search and save datasets generated with a LLM in real time

chargoddard/SlimOrcaDedupCleaned-Sonnet3.5-DPO

Viewer • Updated Jul 23, 2024 • 168k • 41 • 7

nvidia/Minitron-8B-Base

Updated Aug 20, 2024 • 22 • 63

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 147k • 80

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19, 2024 • 5.6M • 1.47k • 49

mlfoundations/MINT-1T-PDF-CC-2024-18

Updated Sep 19, 2024 • 4.12k • 19

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 569 • 70

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2, 2024 • 11

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 12.2k • 139

LiruiZhao/Diffree

Image-to-Image • Updated Jul 29, 2024 • 16 • 18

BAAI/bge-multilingual-gemma2

Feature Extraction • Updated Jul 31, 2024 • 133k • 156

BAAI/bge-reranker-v2.5-gemma2-lightweight

Text Classification • Updated Sep 6, 2024 • 1.71k • 43

BAAI/IndustryCorpus

Viewer • Updated Jul 23, 2024 • 595M • 6.28k • 51

jspringer/echo-mistral-7b-instruct-lasttoken

Feature Extraction • Updated Feb 26, 2024 • 302 • 6

BAAI/bge-en-icl

Feature Extraction • Updated Sep 25, 2024 • 29.5k • 114

AlekseyKorshuk/full_user_edit_responses-clean

Viewer • Updated Mar 30, 2023 • 364k • 32 • 1

m-a-p/MMRA

Viewer • Updated Jul 31, 2024 • 1.02k • 78 • 13

m-a-p/II-Bench

Viewer • Updated Jun 29, 2024 • 1.43k • 198 • 9

BEE-spoke-data/fineweb-1000_64k

Viewer • Updated Jun 23, 2024 • 2k • 46 • 4

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated Sep 18, 2024 • 1.95k • 185

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.19M • • 7.65k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 724k • • 3.14k

numind/NuExtract

Text Generation • Updated Oct 17, 2024 • 1.46k • 213

numind/NuSentiment-multilingual

Feature Extraction • Updated Jan 26, 2024 • 88.3k • 11

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated about 1 month ago • 14k • 258

aipicasso/megalith-10m-florence2

Viewer • Updated Jul 31, 2024 • 9.14M • 40 • 23

ZhengPeng7/BiRefNet

Image Segmentation • Updated 14 days ago • 834k • 285

nvidia/quality-classifier-deberta

Updated Aug 6, 2024 • 1.62k • 52

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

tiiuae/falcon-mamba-7b-4bit

Text Generation • Updated Oct 10, 2024 • 73 • 11

nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 78 • 102

THUDM/LongWriter-6k

Viewer • Updated Aug 14, 2024 • 6k • 293 • 170

anthracite-org/Stheno-Data-Filtered

Viewer • Updated Aug 18, 2024 • 31.1k • 31 • 14

anthracite-org/kalo-opus-instruct-22k-no-refusal

Viewer • Updated Aug 13, 2024 • 22.3k • 146 • 23

anthracite-org/nopm_claude_writing_fixed

Viewer • Updated Aug 18, 2024 • 6.35k • 93 • 10

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26, 2024 • 264k • 629

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated Oct 24, 2024 • 40.8k • 543

fal/AuraFace-v1

Updated Aug 26, 2024 • 77

NexaAIDev/Squid

Updated Sep 3, 2024 • 20 • 33

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 42

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated Aug 17, 2024 • 2.38k • 662 • 85

NousResearch/hermes-function-calling-v1

Viewer • Updated Aug 30, 2024 • 11.6k • 739 • 226

multimodalart/product-design

Text-to-Image • Updated Sep 22, 2024 • 951 • • 31

novateur/WavTokenizer

Text-to-Speech • Updated Dec 2, 2024 • 46

facebook/sapiens

Updated Sep 20, 2024 • 146 • 226

Shakker-Labs/AWPortrait-FL

Text-to-Image • Updated Sep 5, 2024 • 166k • 427

sequelbox/Supernova

Viewer • Updated Sep 27, 2024 • 178k • 137 • 8

Running

539

🖼💬

Vision Arena (Testing VLMs side-by-side)

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 714 • 1.71k

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated 22 days ago • 11.8k • 681

deepseek-ai/ESFT-vanilla-lite

Text Generation • Updated Jul 23, 2024 • 29 • 8

yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6, 2024 • 543 • 129

gabrielmbmb/distilabel-reflection-tuning

Viewer • Updated Sep 6, 2024 • 5 • 42 • 55

TencentARC/Open-MAGVIT2

Image Feature Extraction • Updated Sep 9, 2024 • 12

openbmb/MiniCPM3-4B

Text Generation • Updated Nov 30, 2024 • 43.1k • 396

THUDM/LongCite-glm4-9b

Text Generation • Updated 17 days ago • 528 • 30

jinaai/reader-lm-1.5b

Text Generation • Updated Sep 20, 2024 • 13.2k • 501

Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Sep 15, 2024 • 29 • 35

tencent/DepthCrafter

Depth Estimation • Updated Sep 24, 2024 • 75k • 79

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated 7 days ago • 558

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated Sep 18, 2024 • 885k • 1.31k

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19, 2024 • 15

THUdyh/Oryx-7B

Text Generation • Updated Sep 25, 2024 • 86 • 11

THUdyh/Oryx-7B-Image

Text Generation • Updated Sep 23, 2024 • 13 • 3

THUdyh/Oryx-ViT

Image Classification • Updated Sep 23, 2024 • 5

BAAI/SegGPT

Updated Apr 21, 2023 • 17

Salesforce/fineweb_deduplicated

Viewer • Updated Sep 14, 2024 • 6.43B • 3.57k • 30

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 827 • 62

BAAI/Emu3-Gen

Any-to-Any • Updated Oct 23, 2024 • 4.26k • 199

CultriX/elitebabes-flux

Text-to-Image • Updated Sep 20, 2024 • 2.26k • • 14

RED-AIGC/StoryMaker

Text-to-Image • Updated Nov 9, 2024 • 221 • 73

google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 1.65k • 177

Anthropic/discrim-eval

Viewer • Updated Jan 5, 2024 • 18.9k • 494 • 44

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24, 2024 • 19.9k • 49

Zyphra/Zamba2-2.7B-instruct

Text Generation • Updated Oct 18, 2024 • 1.48k • 79

princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

Updated Oct 31, 2024 • 3.2k • 19

jxm/cde-small-v1

Feature Extraction • Updated Oct 30, 2024 • 16.3k • 287

PrincetonPLI/Instruct-SkillMix-SDD

Viewer • Updated Sep 9, 2024 • 8k • 47 • 5

THUDM/cogvlm2-llama3-caption

Video-Text-to-Text • Updated Sep 26, 2024 • 12.1k • 75

julien040/hacker-news-posts

Viewer • Updated Jun 6, 2023 • 4.01M • 80 • 5

princeton-nlp/Llama-3-8B-ProLong-512k-Base

Updated Oct 31, 2024 • 127 • 8

LLM360/TxT360

Preview • Updated Nov 8, 2024 • 84.1k • 217

bingbangboom/flux-waterscape

Text-to-Image • Updated Oct 10, 2024 • 102 • • 13

facebook/Self-taught-evaluator-DPO-data

Viewer • Updated Sep 30, 2024 • 57.5k • 101 • 31

facebook/layerskip-llama2-13B

Text Generation • Updated Oct 19, 2024 • 358 • 5

ibm-granite/granite-8b-code-instruct-accelerator

Updated May 29, 2024 • 18 • 1

peakji/steiner-32b-preview

Updated Oct 21, 2024 • 28 • 42

CohereForAI/aya-expanse-32b

Text Generation • Updated 27 days ago • 19.5k • 197

CohereForAI/aya-expanse-8b

Text Generation • Updated 27 days ago • 29.7k • 310

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 4

McGill-NLP/FaithDial

Viewer • Updated Feb 5, 2023 • 32.3k • 251 • 17

relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit

Updated Oct 28, 2024 • 10 • 2

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13, 2024 • 3

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 1.11k • 126

marcelbinz/Psych-101

Viewer • Updated Nov 2, 2024 • 60.1k • 173 • 39

nvidia/Nemotron-4-Mini-Hindi-4B-Base

Updated Oct 23, 2024 • 13 • 11

nvidia/Nemotron-4-Mini-Hindi-4B-Instruct

Updated Nov 15, 2024 • 46 • 17

Etched/oasis-500m

Updated Nov 4, 2024 • 265 • 432

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 29 days ago • 98.1k • 449

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Nov 24, 2024 • 111 • 537

THUDM/webrl-llama-3.1-8b

Updated Nov 6, 2024 • 56 • 3

THUDM/webrl-glm-4-9b

Updated Nov 5, 2024 • 38 • 8

hbseong/HarmAug-Guard

Text Classification • Updated Oct 14, 2024 • 1.17k • 36

BAAI/IndustryCorpus2

Viewer • Updated 16 days ago • 826M • 5.39k • 43

qq8933/OpenLongCoT-Pretrain

Viewer • Updated Oct 28, 2024 • 103k • 120 • 86

microsoft/maira-2

Text Generation • Updated Oct 21, 2024 • 4.87k • 39

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 37

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 12.1k • 407

Nexusflow/Athene-V2-Chat

Text Generation • Updated Nov 26, 2024 • 7.8k • 252

Nexusflow/Athene-V2-Agent

Text Generation • Updated Nov 21, 2024 • 1.63k • 110

numind/NuExtract-1.5-tiny

Text Generation • Updated Nov 18, 2024 • 15.5k • 15

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 49

allenai/ACE2-ERA5

Updated Nov 21, 2024 • 2

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 9

nvidia/Hymba-1.5B-Base

Text Generation • Updated 18 days ago • 3.82k • 129

AIDC-AI/Marco-o1

Text Generation • Updated Nov 23, 2024 • 10.5k • 679

allenai/Llama-3.1-Tulu-3-70B

Text Generation • Updated Nov 26, 2024 • 2.51k • 44

nachoyawn/three-million-bluesky

Viewer • Updated Nov 28, 2024 • 3.01M • 91 • 10

huihui-ai/QwQ-32B-Preview-abliterated

Text Generation • Updated Nov 28, 2024 • 1.2k • 87

data-is-better-together/open-image-preferences-v1

Viewer • Updated 24 days ago • 8.67k • 5.87k • 19

showlab/ShowUI-desktop-8K

Viewer • Updated 16 days ago • 7.5k • 1.13k • 19

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 41

nvidia/multilingual-domain-classifier

Updated 23 days ago • 3.36k • 11

TencentARC/Divot

Updated 23 days ago • 17 • 6

microsoft/RedStone

Updated 28 days ago • 553 • 27

ruliad/deepthought-8b-llama-v0.01-alpha

Text Generation • Updated 26 days ago • 35.3k • 137

TIGER-Lab/ScholarCopilot-v1

Updated 25 days ago • 51 • 3

TIGER-Lab/ScholarCopilot-Data-v1

Viewer • Updated 18 days ago • 677k • 295 • 2

facebook/sparsh-dino-base

Updated Oct 21, 2024 • 5

deepseek-ai/DeepSeek-V2.5-1210

Text Generation • Updated 22 days ago • 253k • 233

facebook/metamotivo-M-1

Updated 21 days ago • 351 • 4

deepseek-ai/DeepSeek-Prover-V1.5-RL

Updated Aug 29, 2024 • 7.32k • 39

tiiuae/Falcon3-10B-Base

Text Generation • Updated 15 days ago • 3.14k • 33

answerdotai/ModernBERT-base

Fill-Mask • Updated 7 days ago • 57.6k • 572

HuggingFaceTB/finemath

Viewer • Updated 10 days ago • 48.3M • 25.9k • 209

google/reveal

Viewer • Updated Apr 9, 2024 • 6.1k • 63 • 29