Maziyar Panahi's picture

Maziyar Panahi PRO

MaziyarPanahi

·

AI & ML interests

Fine-Tuning, RLHF, Merging, Quantizations, Leaderboards

Recent Activity

updated a dataset about 14 hours ago

MaziyarPanahi/mimic-iii-corpus

updated a collection about 18 hours ago

updated a model about 18 hours ago

MaziyarPanahi/Codepy-Deepthink-3B-GGUF

View all activity

Organizations

MaziyarPanahi's activity

upvoted a collection 8 days ago

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated about 6 hours ago • 23

upvoted an article 9 days ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 24

upvoted a paper 12 days ago

NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Paper • 2402.15343 • Published Feb 23 • 13

upvoted a collection 12 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 12 days ago • 107

upvoted a collection 19 days ago

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 9 items • Updated Nov 28 • 58

upvoted a collection 20 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 17 items • Updated Nov 27 • 58

upvoted a collection 26 days ago

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated 26 days ago • 28

upvoted an article 26 days ago

Article

They Said It Couldn’t Be Done

By

•

26 days ago

• 76

upvoted a paper 27 days ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21 • 29

upvoted 2 collections about 1 month ago

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8 • 21

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated Nov 27 • 63

upvoted a paper about 1 month ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 36

upvoted 3 collections about 1 month ago

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 124

OpenScholar_V1

The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated Nov 22 • 30

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated Nov 27 • 29

upvoted 2 articles about 1 month ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13

• 386

Article

Halo: Open Source Health Tracking with Wearables

By

•

Nov 19

• 99

upvoted 2 papers about 2 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14 • 18

TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees

Paper • 2410.12854 • Published Oct 10 • 1

upvoted a collection about 2 months ago

Nov 15 Releases 🍂

15 items • Updated Nov 15 • 6