That didn't take long! Nomic AI has finetuned the new ModernBERT-base encoder model into a strong embedding model for search, classification, clustering and more!
Details:
* Based on ModernBERT-base with 149M parameters.
* Outperforms both nomic-embed-text-v1 and nomic-embed-text-v1.5 on MTEB!
* Immediate FA2 and unpadding support for super-efficient inference.
* Trained with Matryoshka support, i.e. two valid output dimensionalities: 768 and 256.
* Maximum sequence length of 8192 tokens!
* Trained in two stages: unsupervised contrastive data -> high-quality labeled datasets.
* Integrated into Sentence Transformers, Transformers, LangChain, LlamaIndex, Haystack, etc.
* Apache 2.0 licensed: fully commercially permissive.
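Since the model ships with Sentence Transformers integration and Matryoshka support, a minimal usage sketch could look like the following. The model id (`nomic-ai/modernbert-embed-base`) and the `search_query:` / `search_document:` prefixes are assumptions carried over from Nomic's earlier embedding models, not details confirmed in the post:

```python
# Minimal sketch: the model id and the query/document prefixes below are
# assumptions based on Nomic's earlier nomic-embed-text conventions.
from sentence_transformers import SentenceTransformer

# truncate_dim=256 keeps only the first 256 Matryoshka dimensions;
# omit it to get the full 768-dimensional embeddings.
model = SentenceTransformer("nomic-ai/modernbert-embed-base", truncate_dim=256)

queries = ["search_query: What is Matryoshka representation learning?"]
documents = [
    "search_document: Matryoshka training nests smaller, still-useful embeddings inside the full-size vector.",
    "search_document: ModernBERT is a drop-in upgrade over the original BERT encoder.",
]

query_emb = model.encode(queries)    # shape: (1, 256)
doc_emb = model.encode(documents)    # shape: (2, 256)

# Cosine similarity between the query and each document.
print(model.similarity(query_emb, doc_emb))
```

The 256-dimensional output trades a little retrieval accuracy for a roughly 3x smaller index, which is the usual reason to use the Matryoshka truncation.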
* Iteratively sample CoTs from the model, using a mix of different search strategies. This gives you something like Stream of Search via prompting.
* Verify correctness of each CoT using GPT-4o (needed because exact match doesn't work well in medicine, where there are lots of aliases).
* Use GPT-4o to reformat the concatenated CoTs into a single stream that includes smooth transitions like "hmm, wait", etc., as one sees in o1.
* Use the resulting data for SFT & RL.
* Use sparse rewards from GPT-4o to guide RL training. They find RL gives an average ~3-point boost across medical benchmarks, and SFT on this data already gives a strong improvement.
Applying this strategy to other domains could be quite promising, provided the training data can be formulated as verifiable problems!
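For concreteness, here is a rough Python sketch of the sample-verify-rewrite loop described above. The search-strategy names and the callables for the policy model and the GPT-4o judge/rewriter are hypothetical placeholders, not the actual implementation:

```python
import random
from typing import Callable, Optional

# Illustrative search strategies mixed during sampling (names are placeholders).
SEARCH_STRATEGIES = ["explore_new_path", "backtrack", "verify_step", "correct_error"]

def build_sft_example(
    question: str,
    reference_answer: str,
    sample_cot: Callable[[str, str], str],           # policy model: (question, strategy) -> CoT
    judge_correct: Callable[[str, str, str], bool],  # GPT-4o judge; handles aliases exact match misses
    rewrite_to_stream: Callable[[list[str]], str],   # GPT-4o rewriter; adds "hmm, wait"-style transitions
    max_tries: int = 8,
) -> Optional[dict]:
    """Sample CoTs with mixed search strategies until one is judged correct,
    then merge all attempts into a single o1-style reasoning stream for SFT."""
    attempts: list[str] = []
    for _ in range(max_tries):
        strategy = random.choice(SEARCH_STRATEGIES)
        cot = sample_cot(question, strategy)
        attempts.append(cot)
        if judge_correct(question, reference_answer, cot):
            return {
                "question": question,
                "reasoning": rewrite_to_stream(attempts),
                "answer": reference_answer,
            }
    return None  # questions the model never solves are dropped
```

The same skeleton carries over to other domains: only the judge has to change, which is why verifiability of the training problems is the key requirement.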