Satyam's picture

Satyam

satyamt

·

AI & ML interests

Biotechnology

Recent Activity

liked a model 2 days ago

unsloth/DeepSeek-V3-GGUF

upvoted a paper 12 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

liked a model 15 days ago

deepseek-ai/DeepSeek-V3

View all activity

Organizations

satyamt's activity

upvoted a paper 12 days ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 16 days ago • 89

upvoted a paper 17 days ago

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 21 days ago • 22

upvoted 2 papers 18 days ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 18 days ago • 29

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 73

upvoted 2 collections 3 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 4 days ago • 150

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 4 days ago • 292

upvoted an article 4 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

By

•

Jul 5, 2024

• 186

upvoted a paper 4 months ago

Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming

Paper • 2408.16725 • Published Aug 29, 2024 • 53

upvoted an article 5 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 232

upvoted a paper 5 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 53

upvoted a collection 5 months ago

PaliGemma Release

Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 28 days ago • 143

upvoted an article 5 months ago

Article

Constitutional AI with Open LLMs

Feb 1, 2024

• 13

upvoted a collection 5 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 36

upvoted an article 5 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29, 2024

• 27

upvoted an article 6 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 182

upvoted 2 papers 7 months ago

GenQA: Generating Millions of Instructions from a Handful of Prompts

Paper • 2406.10323 • Published Jun 14, 2024 • 5

Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

Paper • 2406.00888 • Published Jun 2, 2024 • 31

upvoted a collection 7 months ago

sentence-transformers-from-synthetic-data

Example of using distilabel to generate synthetic triplets data for fine-tuning a Sentence Transformer model • 4 items • Updated Jun 21, 2024 • 22

upvoted a paper 8 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 37

upvoted an article 8 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

• 72