Anthony Ivan S

anthonyivn

anthonyivn2

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

answerdotai/ModernBERT-large

liked a model about 19 hours ago

answerdotai/ModernBERT-base

liked a Space 3 days ago

HuggingFaceH4/blogpost-scaling-test-time-compute

View all activity

Organizations

None yet

anthonyivn's activity

upvoted a paper about 1 month ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5 • 60

upvoted an article 3 months ago

Article

Document Similarity Search with ColPali

•

Sep 21

• 47

upvoted 2 papers 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 75

upvoted a paper 4 months ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27 • 13

upvoted an article 5 months ago

Article

The Rise of Agentic Data Generation

•

Jul 15

• 78

upvoted a paper 5 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 129

upvoted a collection 6 months ago

InternLM2.5

Collection

14 items • Updated Sep 14 • 70

upvoted 3 papers 6 months ago

upvoted 2 articles 6 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13

• 383

Article

Putting RL back in RLHF

Jun 12

• 65

upvoted a paper 7 months ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30 • 21

upvoted an article 7 months ago

Article

Hugging Face on AMD Instinct MI300 GPU

May 21

• 10

upvoted 3 papers 8 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior

Paper • 2404.10198 • Published Apr 16 • 7

upvoted an article 8 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 279

upvoted a paper 8 months ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9 • 64