3 34 19

quyettv

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

upstage/open-ko-llm-leaderboard

upvoted a paper 8 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

reacted to cfahlgren1's post with 🚀 17 days ago

The https://huggingface.co/deepseek-ai/DeepSeek-V3 is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page. You can play with it here: https://deepseek-artifacts.vercel.app All the responses get saved in the https://huggingface.co/datasets/cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest quality frontend datasets on the hub 💪

View all activity

Organizations

None yet

quyettv's activity

upvoted a paper 8 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 9 days ago • 228

upvoted a collection 22 days ago

DeepSeek-V3

Collection

3 items • Updated 12 days ago • 119

upvoted a paper 25 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 29 days ago • 340

upvoted 4 papers 3 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 127

upvoted 4 papers 5 months ago

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 53

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 66

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 118

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 106

upvoted 4 papers 6 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 88

Longhorn: State Space Models are Amortized Online Learners

Paper • 2407.14207 • Published Jul 19, 2024 • 18

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1, 2024 • 86

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3, 2024 • 51

upvoted an article 7 months ago

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27, 2024

• 124

upvoted 4 papers 7 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 90

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

Paper • 2406.15927 • Published Jun 22, 2024 • 13

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 63

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 51