Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published 28 days ago • 9 • 4
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Paper • 2410.20672 • Published Oct 28 • 6 • 3
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28 • 95
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing Paper • 2002.02925 • Published Feb 7, 2020
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting Paper • 2101.00416 • Published Jan 2, 2021
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration Paper • 2307.05300 • Published Jul 11, 2023 • 18
In-context Autoencoder for Context Compression in a Large Language Model Paper • 2307.06945 • Published Jul 13, 2023 • 27
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines Paper • 2309.17061 • Published Sep 29, 2023 • 1
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding Paper • 2401.07851 • Published Jan 15 • 1
Pay Attention to Your Tone: Introducing a New Dataset for Polite Language Rewrite Paper • 2212.10190 • Published Dec 20, 2022