6 21 50

t1u1

AI & ML interests

None yet

Recent Activity

liked a model 22 days ago

katanemo/Arch-Function-3B

liked a model 23 days ago

nomic-ai/nomic-embed-text-v2-moe

liked a model 25 days ago

bartowski/simplescaling_s1.1-32B-GGUF

View all activity

Organizations

None yet

t1u1's activity

upvoted 2 papers 25 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 29 days ago • 121

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 26 days ago • 142

upvoted a collection 28 days ago

AceCoder

Collection

13 items • Updated 24 days ago • 6

upvoted a paper about 1 month ago

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Paper • 2502.01718 • Published Feb 3 • 29

upvoted an article about 1 month ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

and 4 others •

Jan 20

• 20

upvoted a paper about 1 month ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 41

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 260

upvoted 2 papers 7 months ago

ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Paper • 2408.06070 • Published Aug 12, 2024 • 53

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 159

upvoted 2 papers 9 months ago

What If We Recaption Billions of Web Images with LLaMA-3?

Paper • 2406.08478 • Published Jun 12, 2024 • 40

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Paper • 2406.08392 • Published Jun 12, 2024 • 21

upvoted a collection 11 months ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 717

upvoted 5 papers 12 months ago

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Paper • 2403.10516 • Published Mar 15, 2024 • 16

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 55

upvoted 3 papers about 1 year ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 63

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 610

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 191