ron-wolf
's Collections
Reading list
updated
No More Adam: Learning Rate Scaling at Initialization is All You Need
Paper
•
2412.11768
•
Published
•
41
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
Paper
•
2412.14161
•
Published
•
50
HiRED: Attention-Guided Token Dropping for Efficient Inference of
High-Resolution Vision-Language Models in Resource-Constrained Environments
Paper
•
2408.10945
•
Published
•
11
PDFTriage: Question Answering over Long, Structured Documents
Paper
•
2309.08872
•
Published
•
54
Compressed Chain of Thought: Efficient Reasoning Through Dense
Representations
Paper
•
2412.13171
•
Published
•
31
The Matrix Calculus You Need For Deep Learning
Paper
•
1802.01528
•
Published
A Modern Self-Referential Weight Matrix That Learns to Modify Itself
Paper
•
2202.05780
•
Published
Recurrent Memory Transformer
Paper
•
2207.06881
•
Published
•
1
How many words does ChatGPT know? The answer is ChatWords
Paper
•
2309.16777
•
Published
•
1