abliterated-v3 Collection Latest gen of the abliterated models I've produced • 17 items • Updated 26 days ago • 69
PowerInfer-2: Fast Large Language Model Inference on a Smartphone Paper • 2406.06282 • Published 19 days ago • 35
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees Paper • 2406.16858 • Published 5 days ago • 1
World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13 • 35
HyperAttention: Long-context Attention in Near-Linear Time Paper • 2310.05869 • Published Oct 9, 2023 • 2
Creativity Has Left the Chat: The Price of Debiasing Language Models Paper • 2406.05587 • Published 20 days ago • 1
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild Paper • 2406.04770 • Published 22 days ago • 23
The Geometry of Categorical and Hierarchical Concepts in Large Language Models Paper • 2406.01506 • Published 26 days ago • 3
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning Paper • 2312.01552 • Published Dec 4, 2023 • 27
Improving Alignment and Robustness with Short Circuiting Paper • 2406.04313 • Published 23 days ago • 1
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity Paper • 2305.13169 • Published May 22, 2023 • 3
Offline Regularised Reinforcement Learning for Large Language Models Alignment Paper • 2405.19107 • Published about 1 month ago • 12
LLMs achieve adult human performance on higher-order theory of mind tasks Paper • 2405.18870 • Published about 1 month ago • 15
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks Paper • 2402.04396 • Published Feb 6 • 1
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • 2212.09720 • Published Dec 19, 2022 • 3
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series Paper • 2405.19327 • Published about 1 month ago • 43
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization Paper • 2405.15071 • Published May 23 • 31
SimPO: Simple Preference Optimization with a Reference-Free Reward Paper • 2405.14734 • Published May 23 • 8
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29 • 116
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Paper • 2405.05904 • Published May 9 • 5
Orca 2: Teaching Small Language Models How to Reason Paper • 2311.11045 • Published Nov 18, 2023 • 69
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29 • 46
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 25 days ago • 63
Article Unleashing the Power of Logprobs in Language Models: A Practical Guide By Andyrasika • Jan 12 • 1
Article 💨 Introducing Notus: a DPO fine-tune of Zephyr with a focus on high-quality data By alvarobartt • Dec 1, 2023 • 1
Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram • Apr 24 • 51
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers Paper • 2405.10936 • Published May 17 • 1
Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities Paper • 2404.17790 • Published Apr 27 • 2
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29 • 67
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks Paper • 2404.14723 • Published Apr 23 • 9
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper • 2402.19427 • Published Feb 29 • 50
CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models Paper • 2404.08763 • Published Apr 12 • 1
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length Paper • 2404.08801 • Published Apr 12 • 62
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Paper • 2404.12318 • Published Apr 18 • 14
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18 • 36
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning Paper • 2301.09626 • Published Jan 23, 2023 • 2
InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory Paper • 2402.04617 • Published Feb 7 • 4
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence Paper • 2404.05892 • Published Apr 8 • 28
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published Apr 11 • 40
Rethinking Tokenization: Crafting Better Tokenizers for Large Language Models Paper • 2403.00417 • Published Mar 1 • 1
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models Paper • 2012.15613 • Published Dec 31, 2020 • 1
Getting the most out of your tokenizer for pre-training and domain adaptation Paper • 2402.01035 • Published Feb 1 • 1
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3 • 46
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning Paper • 2403.17919 • Published Mar 26 • 16