12 23 21

Justin Zhao PRO

justinxzhao

AI & ML interests

None yet

Recent Activity

updated a Space 4 days ago

llm-council/alpaca-eval-explorer

updated a Space 4 days ago

llm-council/emotional-intelligence-arena

updated a Space 4 days ago

justinxzhao/name_recommender

View all activity

Organizations

justinxzhao's activity

upvoted a paper about 1 month ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 273

upvoted 2 papers 4 months ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 83

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 170

upvoted a paper 5 months ago

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 37

upvoted 2 papers 6 months ago

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44

upvoted an article 6 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 77

upvoted a paper 6 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

upvoted an article 7 months ago

Article

The Rise of Agentic Data Generation

•

Jul 15, 2024

• 81

upvoted a paper 7 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

upvoted an article 7 months ago

Article

MMLU-Pro-NoMath

•

Jul 11, 2024

• 4

upvoted an article 8 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 64

upvoted a paper 8 months ago

Language Model Council: Benchmarking Foundation Models on Highly Subjective Tasks by Consensus

Paper • 2406.08598 • Published Jun 12, 2024 • 6

upvoted a collection 8 months ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated Nov 28, 2024 • 207

upvoted a paper 8 months ago

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 58

upvoted 2 papers 10 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

upvoted a paper 11 months ago

QLoRA: Efficient Finetuning of Quantized LLMs

Paper • 2305.14314 • Published May 23, 2023 • 49

upvoted a paper 12 months ago

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 58

upvoted a paper about 1 year ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69