Joshua Chak

JoshuaChak

AI & ML interests

None yet

Recent Activity

liked a Space about 19 hours ago

HuggingFaceH4/blogpost-scaling-test-time-compute

new activity about 19 hours ago

vidore/colqwen2-v1.0: what is the processor?

liked a dataset 1 day ago

google-research-datasets/go_emotions

JoshuaChak's activity

upvoted a paper 1 day ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 3 days ago • 83

upvoted 4 papers about 2 months ago

Task Vectors are Cross-Modal

Paper • 2410.22330 • Published Oct 29 • 11

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17 • 89

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Paper • 2410.10812 • Published Oct 14 • 15

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published Oct 22 • 14

upvoted 2 papers 2 months ago

Autonomous Character-Scene Interaction Synthesis from Text Instruction

Paper • 2410.03187 • Published Oct 4 • 7

Presto! Distilling Steps and Layers for Accelerating Music Generation

Paper • 2410.05167 • Published Oct 7 • 15

upvoted a paper 5 months ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5 • 27

upvoted 3 papers 6 months ago

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5 • 60

Transformers meet Neural Algorithmic Reasoners

Paper • 2406.09308 • Published Jun 13 • 43

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Paper • 2406.07522 • Published Jun 11 • 37

upvoted an article 7 months ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Jun 20 • 26

upvoted 3 papers 7 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16 • 126

Many-Shot In-Context Learning in Multimodal Foundation Models

Paper • 2405.09798 • Published May 16 • 26

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 87