Richrich

RichardForests

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

thenlper/gte-base

liked a model 13 days ago

nvidia/NVLM-D-72B

liked a model 14 days ago

PatronusAI/glider

RichardForests's activity

upvoted an article 17 days ago

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 87

upvoted a collection 19 days ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 15 days ago • 45

upvoted a paper 22 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published 23 days ago • 17

upvoted 3 articles 27 days ago

Better RAG 3: The text is your friend

Article • Mar 14, 2024 • 6

Better RAG 2: Single-shot is not good enough

Article • Mar 14, 2024 • 10

Better RAG 1: Advanced Basics

Article • Mar 14, 2024 • 19

upvoted a paper about 1 month ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 47

upvoted 2 collections about 1 month ago

MoE_Papers

Collection

4 items • Updated 9 days ago • 1

LLM

Collection

31 items • Updated 8 days ago • 1

upvoted 2 papers about 1 month ago

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

Paper • 2402.10210 • Published Feb 15, 2024 • 32

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Paper • 2402.05099 • Published Feb 7, 2024 • 19

upvoted an article about 1 month ago

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Article • May 7, 2024 • 42

upvoted a paper about 2 months ago

The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs

Paper • 2210.14986 • Published Oct 26, 2022 • 5

upvoted a paper 5 months ago

KAN or MLP: A Fairer Comparison

Paper • 2407.16674 • Published Jul 23, 2024 • 42

upvoted an article 6 months ago

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Article • May 24, 2023 • 104

upvoted 5 papers 7 months ago

Sampling 3D Gaussian Scenes in Seconds with Latent Diffusion Models

Paper • 2406.13099 • Published Jun 18, 2024 • 4

ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning

Paper • 2406.14130 • Published Jun 20, 2024 • 10

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 18

GLiNER multi-task: Generalist Lightweight Model for Various Information Extraction Tasks

Paper • 2406.12925 • Published Jun 14, 2024 • 23

Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities

Paper • 2406.14562 • Published Jun 20, 2024 • 27