Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2405.14860

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized

Multi-property Steering of Large Language Models with Dynamic Activation Composition

Paper • 2406.17563 • Published 6 days ago • 4
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP

Paper • 2406.12618 • Published 13 days ago • 4
Confidence Regulation Neurons in Language Models

Paper • 2406.16254 • Published 7 days ago • 10
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Paper • 2406.13663 • Published 12 days ago • 7

about 15 hours ago

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

Paper • 2402.08714 • Published Feb 13 • 10
Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15 • 18
RLVF: Learning from Verbal Feedback without Overgeneralization

Paper • 2402.10893 • Published Feb 16 • 10
Coercing LLMs to do and reveal (almost) anything

Paper • 2402.14020 • Published Feb 21 • 12

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published 21 days ago • 60
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning

Paper • 2406.06469 • Published 21 days ago • 22
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published 25 days ago • 26
Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published 27 days ago • 35

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Phased Consistency Model

Paper • 2405.18407 • Published May 28 • 44
AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct

Paper • 2405.14906 • Published May 23 • 21
Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39
TimeGPT-1

Paper • 2310.03589 • Published Oct 5, 2023 • 3
A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Paper • 2405.00332 • Published May 1 • 30
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

Paper • 2405.18991 • Published May 29 • 12

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published about 1 month ago • 60

Models and Linearity

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 143
Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39

Interesting things.

AtP*: An efficient and scalable method for localizing LLM behaviour to components

Paper • 2403.00745 • Published Mar 1 • 8
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 574
MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT

Paper • 2402.16840 • Published Feb 26 • 23
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens

Paper • 2402.13753 • Published Feb 21 • 106

Relative representations enable zero-shot latent space communication

Paper • 2209.15430 • Published Sep 30, 2022 • 1
Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23 • 39
The Geometry of Categorical and Hierarchical Concepts in Large Language Models

Paper • 2406.01506 • Published 28 days ago • 3

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs