raincandy_U's picture

raincandy_U

raincandy-u

·

AI & ML interests

幻覚。

Organizations

raincandy-u's activity

upvoted a paper 6 months ago

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 9

upvoted a paper 7 months ago

Needle In A Multimodal Haystack

Paper • 2406.07230 • Published Jun 11, 2024 • 53

upvoted an article 7 months ago

Article

Uncensor any LLM with abliteration

By

•

Jun 13, 2024

• 386

upvoted a paper 7 months ago

LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

Paper • 2405.18377 • Published May 28, 2024 • 18

upvoted a collection 8 months ago

Mini Pretrain Datasets

9 items • Updated Jul 9, 2024 • 9

upvoted 2 papers 8 months ago

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

Paper • 2405.08707 • Published May 14, 2024 • 27

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 87

upvoted a collection 8 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14, 2024 • 538

upvoted a collection 9 months ago

Llamafied Models

This is a collection of llamafied models - such as Qwen. • 5 items • Updated Apr 19, 2024 • 1