Aryanne

AI & ML interests

LLMs, AI, GPU/CPU poor, any help is welcome 0x190ac445974a989a87dd223f212a76ca0090c804

Recent Activity

liked a model 11 days ago

PygmalionAI/Pygmalion-3-12B

liked a model 11 days ago

PygmalionAI/Pygmalion-3-12B-GGUF

liked a model 15 days ago

bartowski/cognitivecomputations_Dolphin3.0-R1-Mistral-24B-GGUF

Aryanne's activity

upvoted a collection 3 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 12 days ago • 88

upvoted a paper 4 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 50

upvoted a paper 5 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted 2 collections 5 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 570

Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture! • 3 items • Updated Sep 14, 2024 • 11

upvoted an article 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 223

upvoted 2 articles 6 months ago

Article

Introduction to ggml

Aug 13, 2024

• 155

Article

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

Feb 8, 2024

• 6

upvoted a collection 8 months ago

MatMulfree LM

Pre-trained models for Matmulfree LM. • 4 items • Updated Jun 10, 2024 • 25

upvoted 3 papers 11 months ago

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

Paper • 2401.00711 • Published Jan 1, 2024 • 2

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 76

upvoted 2 papers 12 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6, 2024 • 63

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 609

upvoted 2 papers about 1 year ago

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26, 2024 • 72

Universal Neurons in GPT2 Language Models

Paper • 2401.12181 • Published Jan 22, 2024 • 5

upvoted a collection about 1 year ago

Testing Might be broken

testing only models • 10 items • Updated Feb 3, 2024 • 2

upvoted a paper about 1 year ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 259

upvoted a collection about 1 year ago

Merged Models

Using mergekit • 10 items • Updated Mar 1, 2024 • 3

upvoted a collection over 1 year ago

StableLM (.gguf)

Models based on StableLM Models by Stability AI • 19 items • Updated Nov 27, 2023 • 3