Joseph's picture

Joseph

Joseph717171

·

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

NyxKrage/Microsoft_Phi-4

upvoted a paper 5 days ago

Deliberation in Latent Space via Differentiable Cache Augmentation

liked a model 7 days ago

black-forest-labs/FLUX.1-schnell

View all activity

Organizations

Joseph717171's activity

upvoted a paper 5 days ago

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 6 days ago • 25

upvoted a paper 9 days ago

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published 14 days ago • 25

upvoted a paper 16 days ago

Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation

Paper • 2410.08371 • Published Oct 10 • 1

upvoted a collection 17 days ago

GGUF Llama-3.2-Instruct-OQ8_0-F32.EF32.IQ4_K-Q8_0 IQuants

Custom GGUF quants of Meta’s Llama-3.2-Instruct's finetunes, where the Output Tensors are quantized to Q8_0 or F32 and the Embeddings are kept @F32 • 3 items • Updated 17 days ago • 2

upvoted a collection about 1 month ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 23 days ago • 548

upvoted a collection about 2 months ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28 • 258

upvoted a paper 3 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

upvoted 2 collections 3 months ago

Recent highlights

Some recent models worth checking out • 18 items • Updated Nov 1 • 43

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28 • 446

upvoted a paper 4 months ago

BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation

Paper • 2402.16880 • Published Feb 18 • 2

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 23 days ago • 636

upvoted a collection 6 months ago

LLM Compiler

Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. • 4 items • Updated Jun 27 • 146

upvoted an article 7 months ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

Apr 28

• 37

upvoted a collection 10 months ago

Honorable mentions

Some models I've made and I liked but isn't part of a serie. • 10 items • Updated Feb 4 • 6