Kale

Zyn123

AI & ML interests

None yet

Recent Activity

upvoted an article 25 days ago

Decoding Strategies in Large Language Models

upvoted an article about 1 month ago

liked a model about 1 month ago

EmergentMethods/gliner_medium_news-v2.1

View all activity

Organizations

None yet

Zyn123's activity

upvoted an article 25 days ago

Article

Decoding Strategies in Large Language Models

•

about 1 month ago

• 38

upvoted 2 articles about 1 month ago

Article

Fine-tune Llama 2 with DPO

Aug 8, 2023

• 34

Article

How to build a custom text classifier without days of human labeling

•

Oct 17

• 55

upvoted an article about 2 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18

• 204

upvoted 3 articles 3 months ago

Article

Fine-Tune Whisper with 🤗 Transformers

Nov 3, 2022

• 121

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19

• 73

Article

Merge Large Language Models with mergekit

•

Jan 9

• 82

upvoted an article 4 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 51

upvoted a paper 5 months ago

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21 • 61

upvoted 2 articles 6 months ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7

• 40

Article

Everything About Long Context Fine-tuning

•

May 10

• 32

upvoted a paper 6 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 118

upvoted an article 6 months ago

Article

Let's talk about LLM evaluation

•

May 23

• 134

upvoted an article 7 months ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

•

Jun 3

• 42

upvoted a paper 8 months ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 30

upvoted 2 papers 9 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 62

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 603