15 12 106

Igor Kuzmin

igorktech

igorktech

AI & ML interests

AI, Chatbots, NLP, Reinforcement Learning in conversational assistants.

Recent Activity

upvoted a paper 20 days ago

Rho-1: Not All Tokens Are What You Need

upvoted a paper 20 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

liked a model 20 days ago

answerdotai/ModernBERT-base

View all activity

Organizations

igorktech's activity

upvoted 2 papers 20 days ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 88

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 22 days ago • 120

upvoted a collection about 2 months ago

Tulu 3 Datasets

Collection

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 3 days ago • 64

upvoted a paper 3 months ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 64

upvoted 2 articles 4 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 110

Article

Introduction to ggml

Aug 13, 2024

• 125

upvoted a paper 4 months ago

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

Paper • 2408.12503 • Published Aug 22, 2024 • 23

upvoted an article 5 months ago

Article

Tool Use, Unified

Aug 12, 2024

• 70

upvoted a paper 5 months ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 79

upvoted an article 6 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 171

upvoted an article 7 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 391

upvoted a paper 8 months ago

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 54