Nikita

PQlet

AI & ML interests

None yet

Organizations

None yet

PQlet's activity

upvoted a paper 9 days ago

A Primer on the Inner Workings of Transformer-based Language Models

Paper • 2405.00208 • Published Apr 30, 2024 • 10

upvoted a paper 21 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 25 days ago • 163

liked a Space 26 days ago

2.27k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper about 1 month ago

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Paper • 2502.06394 • Published Feb 10 • 86

upvoted an article about 2 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 572

liked a dataset 3 months ago

roneneldan/TinyStories

Viewer • Updated Aug 12, 2024 • 2.14M • 22.9k • 630

liked 2 models 4 months ago

BAAI/bge-m3

jinaai/jina-embeddings-v2-base-en

Feature Extraction • Updated Jan 6 • 228k • 715

upvoted a paper 4 months ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published Oct 23, 2024 • 203

liked a dataset 4 months ago

laion/laion-coco

Viewer • Updated Jul 14, 2024 • 641M • 1.87k • 79

upvoted an article 5 months ago

Article

Understanding InstaFlow/Rectified Flow

Oct 6, 2023

• 27

liked a dataset 5 months ago

Rowan/hellaswag

Viewer • Updated Sep 28, 2023 • 60k • 407k • 112

upvoted a paper 5 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 18

updated a model 5 months ago

PQlet/SkGenAI-InternVL2_1B-demo

Updated Oct 12, 2024

upvoted a collection 5 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated 4 days ago • 97

updated 2 models 5 months ago

PQlet/SkGenAI-InternVL2_1B-demobest_model

Updated Oct 5, 2024

PQlet/test1

Updated Oct 3, 2024

liked a model 6 months ago

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated Feb 5 • 79.1k • 67

upvoted an article 6 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 243

upvoted a paper 7 months ago

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13, 2024 • 32