Natyren's picture

Natyren PRO

GeorgeBredis

·

https://t.me/George_B

Natyren

AI & ML interests

Self-Supervised Learning, Generative Modeling, Image-text models

Recent Activity

updated a dataset 1 day ago

GeorgeBredis/ERQA

published a dataset 1 day ago

GeorgeBredis/ERQA

liked a dataset 3 days ago

EmbodiedBench/EB-ALFRED

View all activity

Organizations

GeorgeBredis's activity

upvoted an article 13 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

22 days ago

• 205

upvoted a paper 15 days ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published 17 days ago • 63

upvoted a paper 18 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 22 days ago • 162

upvoted 2 papers 23 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 24 days ago • 56

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 29 days ago • 34

upvoted 3 papers about 1 month ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 58

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 112

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

upvoted a paper 3 months ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 35

upvoted a paper 5 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 18

upvoted a collection 5 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated about 17 hours ago • 299

upvoted 2 collections 6 months ago

XGen-MM-1 models and datasets

A collection of all XGen-MM (Foundation LMM) models! • 18 items • Updated 24 days ago • 38

Multimodal RAG

10 items • Updated Sep 5, 2024 • 26

upvoted 2 collections 7 months ago

PDF Document / OCR Datasets

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30, 2024 • 47

Visual Scorers!

Variants of Visual Evaluation Models proposed by [Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-defined Levels]. Use by `model.score()`! • 10 items • Updated Dec 2, 2024 • 3

upvoted a collection 8 months ago

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated 2 days ago • 79

upvoted 4 papers 8 months ago

Lessons from Learning to Spin "Pens"

Paper • 2407.18902 • Published Jul 26, 2024 • 21

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents

Paper • 2407.18901 • Published Jul 26, 2024 • 33

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Paper • 2407.20179 • Published Jul 29, 2024 • 47

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 32