Michael Toker's picture

Michael Toker

tokeron

·

https://tokeron.github.io/

AI & ML interests

NLP, Computer Vision, Text to Image, LLMs, VLMs, Interpretability, Explainability

Recent Activity

liked a model about 2 months ago

OpenGVLab/InternVL2_5-8B-MPO

authored a paper 2 months ago

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

upvoted a paper 2 months ago

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

View all activity

Organizations

tokeron's activity

upvoted a paper 2 months ago

Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models

Paper • 2501.06751 • Published Jan 12 • 31

upvoted 8 papers 5 months ago

GLEE: A Unified Framework and Benchmark for Language-based Economic Environments

Paper • 2410.05254 • Published Oct 7, 2024 • 81

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Paper • 2410.02707 • Published Oct 3, 2024 • 48

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published May 2, 2024 • 22

Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

Paper • 2406.20086 • Published Jun 28, 2024 • 6

NNsight and NDIF: Democratizing Access to Foundation Model Internals

Paper • 2407.14561 • Published Jul 18, 2024 • 34

Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models

Paper • 2311.12092 • Published Nov 20, 2023 • 23

NL-Eye: Abductive NLI for Images

Paper • 2410.02613 • Published Oct 3, 2024 • 23

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

Paper • 2403.05846 • Published Mar 9, 2024 • 1

upvoted an article 6 months ago

Article

Introducing the Open Leaderboard for Hebrew LLMs!

May 5, 2024

• 38

upvoted a paper 7 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 59

upvoted 5 papers 9 months ago

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27, 2024 • 31

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 52

MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data

Paper • 2406.18790 • Published Jun 26, 2024 • 34

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs

Paper • 2406.18629 • Published Jun 26, 2024 • 42

Make It Count: Text-to-Image Generation with an Accurate Number of Objects

Paper • 2406.10210 • Published Jun 14, 2024 • 77