Makoto Shing

mkshing

AI & ML interests

NLP, Computer Vision

Recent Activity

upvoted a paper 11 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

upvoted a paper 22 days ago

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

updated a model 23 days ago

SakanaAI/TAID-VLM-2B

mkshing's activity

upvoted a paper 11 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 15 days ago • 113

upvoted a paper 22 days ago

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published 25 days ago • 5

updated 6 models 23 days ago

SakanaAI/TAID-VLM-2B

Visual Question Answering • Updated 23 days ago • 67 • 2

SakanaAI/TAID-LLM-1.5B

Text Generation • Updated 23 days ago • 928 • 4

SakanaAI/TinySwallow-1.5B-Instruct-GGUF

Text Generation • Updated 23 days ago • 8.5k • 20

SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC

Text Generation • Updated 23 days ago • 3

SakanaAI/TinySwallow-1.5B-Instruct

Text Generation • Updated 23 days ago • 16.7k • 34

SakanaAI/TinySwallow-1.5B

Text Generation • Updated 23 days ago • 3.95k • 20

authored 2 papers 23 days ago

Release of Pre-Trained Models for the Japanese Language

Paper • 2404.01657 • Published Apr 2, 2024 • 1

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Paper • 2501.16937 • Published 25 days ago • 5

published 6 models 23 days ago

SakanaAI/TAID-VLM-2B

Visual Question Answering • Updated 23 days ago • 67 • 2

SakanaAI/TAID-LLM-1.5B

Text Generation • Updated 23 days ago • 928 • 4

SakanaAI/TinySwallow-1.5B-Instruct-GGUF

Text Generation • Updated 23 days ago • 8.5k • 20

SakanaAI/TinySwallow-1.5B-Instruct

Text Generation • Updated 23 days ago • 16.7k • 34

SakanaAI/TinySwallow-1.5B

Text Generation • Updated 23 days ago • 3.95k • 20

SakanaAI/TinySwallow-1.5B-Instruct-q4f32_1-MLC

Text Generation • Updated 23 days ago • 3

updated a collection 24 days ago

TinySwallow

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 23 days ago • 16

updated a collection 25 days ago

TinySwallow

Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 23 days ago • 16

updated a model 28 days ago

mkshing/toy

Updated 28 days ago