Joao Pedro Silva Dias Moura Mesquita's picture

Joao Pedro Silva Dias Moura Mesquita

inkasaras

·

joaopedrosdmm

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

upvoted an article 5 days ago

Open-R1: Update #1

upvoted a collection 5 days ago

View all activity

Organizations

None yet

inkasaras's activity

upvoted a paper 1 day ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 2 days ago • 52

upvoted an article 5 days ago

Article

Open-R1: Update #1

By

and 7 others •

26 days ago

• 288

upvoted a collection 5 days ago

SigLIP2

36 items • Updated 7 days ago • 50

upvoted a collection 6 days ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated 2 days ago • 40

upvoted an article 7 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

8 days ago

• 174

upvoted a collection 7 days ago

PaliGemma 2 Mix

13 items • Updated 8 days ago • 59

upvoted a collection 10 days ago

Step-Audio

Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 10 days ago • 28

upvoted 2 articles 12 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

about 1 month ago

• 780

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

24 days ago

• 109

upvoted a paper 15 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 18 days ago • 32

upvoted an article 15 days ago

Article

Open R1: Update #2

By

and 6 others •

17 days ago

• 190

upvoted a paper 16 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 20 days ago • 120

upvoted a collection 21 days ago

SYNTHETIC-1

A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 7 days ago • 49

upvoted an article 23 days ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

27 days ago

• 35

upvoted a paper 23 days ago

Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models

Paper • 2501.18119 • Published 29 days ago • 24

upvoted an article 23 days ago

Article

Open-source DeepResearch – Freeing our search agents

24 days ago

• 1.11k

upvoted an article 29 days ago

Article

Janus Pro: DeepSeek's Revolutionary Multimodal AI Model

By

•

about 1 month ago

• 31

upvoted a collection about 1 month ago

Albertina

Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2

upvoted an article about 1 month ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

Jan 23

• 64

upvoted a collection about 2 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated Jan 17 • 265