samusenps

AI & ML interests

Foundational Architectures, Multi-Modality, Interpretability, Benchmarking w/ simulations, Robotics, Integration with Non envasive Open Source stack RISC-V BCI. Extremely high quality training data. Fully Open Source ML/AI.

Recent Activity

upvoted a paper about 2 hours ago

SCOPE: Optimizing Key-Value Cache Compression in Long-context Generation

upvoted a paper about 2 hours ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

upvoted a paper about 2 hours ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

View all activity

Organizations

samusenps's activity

upvoted 20 papers about 2 hours ago

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

Paper • 2412.16849 • Published 12 days ago • 7

NILE: Internal Consistency Alignment in Large Language Models

Paper • 2412.16686 • Published 13 days ago • 8

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Paper • 2412.17295 • Published 11 days ago • 9

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published 15 days ago • 10

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

Paper • 2412.17589 • Published 11 days ago • 11

ResearchTown: Simulator of Human Research Community

Paper • 2412.17767 • Published 11 days ago • 11

Outcome-Refining Process Supervision for Code Generation

Paper • 2412.15118 • Published 15 days ago • 18

LearnLM: Improving Gemini for Learning

Paper • 2412.16429 • Published 13 days ago • 19

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published 11 days ago • 20

Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published 12 days ago • 25

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published 11 days ago • 27

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

Paper • 2412.17153 • Published 12 days ago • 33

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published 11 days ago • 39

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published 11 days ago • 41

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 15 days ago • 81

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Paper • 2412.16153 • Published 14 days ago • 6