HUTAO's picture

20 10

HUTAO

IRISA2

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

liked a model 22 days ago

deepseek-ai/DeepSeek-R1

upvoted a paper 22 days ago

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

View all activity

Organizations

None yet

IRISA2's activity

upvoted a paper 22 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 24 days ago • 182

liked a model 22 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 13 days ago • 3.64M • • 11k

upvoted 18 papers 22 days ago

Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models

Paper • 2408.04594 • Published Aug 8, 2024 • 15

VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

Paper • 2407.18245 • Published Jul 25, 2024 • 10

Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics

Paper • 2408.04631 • Published Aug 8, 2024 • 10

Better Alignment with Instruction Back-and-Forth Translation

Paper • 2408.04614 • Published Aug 8, 2024 • 16

LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

Paper • 2408.04284 • Published Aug 8, 2024 • 26

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 19

Generating novel experimental hypotheses from language models: A case study on cross-dative generalization

Paper • 2408.05086 • Published Aug 9, 2024 • 6

MulliVC: Multi-lingual Voice Conversion With Cycle Consistency

Paper • 2408.04708 • Published Aug 8, 2024 • 8

MooER: LLM-based Speech Recognition and Translation Models from Moore Threads

Paper • 2408.05101 • Published Aug 9, 2024 • 8

BRAT: Bonus oRthogonAl Token for Architecture Agnostic Textual Inversion

Paper • 2408.04785 • Published Aug 8, 2024 • 9

Kalman-Inspired Feature Propagation for Video Face Super-Resolution

Paper • 2408.05205 • Published Aug 9, 2024 • 10

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Paper • 2408.04682 • Published Aug 8, 2024 • 18

UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling

Paper • 2408.04810 • Published Aug 9, 2024 • 24

Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers

Paper • 2408.05506 • Published Aug 10, 2024 • 10

Body Transformer: Leveraging Robot Embodiment for Policy Learning

Paper • 2408.06316 • Published Aug 12, 2024 • 10

UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization

Paper • 2408.05939 • Published Aug 12, 2024 • 15

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors

Paper • 2408.06019 • Published Aug 12, 2024 • 15