sergicalsix

AI & ML interests

None yet

sergicalsix's activity

upvoted 2 papers 1 day ago

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published 11 days ago • 60

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published 13 days ago • 44

upvoted 17 papers 2 days ago

Watermark Anything with Localized Messages

Paper • 2411.07231 • Published 11 days ago • 19

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published 11 days ago • 30

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published 10 days ago • 59

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published 7 days ago • 93

Generative World Explorer

Paper • 2411.11844 • Published 4 days ago • 55

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published 16 days ago • 30

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published 15 days ago • 34

RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval

Paper • 2411.04752 • Published 15 days ago • 16

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published 16 days ago • 22

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Paper • 2411.05000 • Published 15 days ago • 21

M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding

Paper • 2411.04952 • Published 15 days ago • 27

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 15 days ago • 108

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 17 days ago • 60

LLaMo: Large Language Model-based Molecular Graph Assistant

Paper • 2411.00871 • Published 23 days ago • 21

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published 18 days ago • 63

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published 22 days ago • 16

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published 23 days ago • 59

upvoted a paper 3 days ago

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published about 1 month ago • 199