jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a model about 11 hours ago

simplescaling/s1-32B

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

liked a Space 2 days ago

m-ric/open_Deep-Research

View all activity

Organizations

real-jiakai's activity

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 677

upvoted an article 4 days ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

and 1 other •

Oct 14, 2024

• 68

upvoted a collection 6 days ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86

upvoted a paper 7 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 8 days ago • 50

upvoted a paper 10 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 14 days ago • 48

upvoted an article 10 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

23 days ago

• 137

upvoted a collection 10 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 322

upvoted a paper 14 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

upvoted 2 papers 23 days ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation

Paper • 2412.02592 • Published Dec 3, 2024 • 22

upvoted a paper 27 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 29 days ago • 253

upvoted 2 papers 28 days ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 30 days ago • 33

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 30 days ago • 84

upvoted a paper 29 days ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published about 1 month ago • 48

upvoted a paper 30 days ago

Personalized Graph-Based Retrieval for Large Language Models

Paper • 2501.02157 • Published Jan 4 • 28

upvoted a paper about 1 month ago

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

Paper • 2412.20005 • Published Dec 28, 2024 • 17

upvoted a collection about 1 month ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213

upvoted an article about 1 month ago

Article

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

and 1 other •

Jan 3

• 13

upvoted a paper about 1 month ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 48

upvoted an article about 1 month ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

and 1 other •

Nov 21, 2024

• 35