Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University

university

https://github.com/THUDM

thukeg

THUDM

AI & ML interests

AGI, LLMs, ChatGLM

Recent Activity

NeoZ123 authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

xujz0703 authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

yuxiaod authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

View all activity

THUDM's activity

NeoZ123

authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 3 days ago • 27

xujz0703

authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 3 days ago • 27

yuxiaod

authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 3 days ago • 27

jerytang

authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 3 days ago • 27

bys0318

authored a paper 2 days ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published 3 days ago • 27

bys0318

updated a dataset 2 days ago

THUDM/LongBench-v2

Viewer • Updated 2 days ago • 503 • 108 • 4

akhaliq

posted an update 3 days ago

Post

1776

Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: akhaliq/anychat

ShiyuHuang

updated 2 Spaces 4 days ago

LVBench Leaderboard

MotionBench Leaderboard

bys0318

updated a dataset 4 days ago

THUDM/LongBench

Viewer • Updated 4 days ago • 8.42k • 41k • 130

yuxiaod

authored a paper 5 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 6 days ago • 15

THUDM-Space

in THUDM/CogVideoX1.5-5B-I2V 6 days ago

AttributeError: 'CogVideoXTransformer3DModel' object has no attribute 'ofs_embedding'. Did you mean: 'time_embedding'?

#3 opened about 1 month ago by

NeoZ123

updated 2 models 6 days ago

THUDM/LongCite-llama3.1-8b

Text Generation • Updated 6 days ago • 659 • 28

THUDM/LongCite-glm4-9b

Text Generation • Updated 6 days ago • 532 • 29

zRzRzRzRzRzRzR

updated a model 12 days ago

THUDM/codegeex2-6b

Updated 12 days ago • 347 • 253

ShawLiu

updated a model 13 days ago

THUDM/webrl-orm-llama-3.1-8b

Updated 13 days ago • 19

zRzRzRzRzRzRzR

updated 3 models 17 days ago

THUDM/chatglm3-6b-128k

Updated 17 days ago • 405 • 77

THUDM/chatglm3-6b-base

Updated 17 days ago • 32.2k • 88

THUDM/chatglm3-6b

Updated 17 days ago • 33.8k • 1.1k

zRzRzRzRzRzRzR

in THUDM/glm-4-9b-chat-1m 18 days ago

Converting to native Transformers

#17 opened 3 months ago by