Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
THUDM
/
glm-4-9b-chat
like
630
Follow
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
1,228
Transformers
Safetensors
Chinese
English
chatglm
glm
thudm
custom_code
arxiv:
2406.12793
License:
glm-4
Model card
Files
Files and versions
Community
83
Train
Use this model
flash_attention_2
#51
by
zxdu20
- opened
Jun 24
base:
refs/heads/main
←
from:
refs/pr/51
Discussion
Files changed
+345
-232
Add eager and sdpa attention implementations
835c7179
Add support for flash attention 2
a7eaddd0
Merge branch 'main' into attention
29038ea1
zxdu20
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University org
Jun 24
No description provided.
zxdu20
changed pull request status to
open
Jun 24
zxdu20
changed pull request status to
merged
Jun 24
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Comment
·
Sign up
or
log in
to comment