Shwai He
Shwai
3 followers · 10 following
https://shwai-he.github.io/
Shwai-He
AI & ML interests
Deep Learning, Machine Learning, Natural Language Processing.
Recent Activity
upvoted a paper about 17 hours ago
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
upvoted a paper 1 day ago
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
commented on a paper 1 day ago
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts
Shwai's activity
liked 14 models 5 months ago
s1ghhh/Mistral-7B-v0.1-Drop4Attn • Updated Sep 8, 2024 • 8 • 2
s1ghhh/Llama-2-13b-Drop8Block • Updated Sep 8, 2024 • 8 • 2
s1ghhh/Llama-2-13b-Drop4Block • Updated Sep 8, 2024 • 11 • 2
s1ghhh/Llama-2-13b-Drop8Attn • Updated Sep 8, 2024 • 12 • 2
s1ghhh/Llama-2-13b-Drop4Attn • Updated Sep 8, 2024 • 7 • 2
s1ghhh/Llama-2-13b-Drop4MLP • Updated Sep 8, 2024 • 8 • 2
s1ghhh/Llama-2-13b-Drop8MLP • Updated Sep 8, 2024 • 15 • 2
s1ghhh/Mistral-7B-v0.1-Drop4Block • Updated Sep 8, 2024 • 10 • 2
s1ghhh/Mistral-7B-v0.1-Drop8Block • Updated Sep 8, 2024 • 7 • 2
s1ghhh/Mistral-7B-v0.1-Drop8Attn • Updated Sep 8, 2024 • 13 • 2
s1ghhh/Mistral-7B-v0.1-Drop4MLP • Updated Sep 8, 2024 • 9 • 2
s1ghhh/Mistral-7B-v0.1-Drop8MLP • Updated Sep 8, 2024 • 9 • 2
s1ghhh/Llama-3-70b-Drop • Text Generation • Updated Oct 23, 2024 • 10 • 3
s1ghhh/Llama-2-70b-Drop • Text Generation • Updated Oct 23, 2024 • 14 • 2