SAMBIT CHAKRABORTY
sambitchakhf03
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
5 days ago
Token-Budget-Aware LLM Reasoning
upvoted
a
paper
21 days ago
Phi-4 Technical Report
upvoted
a
paper
23 days ago
Training Large Language Models to Reason in a Continuous Latent Space
Organizations
Collections
5
-
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU
Paper • 2403.06504 • Published • 53 -
Stealing Part of a Production Language Model
Paper • 2403.06634 • Published • 90 -
MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
Paper • 2406.14909 • Published • 14
models
2
datasets
None public yet