Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

kashif's activity

commented on Multivariate Probabilistic Time Series Forecasting with Informer 9 days ago

What is the exact command that is giving the error?

upvoted an article 12 days ago

Article

Open R1: Update #2

By … and 6 others • 12 days ago • 182

upvoted an article 22 days ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

22 days ago • 35

published a model about 1 month ago

kashif/Qwen2-0.5B-SFT

updated a model about 1 month ago

kashif/Gemma2-2B-SFT

Text Generation • Updated Jan 20 • 15

published a model about 1 month ago

kashif/Gemma2-2B-SFT

Text Generation • Updated Jan 20 • 15

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 257

upvoted an article about 2 months ago

Article

Process Reinforcement through Implicit Rewards

By … and 1 other • Jan 3 • 24

liked 2 Spaces 2 months ago

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

Fev Leaderboard

Display benchmark results for time series models

liked a model 2 months ago

nicolas-dufour/PLONK_YFCC

Updated Dec 12, 2024 • 170 • 13

updated a model 3 months ago

huggingface/timesfm-tourism-monthly

Updated Dec 9, 2024 • 35 • 1

upvoted a paper 3 months ago

Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving

Paper • 2407.00079 • Published Jun 24, 2024 • 5

liked a model 3 months ago

flair/bueble-lm-2b

Text Generation • Updated Dec 6, 2024 • 3.28k • 20

upvoted a paper 3 months ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5

liked a model 3 months ago

TianqiLiuAI/RM-gemma2-2b

Text Generation • Updated Nov 18, 2024 • 94 • 1

updated a dataset 3 months ago

trl-lib/alpaca-cleaned

Viewer • Updated Nov 28, 2024 • 51.8k • 54

liked a dataset 3 months ago

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 30.9k • 158

updated a model 3 months ago

HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated Nov 26, 2024 • 208 • 18

liked a model 3 months ago

apple/coreml-mobileclip

Updated Nov 19, 2024 • 299 • 40