Bhadresh Savani

bhadresh-savani

https://www.linkedin.com/in/bhadreshsavani/

AI & ML interests

NLP, Deep Learning, ML

Recent Activity

liked a model about 1 month ago

Datou1111/shou_xin

reacted to lin-tan's post with 🔥 2 months ago

Can language models replace developers? #RepoCod says “Not Yet”, because GPT-4o and other LLMs have <30% accuracy/pass@1 on real-world code generation tasks.

- Leaderboard: https://lt-asset.github.io/REPOCOD/
- Dataset: https://huggingface.co/datasets/lt-asset/REPOCOD

@jiang719 @shanchao @Yiran-Hu1007

Compared to #SWEBench, RepoCod tasks are:
- General code generation tasks, while SWE-Bench tasks resolve pull requests from GitHub issues.
- Covered by 2.6X more tests per task (313.5 compared to SWE-Bench’s 120.8).

Compared to #HumanEval, #MBPP, #CoderEval, and #ClassEval, RepoCod has 980 instances from 11 Python projects, with:
- Whole function generation
- Repository-level context
- Validation with test cases
- Real-world complex tasks: longest average canonical solution length (331.6 tokens) and the highest average cyclomatic complexity (9.00)

Introducing #RepoCod-Lite 🐟 for faster evaluations: 200 of the toughest tasks from RepoCod, with:
- 67 repository-level, 67 file-level, and 66 self-contained tasks
- Detailed problem descriptions (967 tokens) and long canonical solutions (918 tokens)
- GPT-4o and other LLMs have <10% accuracy/pass@1 on RepoCod-Lite tasks.
- Dataset: https://huggingface.co/datasets/lt-asset/REPOCOD_Lite

#LLM4code #LLM #CodeGeneration #Security

upvoted an article 5 months ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Organizations

spaces 1

Bhadresh Savani Distilbert Base Uncased Emotion

models 31

bhadresh-savani/distilbert-base-uncased-emotion

Text Classification • Updated Aug 14, 2024 • 49.9k • 130

bhadresh-savani/t5-small-finetuned-xsum

Text2Text Generation • Updated Jun 5, 2023 • 2

bhadresh-savani/a2c-PandaReachDense-v2

Reinforcement Learning • Updated Apr 29, 2023

bhadresh-savani/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Apr 29, 2023

bhadresh-savani/SoccerTwos

Reinforcement Learning • Updated Apr 29, 2023 • 14

bhadresh-savani/a2c-AntBulletEnv-v0

Reinforcement Learning • Updated Apr 29, 2023 • 1

bhadresh-savani/ppo-PyramidRND

Reinforcement Learning • Updated Apr 29, 2023 • 3

bhadresh-savani/ppo-SnowballTargetTESTCOLAB

Reinforcement Learning • Updated Apr 29, 2023 • 11

bhadresh-savani/Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning • Updated Apr 29, 2023

bhadresh-savani/Reinforce-Pixelcopter-PLE-v1

Updated Apr 28, 2023

datasets 4

bhadresh-savani/photo-to-cartoon

Viewer • Updated Jul 26, 2024 • 76 • 41 • 8

bhadresh-savani/translate_code_geeksforgeeks_for_t5

Viewer • Updated Apr 6, 2023 • 7.12k • 54 • 4

bhadresh-savani/image-to-style

Viewer • Updated Jul 20, 2022 • 102 • 13

bhadresh-savani/web_split

Viewer • Updated Oct 15, 2021 • 1.42M • 21 • 1