9 33 24

Rajdeep Borgohain

rbgo

RajdeepBorgohain

AI & ML interests

Solving language barriers.

Recent Activity

upvoted a paper 3 days ago

Qwen2.5 Technical Report

liked a dataset 5 days ago

NTU-NLP-sg/xCodeEval

upvoted a collection 7 days ago

Qwen2.5

View all activity

Organizations

rbgo's activity

upvoted a paper 3 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 4 days ago • 304

liked a dataset 5 days ago

NTU-NLP-sg/xCodeEval

Updated Jun 6 • 120k • 38

upvoted a collection 7 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 25 days ago • 438

liked a Space 9 days ago

Running

300

💻

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 25 days ago • 351

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 17 days ago • 547

upvoted a collection 12 days ago

Qwen

Collection

Qwen • 16 items • Updated 25 days ago • 14

updated a collection 17 days ago

All About LLMs

Collection

2 items • Updated 17 days ago

liked a Space 17 days ago

Running

📈

Qwen/QwQ-32B-Preview

Text Generation • Updated 24 days ago • 119k • • 1.39k

reacted to m-ric's post with 👍 about 1 month ago

Post

786

🔍 Meta teams use a fine-tuned Llama model to fix production issues in seconds

One of Meta's engineering teams shared how they use a fine-tuned small Llama (Llama-2-7B, so not even a very recent model) to identify the root cause of production issues with 42% accuracy.

🤔 42%, is that not too low?
➡️ Usually, whenever there's an issue in production, engineers dive into recent code changes to find the offending commit. At Meta's scale (thousands of daily changes), this is like finding a needle in a haystack.
💡 So when the LLM-based suggestion is right, it cuts incident resolution time from hours to seconds!

How did they do it?

🔄 Two-step approach:
‣ Heuristics (code ownership, directory structure, runtime graphs) reduce thousands of potential changes to a manageable set
‣ Fine-tuned Llama 2 7B ranks the most likely culprits

🎓 Training pipeline:
‣ Continued pre-training on Meta's internal docs and wikis
‣ Supervised fine-tuning on past incident investigations
‣ Training data mimicked real-world constraints (2-20 potential changes per incident)

🔮 Now future developments await:
‣ Language models could handle more of the incident response workflow (runbooks, mitigation, post-mortems)
‣ Improvements in model reasoning should boost accuracy further

Read it in full 👉 https://www.tryparity.com/blog/how-meta-uses-llms-to-improve-incident-response