Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

AI & ML interests

NLP, Responsible AI, Generative AI

Recent Activity

commented on a paper 2 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

commented on a paper 8 days ago

Qwen2.5-1M Technical Report

commented on a paper 8 days ago

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Organizations

Posts 2

Post

Hi All,
In my latest blog post, I created a comprehensive guide on LLM Benchmarking.
➟ 20+ key benchmarks, from MMLU to TruthfulQA
➟ How each benchmark assesses different LLM capabilities
➟ Why benchmarking matters for real-world AI applications
➟ Future trends in AI evaluation
Read the blog here: https://wp.me/p7Qix-wO

Please let me know your thoughts, suggestions, and comments.

Post

Can AI cheat or lie?

In this blog, we will explore the research conducted by experts from MIT, Australian Catholic University, and the Center for AI Safety to better understand the nature of AI deception, its various forms, and the potential risks it poses. We will examine real-world examples and the underlying mechanisms that enable AI systems to deceive.

Learn more at: https://ajithp.com/2024/05/12/ai-deception-risks-real-world-examples-and-proactive-solutions/

Collections 1

models

None public yet

datasets

None public yet

Collections 1

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

OneLLM: One Framework to Align All Modalities with Language

Generative Multimodal Models are In-Context Learners

The LLM Surgeon
