Feynman Innovations's picture

Feynman Innovations

ajibawa-2023

·

AjinkyaBawase

AI & ML interests

LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine tuned ) for various use cases.

Recent Activity

replied to MoritzLaurer's post 4 days ago

"Open-source AI: year in review 2024": amazing Space with lots of data-driven insights into AI in 2024! Check it out 👇 https://huggingface.co/spaces/huggingface/open-source-ai-year-in-review-2024

reacted to MoritzLaurer's post with 🔥 4 days ago

"Open-source AI: year in review 2024": amazing Space with lots of data-driven insights into AI in 2024! Check it out 👇 https://huggingface.co/spaces/huggingface/open-source-ai-year-in-review-2024

replied to qq8933's post 4 days ago

News! ChemVLM Codes Opensource Now! https://github.com/AI4Chem/ChemVlm

View all activity

Organizations

ajibawa-2023's activity

upvoted an article about 2 months ago

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

By

•

Oct 21

• 18

upvoted 2 papers 3 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 47

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 104

upvoted 2 collections 3 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 25 days ago • 289

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224

upvoted a paper 3 months ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11 • 50

upvoted 2 papers 4 months ago

τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Paper • 2406.12045 • Published Jun 17 • 6

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 48

upvoted a collection 4 months ago

GLiNER bi-encoders

Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10 • 12

upvoted 8 papers 4 months ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28 • 95

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1 • 86

Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7 • 55

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27 • 86

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13 • 66

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Paper • 2408.04682 • Published Aug 8 • 14

Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM

Paper • 2408.07246 • Published Aug 14 • 21

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

upvoted an article 5 months ago

Article

Introducing TextImage Augmentation for Document Images

Aug 6

• 32

upvoted a collection 7 months ago

Granite Code Models

A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 4 days ago • 180

upvoted a paper 8 months ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15 • 20