Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 5 days ago • 23
view post Post 1495 📢 With the recent release of Gemma-3, If you interested to play with textual chain-of-though, the notebook below is a wrapper over the the model (native transformers inference API) for passing the predefined schema of promps in batching mode.https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynbLimitation: schema supports texts only (for now), while gemma-3 is a text+image to text.Model: google/gemma-3-1b-itProvider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_gemma3.py See translation 1 reply · 🔥 5 5 + Reply