millionwords PRO

AI & ML interests

None yet

Organizations

None yet

millionwords's activity

New activity in sesame/csm-1b 2 days ago

Disappointed with the results - Gibberish, Pauses, Inconsistent Voice, and Pitch is unstable

#14 opened 2 days ago by

upvoted a paper 2 days ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published 5 days ago • 23

reacted to nicolay-r's post with 🔥 3 days ago

📢 With the recent release of Gemma-3, if you are interested in playing with textual chain-of-thought, the notebook below is a wrapper over the model (native transformers inference API) for passing a predefined schema of prompts in batching mode.
https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_gemma_3.ipynb

Limitation: the schema supports text only (for now), while Gemma-3 is a text+image-to-text model.

Model: google/gemma-3-1b-it
Provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/transformers_gemma3.py

1 reply

New activity in CohereForAI/c4ai-command-a-03-2025 4 days ago

Finetuning instruction & GPU advise

#2 opened 4 days ago by

liked 2 models 7 months ago

mlc-ai/Hermes-2-Pro-Mistral-7B-q4f16_1-MLC

Updated Aug 5, 2024 • 827 • 1

openbmb/MiniCPM-V-2_6-int4

Image-Text-to-Text • Updated 18 days ago • 129k • 75

New activity in bartowski/gemma-2-27b-it-GGUF 9 months ago

'LlamaCppModel' object has no attribute 'model'

#2 opened 9 months ago by

liked 2 models 9 months ago

ZeusLabs/L3-Aethora-15B-V2

Text Generation • Updated Jul 24, 2024 • 149 • 41

bartowski/gemma-2-27b-it-GGUF

Text Generation • Updated Aug 3, 2024 • 18.6k • 167

liked 2 Spaces 9 months ago

LLM Training Cost Calculator

Open Medical-LLM Leaderboard

Browse and submit LLM evaluations

liked a model 9 months ago

Snowflake/snowflake-arctic-instruct

Text Generation • Updated May 21, 2024 • 13k • 352

upvoted a collection 9 months ago

GLM-4

GLM-4 Open Models • 14 items • Updated 23 days ago • 117