Surya Bhupatiraju's picture

71 2 4

Surya Bhupatiraju

suryabhupa

·

AI & ML interests

part of the Gemma Team -- Language models, Reinforcement Learning

Organizations

suryabhupa's activity

New activity in google/gemma-7b 5 months ago

inquiry for gemma-7b : d_model

#61 opened 10 months ago by

New activity in google/gemma-2-27b-it 6 months ago

Hallucinations, misspellings etc. Something seems broken?

#10 opened 6 months ago by

tokenizer chat_template has no role system

#9 opened 6 months ago by

Citation URL redirects to Gemma-1

#8 opened 6 months ago by

Asking same thing twice or thrice in hugging face chat breaks it , same thing on ollama

#7 opened 6 months ago by

transformers load fails?

#6 opened 6 months ago by

New activity in google/gemma-2-9b-it 6 months ago

Flash attention 2 is not working

#9 opened 6 months ago by

New activity in google/gemma-2b 7 months ago

Unable to reproduce the score of gemma_2b at pass@1 in humaneval.

#53 opened 8 months ago by

New activity in google/gemma-2b-it 7 months ago

What do they mean by maj@1 ?

#44 opened 7 months ago by

New activity in google/gemma-7b 7 months ago

Fine-Tune a gemma model for question answering

#62 opened 10 months ago by

Iamexperimenting

New activity in google/gemma-7b 8 months ago

save, loading and inferencing the Gemma model

#64 opened 10 months ago by

Iamexperimenting

New activity in google/gemma-7b-it 8 months ago

Need info on pre-training and instruction-tuning data

#64 opened 10 months ago by

Inference with RTX 3090 got OOM

#89 opened 8 months ago by

New activity in google/gemma-7b 8 months ago

Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora

#91 opened 8 months ago by

New activity in google/gemma-7b-it 8 months ago

What's the context window for this model?

#73 opened 10 months ago by

New activity in google/gemma-2b 9 months ago

pretraining Gemma for domain dataset

#41 opened 9 months ago by

Iamexperimenting

gemma -2b with multi-gpu

#44 opened 9 months ago by

Iamexperimenting

New activity in google/gemma-7b-it 9 months ago

<pad> spam issue

#40 opened 10 months ago by

New activity in google/gemma-2b 9 months ago

evaluation loss not calculated during during?

#43 opened 9 months ago by

Iamexperimenting

New activity in google/gemma-7b 9 months ago

Dont download, google scuttled this model

#77 opened 9 months ago by

Tom-Neverwinter