16 2 82

Samuel Azran

SamuelAzran

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

homebrewltd/Ichigo-llama3.1-s-instruct-v0.4

liked a model 5 days ago

homebrewltd/llama3-s-instruct-v0.2

liked a model 7 days ago

Qwen/QVQ-72B-Preview

View all activity

Organizations

None yet

SamuelAzran's activity

New activity in hebrew-llm-leaderboard/leaderboard 5 months ago

New Gemma 2 27B?

#3 opened 6 months ago by

SamuelAzran

New activity in yam-peleg/Hebrew-Gemma-11B-Instruct 10 months ago

Was it train after the latest Huggingface Transformers Gemma fix? if not any update plans?

#4 opened 10 months ago by

SamuelAzran

New activity in cloudyu/Mixtral_34Bx2_MoE_60B 12 months ago

Should not be called mixtral, the models made into the moe are yi based

#2 opened 12 months ago by

teknium

How does the MoE work?

#5 opened 12 months ago by

PacmanIncarnate

New activity in cloudyu/Mixtral_7Bx2_MoE 12 months ago

One or two models during inference?

#3 opened 12 months ago by

Venkman42

New activity in upstage/SOLAR-10.7B-Instruct-v1.0 about 1 year ago

You know Mixtral, Llama 2 70b, GPT3.5... Are All Much Better

#13 opened about 1 year ago by deleted

New activity in VAGOsolutions/SauerkrautLM-SOLAR-Instruct about 1 year ago

Awesome- Could you help with pointers on doing same for Other languages(Swedish)?

#2 opened about 1 year ago by

Olofp

QLora or full fine-tuning?

#1 opened about 1 year ago by

SamuelAzran

New activity in NousResearch/Nous-Capybara-34B about 1 year ago

Was system message used during training?

#8 opened about 1 year ago by

SamuelAzran

New activity in open-llm-leaderboard/open_llm_leaderboard about 1 year ago

NEW! OpenLLMLeaderboard 2023 fall update

#356 opened about 1 year ago by

clefourrier

New activity in NousResearch/Nous-Capybara-34B about 1 year ago

Did you do full model fine tuning (all layers) or only adapters?

#2 opened about 1 year ago by

SamuelAzran

New activity in 01-ai/Yi-34B about 1 year ago

Can you release a chat version soon ?

#8 opened about 1 year ago by

dong0213

New activity in openchat/openchat_v2_w over 1 year ago

Great work, but why only 2048 context length?

#4 opened over 1 year ago by

SamuelAzran

New activity in nomic-ai/gpt4all-mpt over 1 year ago

Would it work well with sequence length > 2048?

#1 opened over 1 year ago by

SamuelAzran

New activity in TheBloke/alpaca-lora-65B-GGML over 1 year ago

Thank you very much!

#2 opened over 1 year ago by

AiCreatornator

New activity in google/flan-ul2 almost 2 years ago

Error running the example code

#6 opened almost 2 years ago by

will33am