25 1

Fahadh

fahadh4ilyas

fahadh4ilyas

AI & ML interests

None yet

Recent Activity

New activity about 2 months ago

google/datagemma-rig-27b-it:Why the example prompt doesn't include prompt format?

View all activity

Organizations

None yet

fahadh4ilyas's activity

New activity in google/datagemma-rig-27b-it about 2 months ago

Why the example prompt doesn't include prompt format?

#8 opened about 2 months ago by

fahadh4ilyas

New activity in defog/llama-3-sqlcoder-8b 5 months ago

Example prompt?

#17 opened 5 months ago by

fahadh4ilyas

New activity in LargeWorldModel/ultrachat_qa_mix_128K 6 months ago

What does it mean to pre-pack UltraChat data?

#1 opened 6 months ago by

fahadh4ilyas

New activity in gradientai/Llama-3-8B-Instruct-Gradient-1048k 6 months ago

Rope Theta Value Difference?

#24 opened 6 months ago by

fahadh4ilyas

New activity in CohereForAI/aya-23-8B 6 months ago

What Does `elif false == true` means in chat template?

#4 opened 6 months ago by

fahadh4ilyas

upvoted a collection 8 months ago

Hermes 2

Collection

Nous' Flagship LLM Series • 23 items • Updated Aug 15 • 101

New activity in mistralai/Mistral-7B-Instruct-v0.2 8 months ago

What is the max. content length of Mistral-7B-Instruct-v0.2?

#43 opened 10 months ago by

hanshupe

New activity in databricks/dbrx-instruct 8 months ago

The fused expert parameters means load_in_4bit doesn't work properly, nor does LoRA

#10 opened 8 months ago by

tdrussell

New activity in LnL-AI/dbrx-base-converted-v2 8 months ago

Ready for Testing...

#1 opened 8 months ago by

Qubitium

Fix import typo

#2 opened 8 months ago by

fahadh4ilyas

New activity in databricks/dbrx-instruct 8 months ago

Failing to 4-bit quantize with BitsAndBytes

#16 opened 8 months ago by

simsim314

New activity in microsoft/phi-2 9 months ago

Target modules {'out_proj', 'Wqkv'} is not found in the phi-2 model how can I fix this error?

#115 opened 9 months ago by

roy1109

New activity in liuhaotian/llava-v1.6-mistral-7b 9 months ago

Some value in config is not used?

#7 opened 9 months ago by

fahadh4ilyas

New activity in sshh12/Mistral-7B-LoRA-AudioWhisper 9 months ago

Where is the adapter_model.bin?

#1 opened 9 months ago by

fahadh4ilyas

New activity in microsoft/phi-2 10 months ago

Model token size is bigger than tokenizer size?

#97 opened 10 months ago by

fahadh4ilyas

Why inside `modeling_phi.py`, the output from Self Attention is not becoming the input of MLP?

#94 opened 10 months ago by

fahadh4ilyas

New activity in openchat/openchat_sharegpt_v3 about 1 year ago

-100 vs 0 in label?

#2 opened about 1 year ago by

fahadh4ilyas

New activity in Yukang/Llama-2-13b-chat-longlora-32k-sft about 1 year ago

Why this model kept generating \n when loaded with text generation web ui?

#2 opened about 1 year ago by

fahadh4ilyas

New activity in TheBloke/falcon-40b-instruct-GPTQ over 1 year ago

Offloading to cpu not working?

#21 opened over 1 year ago by

fahadh4ilyas

New activity in TheBloke/falcon-40b-sft-mix-1226-GGML over 1 year ago

Can it be loaded using text generation web ui?

#2 opened over 1 year ago by

fahadh4ilyas