Muhammad Osama
mosama
AI & ML interests
None yet
Recent Activity
updated
a model
8 minutes ago
mosama/Qwen2.5-0.5B-Pretrained-ar-end-urd-500
updated
a model
12 minutes ago
mosama/Qwen2.5-0.5B-Pretraining-ar-eng-urd-LoRA-Adapters
updated
a model
about 19 hours ago
mosama/Llama3.2-1B-Pretrained-ar-end-urd-2450
Organizations
None yet
mosama's activity
tensor size mismatch
2
#9 opened 4 months ago
by
Daemontatox
Train Mistral 7B 0.2
9
#2 opened 12 months ago
by
mosama
Error: `rope_scaling`must be a dictionary with two fields
6
#1 opened about 1 year ago
by
LeMoussel
Model loading datatype bfloat16 or simple float16?
#2 opened about 1 year ago
by
mosama
With use_cache=False, the reponse is taking very long
#41 opened about 1 year ago
by
mosama
No chat template in tokenizer
2
#2 opened about 1 year ago
by
mosama
Output Score
4
#7 opened about 1 year ago
by
mosama