Sporkness

SporkySporkness

AI & ML interests

None yet

SporkySporkness's activity

New activity in bartowski/Behemoth-123B-v1-GGUF 4 days ago

Failing to load Q6K

#1 opened 4 days ago by SporkySporkness

Have you tested it?

#1 opened 4 days ago by ss996
New activity in black-forest-labs/FLUX.1-Fill-dev 4 days ago

lora version?

#1 opened 4 days ago by cchance27
New activity in bartowski/EVA-Qwen2.5-72B-v0.1-GGUF 18 days ago

Thanks

#1 opened 18 days ago by SporkySporkness
New activity in EVA-UNIT-01/EVA-Qwen2.5-72B-v0.0 21 days ago

Great

#2 opened 21 days ago by SporkySporkness
New activity in bartowski/magnum-v4-72b-GGUF about 1 month ago

EXL2 quant request

#1 opened about 1 month ago by SporkySporkness
Reacted to ImranzamanML's post with 👍 about 1 month ago
Here is how we can calculate the size of any LLM:

Each parameter in LLM models is typically stored as a floating-point number. The size of each parameter in bytes depends on the precision.

32-bit precision: Each parameter takes 4 bytes.
16-bit precision: Each parameter takes 2 bytes.

To calculate the total memory usage of the model:
Memory usage (in bytes) = No. of Parameters × Size of Each Parameter

For example:
32-bit Precision (FP32)
In 32-bit floating-point precision, each parameter takes 4 bytes.
Memory usage in bytes = 1 billion parameters × 4 bytes
1,000,000,000 × 4 = 4,000,000,000 bytes
In gigabytes: 4,000,000,000 ÷ 1024³ ≈ 3.73 GB

16-bit Precision (FP16)
In 16-bit floating-point precision, each parameter takes 2 bytes.
Memory usage in bytes = 1 billion parameters × 2 bytes
1,000,000,000 × 2 = 2,000,000,000 bytes
In gigabytes: 2,000,000,000 ÷ 1024³ ≈ 1.86 GB

Depending on whether you use 32-bit or 16-bit precision, a model with 1 billion parameters would use approximately 3.73 GB or 1.86 GB of memory, respectively.
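
As a quick sanity check, here is a minimal Python sketch of the same calculation. The function name and the 1024³-bytes-per-GB convention are illustrative assumptions matching the numbers above, not part of the original post; it estimates weight memory only.

def model_memory_gb(num_params: int, bytes_per_param: int) -> float:
    # Weight memory only; ignores activations, KV cache, and optimizer state.
    return num_params * bytes_per_param / 1024**3

# 1 billion parameters at FP32 (4 bytes) and FP16 (2 bytes)
print(f"FP32: {model_memory_gb(1_000_000_000, 4):.2f} GB")  # ~3.73 GB
print(f"FP16: {model_memory_gb(1_000_000_000, 2):.2f} GB")  # ~1.86 GB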
Reacted to bartowski's post with ❤️ about 1 month ago
In regards to the latest mistral model and GGUFs for it:

Yes, they may be subpar and may require changes to llama.cpp to support the interleaved sliding window

Yes, I got excited when a conversion worked and released them ASAP

That said, generation seems to work right now and to mimic the output from Spaces that are running the original model

I have appended -TEST to the model names in an attempt to indicate that they are not final or perfect, but if people still feel misled and that it's not the right thing to do, please post (civilly) below your thoughts; I will strongly consider pulling the conversions if that's what people think is best. After all, that's what I'm here for, in service to you all!