VRAM-estimator / README.md
tvosch's picture
quick qlora support
a905447

A newer version of the Gradio SDK is available: 5.6.0

Upgrade
metadata
title: VRAM Estimator
emoji: 🧮
colorFrom: yellow
colorTo: blue
sdk: gradio
sdk_version: 4.39.0
app_file: app.py
pinned: false

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

TODO:

  • include vocab size as manual parameter
  • include finetuning techniques (LoRA/QLoRA/LoftQ)
  • include inference (KV cache memory)
  • include number of experts for Mixture of Experts (MoE) models
  • include DeepSpeed communication memory overhead (allgather_bucket_size)