# Pipeline Parallelism with Controllable Memory
Pipeline Parallelism with Controllable Memory introduces a framework for designing pipeline schedules and uses that framework to find efficient, memory-optimal schedules.
Our findings show that we need approximately 1/3 of the activation memory of 1F1B under ideal conditions (F, B and W have the same runtime), and approximately 1/2 of it to create a zero-bubble schedule in realistic scenarios (a necessary condition being W + 2B ≥ 2F and W + 2F ≥ 2B).
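The necessary condition above is easy to check for a measured set of pass times. The following is a minimal sketch; the function name `satisfies_zero_bubble_condition` and its parameter names are hypothetical and not part of the project's code. Because the condition is necessary rather than sufficient, a `True` result only means a zero-bubble schedule at half memory is not ruled out.

```python
def satisfies_zero_bubble_condition(f: float, b: float, w: float) -> bool:
    """Check the necessary condition W + 2B >= 2F and W + 2F >= 2B.

    f, b, w are the runtimes of the F, B and W passes in the same unit.
    The name and signature are illustrative assumptions.
    """
    return (w + 2 * b >= 2 * f) and (w + 2 * f >= 2 * b)


if __name__ == "__main__":
    # Ideal case: all three passes take the same time.
    print(satisfies_zero_bubble_condition(1.0, 1.0, 1.0))
    # A case where F dominates and the first inequality fails.
    print(satisfies_zero_bubble_condition(3.0, 1.0, 0.5))
```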
Check out our paper on [arXiv](https://arxiv.org/abs/2405.15362).
| Comparison assuming T_F=T_B=T_W | 1F1B | V-Min | V-Half | V-ZB |
| ----------------------------------------------------- | ------- | ------- | ---------- | ---- |
| Bubble Rate | (p-1)/m | ~ 2p/(3m) | ~ p/(2m) | 0 |
| Activation Memory <br> (Compared to 1F1B) | p | (p+4)/3 | (p+2)/2 | p |
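Bubble Rate here is calculated as (longest stage time/(F+B+W)/m - 1), i.e. the extra time of the longest stage relative to the ideal time m(F+B+W). For 1F1B the longest stage takes (m+p-1)(F+B+W), which gives the (p-1)/m entry in the table.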
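As a rough illustration, the table entries can be evaluated for a concrete number of stages `p` and microbatches `m`. This is a sketch under the table's assumption T_F=T_B=T_W; the function names are hypothetical, the V-Min and V-Half bubble rates are the approximate values from the table, and `bubble_rate` is written as the longest stage time relative to the ideal m(F+B+W) minus one, so that it reproduces the (p-1)/m entry for 1F1B.

```python
def bubble_rate(longest_stage_time: float, f: float, b: float, w: float, m: int) -> float:
    """Extra time of the longest stage relative to the ideal m * (F + B + W)."""
    return longest_stage_time / (f + b + w) / m - 1


def table_estimates(p: int, m: int) -> dict:
    """Evaluate the comparison table for p stages and m microbatches.

    Bubble rates for V-Min and V-Half are the approximate table values;
    activation memory is in the table's units, where 1F1B uses p.
    """
    return {
        "1F1B": {"bubble_rate": (p - 1) / m, "activation_memory": p},
        "V-Min": {"bubble_rate": 2 * p / (3 * m), "activation_memory": (p + 4) / 3},
        "V-Half": {"bubble_rate": p / (2 * m), "activation_memory": (p + 2) / 2},
        "V-ZB": {"bubble_rate": 0.0, "activation_memory": p},
    }


if __name__ == "__main__":
    p, m = 4, 8
    for name, row in table_estimates(p, m).items():
        print(f"{name}: bubble rate {row['bubble_rate']:.3f}, "
              f"activation memory {row['activation_memory']:.2f}")
    # 1F1B with unit pass times: the longest stage takes (m + p - 1) * (F + B + W).
    print(bubble_rate((m + p - 1) * 3.0, 1.0, 1.0, 1.0, m))
```

With `p = 4` and `m = 8`, the last line agrees with the 1F1B table entry (p-1)/m.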