Upload fp32.sft
Received additional training on my 7900XTX (AMP bfloat16, SDPA):
* a number of unshuffled samples from 12 seconds to 24 seconds, to verify that my ROCm setup works.
* another number of shuffled samples from 3 seconds to 32 seconds, to re-"teach" the model to work for any duration rather than the last duration it was trained against.
* another number of shuffled samples from 3 seconds to 60 seconds but with an RVQ distribution favoring the higher levels, to see whether it lobotomizes the AR and cleans up the NAR's audio (it seems fine).
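
A rough sketch of what the three passes above could look like in code. This is not the actual training code: the 8 RVQ levels are assumed from the model name, the linear weighting (level + 1) is a made-up stand-in for "favoring the higher levels", and `select_samples` / `sample_rvq_level` are hypothetical helper names.

```python
import random

NUM_LEVELS = 8  # assumption: 8 RVQ levels, guessed from "ar+nar-llama-8"


def select_samples(samples, min_seconds, max_seconds, rng, shuffle=True):
    """Keep samples whose duration falls in [min_seconds, max_seconds]."""
    kept = [s for s in samples if min_seconds <= s["duration"] <= max_seconds]
    if shuffle:
        rng.shuffle(kept)  # in-place shuffle using the caller's RNG
    return kept


def level_weights(num_levels=NUM_LEVELS):
    """Hypothetical weighting: level k gets weight k + 1, so higher levels win more often."""
    return [level + 1 for level in range(num_levels)]


def sample_rvq_level(rng, num_levels=NUM_LEVELS):
    """Draw one RVQ level (0 = AR level, 1.. = NAR levels) from the skewed distribution."""
    levels = list(range(num_levels))
    return rng.choices(levels, weights=level_weights(num_levels), k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    dataset = [{"id": i, "duration": d} for i, d in enumerate([2, 5, 13, 20, 31, 45, 61])]
    # Third pass from the list above: shuffled, 3 to 60 seconds, skewed RVQ levels.
    for sample in select_samples(dataset, 3, 60, rng):
        print(sample["id"], sample["duration"], sample_rvq_level(rng))
```

With these assumed weights, level 7 is drawn eight times as often as level 0, so the AR level still gets some samples while most of the pass goes to the NAR levels.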
models/ckpt/ar+nar-llama-8/fp32.sft
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:2bbafd8afb5403c206c28f51ea3e872769dab8de99b5f441825ff31c893b0911
+size 455745602
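
The changed file is a Git LFS pointer (a `version` line, an `oid sha256:` line, and a `size` line), so a downloaded `fp32.sft` can be checked against it. A minimal sketch, assuming the weights sit at a local path of your choosing; `parse_lfs_pointer` and `verify_against_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def verify_against_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file's size and SHA-256 digest match the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    file_path = Path(path)
    # Cheap check first: a size mismatch means there is no need to hash.
    if file_path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with file_path.open("rb") as handle:
        # Hash in chunks so a ~455 MB checkpoint is never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_digest


POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:2bbafd8afb5403c206c28f51ea3e872769dab8de99b5f441825ff31c893b0911
size 455745602
"""

if __name__ == "__main__":
    # The local path is an assumption; point it at wherever the file was saved.
    print(verify_against_pointer("models/ckpt/ar+nar-llama-8/fp32.sft", POINTER))
```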