metadata
license: mit
This model is my starting point zero for trying to finetune model based on bitnet architecture. I just added new layers with random weights to the finished model.
Maybe it can be broken.
It is not recommended for use: the results show an improvement in test results at the margin of error.