Update README.md
The model was trained on one A100 GPU with the following hyperparameters:

| `total_batch_size` | 256 (~130K tokens per step) |
| `num_epochs` | 4 |

More details about fine-tuning can be found in the technical report (coming soon).
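As a quick sanity check on the table above (a sketch, not a figure from the report): dividing the ~130K tokens per step by the batch size gives the implied average number of tokens per sequence in each step.

```python
# Implied average tokens per sequence, derived from the hyperparameters above.
total_batch_size = 256      # sequences per optimizer step
tokens_per_step = 130_000   # ~130K tokens per step (from the table)

avg_tokens_per_sequence = tokens_per_step / total_batch_size
print(round(avg_tokens_per_sequence))  # ~508 tokens per sequence
```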
# Fine-tuning data

For tuning this model, we used 15K examples from the synthetically generated [Kotlin Exercises](https://huggingface.co/datasets/JetBrains/KExercises) dataset. Every example follows the HumanEval format. In total, the dataset contains about 3.5M tokens.
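The HumanEval format pairs a function-signature-plus-docstring prompt with a reference solution, a test, and an entry point. A sketch of what one Kotlin record might look like, using the field names from the original HumanEval schema; this specific record is invented for illustration and is not an actual KExercises example:

```python
# Illustrative HumanEval-style record (field names follow the original
# HumanEval schema; the Kotlin content is a made-up example).
example = {
    "prompt": (
        "/**\n"
        " * Return the sum of all even numbers in the list.\n"
        " */\n"
        "fun sumOfEvens(xs: List<Int>): Int {"
    ),
    "canonical_solution": "\n    return xs.filter { it % 2 == 0 }.sum()\n}",
    "test": "fun main() { check(sumOfEvens(listOf(1, 2, 3, 4)) == 6) }",
    "entry_point": "sumOfEvens",
}

# During evaluation, the model is given `prompt` and must complete the
# function body; the completion is then checked by compiling and running `test`.
print(example["entry_point"])  # prints "sumOfEvens"
```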
# Evaluation