BEE-spoke-data/Qwen2-1.5B-stepbasin-books

pszemraj committed on Jul 15, 2024

Commit 900c716 (verified) · 1 Parent(s): bb2a61f

Update README.md

Files changed (1)

README.md +7 -8

@@ -5,16 +5,15 @@ tags:
 - generated_from_trainer
 metrics:
 - accuracy
-model-index:
-- name: Qwen2-1.5B-stepbasin-books-vN
-  results: []
+datasets:
+- BEE-spoke-data/stepbasin-books
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/pszemraj/long-generation-tests/runs/ethp25f9)
-# Qwen2-1.5B-stepbasin-books-vN
+# Qwen2-1.5B-stepbasin-books
+> [!IMPORTANT]
+> this was finetuned at 16384 context length
 This model is a fine-tuned version of [Qwen/Qwen2-1.5B](https://huggingface.co/Qwen/Qwen2-1.5B) on the BEE-spoke-data/stepbasin-books dataset.
 It achieves the following results on the evaluation set:
@@ -64,4 +63,4 @@ The following hyperparameters were used during training:
 - Transformers 4.42.4
 - Pytorch 2.3.1+cu121
 - Datasets 2.20.0
-- Tokenizers 0.19.1
+- Tokenizers 0.19.1