GBaker committed on
Commit
d2e457d
1 Parent(s): d475c31

update model card README.md

Files changed (1)
  1. README.md +16 -11
README.md CHANGED
@@ -13,10 +13,10 @@ should probably proofread and complete it, then remove this comment. -->

  # nystromformer-4096-medqa-usmle-MiniLM-IR-cs

- This model is a fine-tuned version of [uw-madison/nystromformer-4096](https://huggingface.co/uw-madison/nystromformer-4096) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.3876
- - Accuracy: 0.2820

  ## Model description

@@ -43,17 +43,22 @@ The following hyperparameters were used during training:
  - total_train_batch_size: 128
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5

  ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | No log | 0.99 | 79 | 1.3863 | 0.2372 |
- | No log | 1.99 | 158 | 1.3861 | 0.2655 |
- | No log | 2.99 | 237 | 1.3859 | 0.2545 |
- | No log | 3.99 | 316 | 1.3837 | 0.2765 |
- | No log | 4.99 | 395 | 1.3876 | 0.2820 |


  ### Framework versions
 

  # nystromformer-4096-medqa-usmle-MiniLM-IR-cs

+ This model is a fine-tuned version of [GBaker/nystromformer-4096-medqa-usmle-MiniLM-IR-cs](https://huggingface.co/GBaker/nystromformer-4096-medqa-usmle-MiniLM-IR-cs) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.8436
+ - Accuracy: 0.2812

  ## Model description

 
  - total_train_batch_size: 128
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 10

  ### Training results

+ | Training Loss | Epoch | Step | Accuracy | Validation Loss |
+ |:-------------:|:-----:|:----:|:--------:|:---------------:|
+ | No log | 0.99 | 79 | 0.2372 | 1.3863 |
+ | No log | 1.99 | 158 | 0.2655 | 1.3861 |
+ | No log | 2.99 | 237 | 0.2545 | 1.3859 |
+ | No log | 3.99 | 316 | 0.2765 | 1.3837 |
+ | No log | 4.99 | 395 | 0.2820 | 1.3876 |
+ | No log | 5.99 | 474 | 0.2639 | 1.3819 |
+ | 1.3342 | 6.99 | 553 | 0.2694 | 1.4875 |
+ | 1.3342 | 7.99 | 632 | 0.2718 | 1.6126 |
+ | 1.3342 | 8.99 | 711 | 0.2804 | 1.7637 |
+ | 1.3342 | 9.99 | 790 | 0.2812 | 1.8436 |


  ### Framework versions
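
Review note on the training results in this diff: the sketch below is not part of the commit. It transcribes the ten table rows by hand and checks two things: which step has the lowest validation loss, and how the early losses compare with the cross-entropy of a uniform guess. Each pair of values is assigned to loss and accuracy so that the last row agrees with the reported final `Loss: 1.8436` and `Accuracy: 0.2812`. `NUM_OPTIONS = 4` is an assumption about the task (four answer choices per question); the model card does not state it.

```python
import math

# (epoch, step, validation_loss, accuracy), transcribed by hand from the table.
ROWS = [
    (0.99, 79, 1.3863, 0.2372),
    (1.99, 158, 1.3861, 0.2655),
    (2.99, 237, 1.3859, 0.2545),
    (3.99, 316, 1.3837, 0.2765),
    (4.99, 395, 1.3876, 0.2820),
    (5.99, 474, 1.3819, 0.2639),
    (6.99, 553, 1.4875, 0.2694),
    (7.99, 632, 1.6126, 0.2718),
    (8.99, 711, 1.7637, 0.2804),
    (9.99, 790, 1.8436, 0.2812),
]

# Assumption: four answer options per question (not stated in the model card).
NUM_OPTIONS = 4

# Cross-entropy and accuracy of a model that guesses uniformly at random.
chance_loss = math.log(NUM_OPTIONS)
chance_accuracy = 1.0 / NUM_OPTIONS

# Checkpoints with the lowest validation loss and the highest accuracy.
best_loss_row = min(ROWS, key=lambda row: row[2])
best_acc_row = max(ROWS, key=lambda row: row[3])
final_row = ROWS[-1]

# How far the final validation loss has drifted above the best one.
loss_increase = final_row[2] - best_loss_row[2]

if __name__ == "__main__":
    print(f"uniform-guess loss: {chance_loss:.4f}, accuracy: {chance_accuracy:.2f}")
    print(f"lowest validation loss: {best_loss_row[2]} at step {best_loss_row[1]}")
    print(f"highest accuracy: {best_acc_row[3]} at step {best_acc_row[1]}")
    print(f"final loss exceeds best by {loss_increase:.4f}")
```

Under these assumptions, the validation loss stays close to the uniform-guess value through step 474 and rises at every later evaluation, while accuracy stays within a few points of chance. If that reading holds, the checkpoint at step 474 may be a better candidate than the final one, though the card gives too little information to say so with confidence.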