Commit 9acabd8 by zhilinw (1 parent: fc6281e)

Update README.md

Files changed (1)
  1. README.md +14 -2
README.md CHANGED
@@ -2,8 +2,20 @@
  license: other
  license_name: nvidia-open-model-license
  license_link: LICENSE
+ library_name: nemo
+ language:
+ - en
+ inference: false
+ fine-tuning: false
+ tags:
+ - nvidia
+ - steerlm
+ - reward model
+ datasets:
+ - nvidia/HelpSteer2
  ---
 
+
  ## Nemotron-4-340B-Reward
 
  [![Model architecture](https://img.shields.io/badge/Model%20Arch-Transformer%20Decoder-green)](#model-architecture)[![Model size](https://img.shields.io/badge/Params-340B-green)](#model-architecture)[![Language](https://img.shields.io/badge/Language-Multilingual-green)](#datasets)
@@ -55,8 +67,8 @@ Nemotron-4 340B-Reward can be used in the alignment stage to align pretrained mo
  ### Required Hardware
 
  BF16 Inference:
- - 32x H100 (4x H100 Nodes)
- - 32x A100 (4x A100 80GB Nodes)
+ - 16x H100 (2x H100 Nodes)
+ - 16x A100 (2x A100 80GB Nodes)
 
  ### Usage:
 
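
As a rough sanity check on the revised "Required Hardware" lines, the sketch below estimates the memory needed for the BF16 weights alone and compares it with the aggregate memory of 32, 16, and 8 GPUs. The 340B parameter count comes from the model-size badge and the 80 GB figure from the "A100 80GB" line; treating an H100 as 80 GB as well is an assumption, and the helper names `weights_gb` and `fits` are hypothetical. The estimate ignores activations, KV cache, and framework overhead, so it shows only a lower bound on memory, not how the README's figures were derived.

```python
# Weights-only memory estimate for BF16 inference.
# Assumptions (not stated in the commit): 80 GB per GPU for both A100 and H100,
# and no allowance for activations, KV cache, or runtime overhead.

PARAMS = 340e9         # parameter count, from the "Params-340B" badge
BYTES_PER_PARAM = 2    # BF16 stores each value in 2 bytes
GPU_MEMORY_GB = 80     # per-GPU memory, from the "A100 80GB" line


def weights_gb(params: float = PARAMS, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


def fits(num_gpus: int, gpu_memory_gb: float = GPU_MEMORY_GB) -> bool:
    """True if the weights alone fit in the aggregate memory of num_gpus GPUs."""
    return weights_gb() <= num_gpus * gpu_memory_gb


if __name__ == "__main__":
    print(f"BF16 weights: {weights_gb():.0f} GB")
    for n in (32, 16, 8):
        total = n * GPU_MEMORY_GB
        print(f"{n:>2} GPUs ({total} GB total): weights fit = {fits(n)}")
```

Under these assumptions the weights occupy 680 GB, which fits within 16 x 80 GB = 1280 GB but not within a single 8-GPU node at 640 GB.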