metadata
license: llama2
language:
  - en
pipeline_tag: conversational

Goliath 120B

An auto-regressive causal language model created by merging two fine-tuned Llama-2 70B models into one.

Prompting Format

Both the Vicuna and Alpaca prompt formats will work, but because the initial and final layers belong primarily to Xwin, I expect Vicuna to work best.
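As a rough illustration of the two formats, here is a minimal sketch of prompt builders. This README does not specify the exact template text, so the system line and headers below are assumptions based on the commonly used Vicuna v1.1 and Alpaca templates, and the function names are hypothetical.

```python
# Assumed Vicuna v1.1-style system line; not specified by this README.
VICUNA_SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)

# Assumed Alpaca-style header (no-input variant); not specified by this README.
ALPACA_HEADER = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
)


def vicuna_prompt(user_message: str, system: str = VICUNA_SYSTEM) -> str:
    """Build a single-turn Vicuna-style prompt ending where the model should reply."""
    return f"{system} USER: {user_message} ASSISTANT:"


def alpaca_prompt(instruction: str) -> str:
    """Build a single-turn Alpaca-style prompt ending where the model should reply."""
    return f"{ALPACA_HEADER}\n\n### Instruction:\n{instruction}\n\n### Response:\n"


if __name__ == "__main__":
    print(vicuna_prompt("Summarize the plot of Hamlet in two sentences."))
    print("---")
    print(alpaca_prompt("Summarize the plot of Hamlet in two sentences."))
```

Either string can be passed as the raw prompt to whatever inference backend you use. If one format gives noticeably worse output, try the other.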

Merge process

The models used in the merge are Xwin and Euryale.

The layer ranges used are as follows:

- range 0, 16
  Xwin
- range 8, 24
  Euryale
- range 17, 32
  Xwin
- range 25, 40
  Euryale
- range 33, 48
  Xwin
- range 41, 56
  Euryale
- range 49, 64
  Xwin
- range 57, 72
  Euryale
- range 65, 80
  Xwin
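
To make the interleaving easier to inspect, the sketch below encodes the ranges above as data and counts the layers in the stacked result. It assumes each range is half-open (start inclusive, end exclusive), which this README does not state. The variable and function names are illustrative only.

```python
from collections import Counter

# (source model, start layer, end layer), copied from the list above.
# Assumption: ranges are half-open, i.e. [start, end).
SLICES = [
    ("Xwin", 0, 16),
    ("Euryale", 8, 24),
    ("Xwin", 17, 32),
    ("Euryale", 25, 40),
    ("Xwin", 33, 48),
    ("Euryale", 41, 56),
    ("Xwin", 49, 64),
    ("Euryale", 57, 72),
    ("Xwin", 65, 80),
]


def layers_per_model(slices):
    """Count how many layers each source model contributes to the stack."""
    counts = Counter()
    for model, start, end in slices:
        counts[model] += end - start
    return dict(counts)


per_model = layers_per_model(SLICES)
total_layers = sum(per_model.values())

if __name__ == "__main__":
    for model, start, end in SLICES:
        print(f"{model:8s} layers {start:2d}..{end - 1:2d} ({end - start} layers)")
    print(per_model)
    print("total layers:", total_layers)
```

Under the half-open assumption the slices stack to 137 layers, 76 from Xwin and 61 from Euryale. If the ranges are meant to be inclusive at both ends, each slice is one layer longer.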

Further details

Coming soon.

Benchmarks

Coming soon.