yard1 commited on
Commit
0d1ff44
1 Parent(s): d1ff2ed

Update `max_position_embeddings` to 4096

Browse files

The model has a 4096 tokens context length, which should be correctly reflected in the `config.json`.

See https://github.com/facebookresearch/codellama/blob/1af62e1f43db1fa5140fa43cb828465a603a48f3/llama/model.py#L277 in reference implementation (`self.params.max_seq_len * 2` where `self.params.max_seq_len == 2048`). Also confirmed offline with a Meta engineer.

Files changed (1) hide show
  1. config.json +1 -1
config.json CHANGED
@@ -10,7 +10,7 @@
10
  "hidden_size": 8192,
11
  "initializer_range": 0.02,
12
  "intermediate_size": 28672,
13
- "max_position_embeddings": 2048,
14
  "model_type": "llama",
15
  "num_attention_heads": 64,
16
  "num_hidden_layers": 80,
 
10
  "hidden_size": 8192,
11
  "initializer_range": 0.02,
12
  "intermediate_size": 28672,
13
+ "max_position_embeddings": 4096,
14
  "model_type": "llama",
15
  "num_attention_heads": 64,
16
  "num_hidden_layers": 80,