Birchlabs/mosaicml-mpt-7b-chat-qlora
Pipeline: Text Generation
Library: Transformers (PyTorch)
Datasets: 5 datasets
Tags: mpt, Composer, MosaicML, llm-foundry, custom_code, text-generation-inference
arXiv: 2205.14135, 2108.12409, 2010.04245
License: cc-by-nc-sa-4.0
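The custom_code tag means loading this repo executes its bundled MPT modeling files, so trust_remote_code=True is required. Below is a minimal loading sketch; the 4-bit quantization settings are assumptions reflecting the fork's QLoRA focus, not values taken from this repo's README.

```python
# Minimal loading sketch. Assumptions: 4-bit NF4 settings (typical for
# QLoRA, not specified by this repo) and recent transformers/bitsandbytes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

repo = "Birchlabs/mosaicml-mpt-7b-chat-qlora"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(
    repo,
    quantization_config=bnb_config,
    device_map="auto",        # supported via the blocks.py patch in this fork
    trust_remote_code=True,   # executes the repo's custom MPT modeling code
)
```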
Files and versions (branch: main)
7 contributors · History: 24 commits
Latest commit (ec8bed8, unverified, over 1 year ago) by Alex Birch: apply device-transfer patch from https://github.com/mosaicml/llm-foundry/pull/225/files
File                              Size       Last commit message                                  Last modified
.gitattributes                    1.48 kB    initial commit                                       over 1 year ago
README.md                         7.22 kB    Update README.md                                     over 1 year ago
adapt_tokenizer.py                1.75 kB    Upload folder using huggingface_hub                  over 1 year ago
attention.py                      23.7 kB    prefer NamedTuple                                    over 1 year ago
blocks.py                         2.65 kB    device_map / checkpointing / kwargs patch [1]        over 1 year ago
config.json                       1.23 kB    Upload folder using huggingface_hub                  over 1 year ago
configuration_mpt.py              9.08 kB    Upload folder using huggingface_hub                  over 1 year ago
flash_attn_triton.py              28.2 kB    add flash_attn_triton.py (#9) [2]                    over 1 year ago
generation_config.json            91 Bytes   Upload folder using huggingface_hub                  over 1 year ago
hf_prefixlm_converter.py          27.2 kB    Upload folder using huggingface_hub                  over 1 year ago
is_torch_version.py               2.39 kB    device_map / checkpointing / kwargs patch [1]        over 1 year ago
meta_init_context.py              3.64 kB    Upload folder using huggingface_hub                  over 1 year ago
modeling_mpt.py                   20.3 kB    apply device-transfer patch from llm-foundry PR 225  over 1 year ago
norm.py                           2.56 kB    Upload folder using huggingface_hub                  over 1 year ago
param_init_fns.py                 12.6 kB    Upload folder using huggingface_hub                  over 1 year ago
pytorch_model-00001-of-00002.bin  9.94 GB    Upload folder using huggingface_hub (LFS) [3]        over 1 year ago
pytorch_model-00002-of-00002.bin  3.36 GB    Upload folder using huggingface_hub (LFS) [3]        over 1 year ago
pytorch_model.bin.index.json      16 kB      Upload folder using huggingface_hub                  over 1 year ago
special_tokens_map.json           174 Bytes  Upload folder using huggingface_hub                  over 1 year ago
tokenizer.json                    2.11 MB    Upload folder using huggingface_hub                  over 1 year ago
tokenizer_config.json             237 Bytes  Upload folder using huggingface_hub                  over 1 year ago

[1] blocks.py and is_torch_version.py share one commit, whose full message reads: "add support for AutoModelForCausalLM#from_pretrained()'s device_map='auto'. support gradient checkpointing, probably. add lots of type hints so I could understand what's going on. multiline long method signatures/calls (for easier comparison between checkpointed/non-checkpointed variants, and because these lines got even longer when I added type hints). make MPTForCausalLM#forward accept additional kwargs, since PeftModelForCausalLM#forward tries to send it an argument inputs_embeds=None, which it didn't like too much." The forward-signature change is sketched after this listing.
[2] The Triton FlashAttention kernel (arXiv:2205.14135). A sketch of selecting it at load time follows the listing.
[3] Both weight shards are pickle-serialized checkpoints; Hugging Face's pickle scanner detects three imports in each: torch.BFloat16Storage, torch._utils._rebuild_tensor_v2, and collections.OrderedDict. A restricted-unpickling sketch follows the listing.