Update README.md
language:
- ja
---

# Japanese StableLM Base Gamma 7B
## Model Description

This is a 7B-parameter decoder-only language model with a focus on maximizing Japanese language modeling performance and Japanese downstream task performance.
We conducted continued pretraining using Japanese data on the English language model, [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1), to transfer the model's knowledge and capabilities to Japanese.

*If you are looking for an instruction-following model, check [Japanese StableLM Instruct Gamma 7B](https://huggingface.co/stabilityai/japanese-stablelm-instruct-gamma-7b).*

## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("stabilityai/japanese-stablelm-base-gamma-7b")
model = AutoModelForCausalLM.from_pretrained(
    "stabilityai/japanese-stablelm-base-gamma-7b",
    trust_remote_code=True,
    torch_dtype="auto",
)
```
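Once the model and tokenizer are loaded, generation follows the standard `transformers` pattern. The snippet below is a minimal sketch: the prompt string and the sampling parameters are illustrative assumptions, not part of the model card.

```python
# Minimal generation sketch; the prompt and sampling settings are illustrative.
prompt = "吾輩は猫である。"  # hypothetical prompt, replace with your own

input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(model.device)

tokens = model.generate(
    input_ids,
    max_new_tokens=128,  # placeholder generation length
    do_sample=True,
    temperature=0.8,     # placeholder sampling temperature
)
print(tokenizer.decode(tokens[0], skip_special_tokens=True))
```

With `torch_dtype="auto"` the weights load in the checkpoint's native precision; if a GPU is available, move the model there with `model.to("cuda")` before generating.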

## Model Details

* **Developed by**: [Stability AI](https://stability.ai/)
* **Model type**: `Japanese StableLM Base Gamma 7B` is an auto-regressive language model based on the transformer decoder architecture.
* **Language(s)**: Japanese
* **License**: This model is licensed under the [Apache License, Version 2.0](https://www.apache.org/licenses/LICENSE-2.0).
* **Contact**: For questions and comments about the model, please email `lm@stability.ai`.