sarath-shekkizhar committed 75d20ad (parent: 602f82f): adding model card

README.md
---
license: apache-2.0
base_model: openchat/openchat_3.5
---

# TenyxChat: Language Model Alignment using Tenyx Fine-tuning

Introducing TenyxChat, a series of ChatGPT-like models trained to function as us…

We fine-tune [Openchat-3.5](https://arxiv.org/pdf/2309.11235.pdf) with our proprietary approach ([blog](https://www.tenyx.com/post/forgetting-and-toxicity-in-llms-a-deep-dive-on-fine-tuning-methods), [service](https://www.tenyx.com/fine-tuning)), which shows an increase in [MT-Bench](https://arxiv.org/abs/2306.05685) scores without a drop in the model's performance on other benchmarks. Our approach aims to mitigate forgetting in LLMs in a computationally efficient manner, thereby enabling continual fine-tuning without altering the pre-trained output distribution. TenyxChat-7B-v1 was trained on eight A100s (80 GB) for two hours, with a training setup adapted from HuggingFaceH4 ([GitHub](https://github.com/huggingface/alignment-handbook)).
# Model details

- Model type: Fine-tuned 7B model for chat.
- License: Apache 2.0
- Base model: OpenChat 3.5 ([https://huggingface.co/openchat/openchat_3.5](https://huggingface.co/openchat/openchat_3.5))
- Demo: [Hugging Face Space](https://huggingface.co/spaces/tenyx/TenyxChat-7B-v1)

## Usage
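As a minimal sketch of usage, the snippet below builds a chat prompt, assuming TenyxChat-7B-v1 inherits the OpenChat-3.5 conversation format from its base model; the `build_openchat_prompt` helper and its role tags are illustrative assumptions, not part of the model card.

```python
# Hypothetical helper: formats a conversation in the OpenChat-3.5 style
# ("GPT4 Correct User/Assistant" turns separated by <|end_of_turn|>),
# which TenyxChat-7B-v1 is assumed to inherit from openchat/openchat_3.5.
def build_openchat_prompt(turns):
    """turns: list of (role, message) pairs, role in {"user", "assistant"}."""
    parts = []
    for role, message in turns:
        tag = "GPT4 Correct User" if role == "user" else "GPT4 Correct Assistant"
        parts.append(f"{tag}: {message}<|end_of_turn|>")
    # Leave the assistant tag open so the model generates the reply.
    parts.append("GPT4 Correct Assistant:")
    return "".join(parts)

prompt = build_openchat_prompt([("user", "Hello, who are you?")])
# prompt == "GPT4 Correct User: Hello, who are you?<|end_of_turn|>GPT4 Correct Assistant:"
```

The resulting string can then be passed to a standard `transformers` text-generation pipeline pointed at the model repository (assumed here to be `tenyx/TenyxChat-7B-v1`, following the demo Space name).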