kevin009 committed on
Commit 2b45530 · verified · 1 Parent(s): 636d711
Files changed (1)
  1. README.md +23 -28
README.md CHANGED
@@ -1,6 +1,8 @@
  ---
  language:
  - en
  ---
  # Model Card for BabyLlama v0.6

@@ -9,12 +11,15 @@ language:
  **Repository:** kevin009/babyllama-v0.6
  **Architecture:** LlamaForCausalLM, based on TinyLlama 1.1b
  **Model Type:** llama
- **Version:** 0.5

  ## Model Description
- BabyLlama v0.6 is an innovative conversational AI model, specialized in generating creative and humorous interactions. While it's built on the Llama2 architecture and specifically draws from the TinyLlama 1.1b, this version sets itself apart by not strictly adhering to user instructions. Instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on creativity and humor.

- With a Combining RLHF and DPO fine-tuning involved 5 different epochs, with 200 steps in each epoch, applied to over half a million conversations in low learrning rate.

  ## Technical Specifications

@@ -30,11 +35,6 @@ With a Combining RLHF and DPO fine-tuning involved 5 different epochs, with 200
  ## Use Cases
  This model excels in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.

- BabyLlama v0.6 is particularly adept for applications within the realm of role-playing games (RPGs), interactive storytelling, and simulation-based training where dynamic, engaging, and character-driven dialogues are essential. It can serve as an AI companion or NPC (non-player character) that interacts with users in a variety of scenarios, ranging from fantasy and adventure to everyday social simulations. Its unique conversational style enhances the immersive experience by providing responses that are not only contextually relevant but also infused with creativity and humor, thereby elevating the narrative depth and engagement of the role-playing environment.
-
-
- # Example usage for playful interaction
-

  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -51,41 +51,36 @@ scenario_prompt = "User: Can you teach me how to plant a magical garden? \n\n Bo

  #### For more factual responses, use an assistant role, for example:

- User: What U.S. state produces the most peaches?
-
- Assistant: California.


- User: What U.S. state produces the most peaches?
-
- AI: Oh yeah, North Carolina produces the most peaches in the U.S.


  #### For more playful interaction:

- User: What U.S. state produces the most peaches?
-
- Psychic: New York, which produces over 75% of the U.S. Peach crop.
-

- User: Are you an artificial intelligence?

- Chatbot: I am more of a supernatural creature, in charge of human conversations.

- User: Were unicorns easily caught in medieval times?
-
- Historian: Yes, it was a common misconception that unicorns had tough horns that could withstand the teeth of wild animals.


  ## Limitations and Considerations
- Due to its design for generating creative and humorous content, BabyLlama v0.5 might not strictly follow provided instructions, reflecting its unique training approach. Users should be mindful of its propensity for unexpected outputs and incorporate suitable moderation or guidance mechanisms as necessary.
- Limitations and Considerations

- BabyLlama v0.6's focus on playful and fictional dialogues means it may not be suitable for applications requiring factual accuracy or serious outcomes. Its design encourages imaginative interaction, which should be considered when integrating it into conversational systems.
- BabyLlama v0.6 might not strictly follow provided instructions, reflecting its unique training approach, Users should be mindful of its propensity for unexpected outputs and incorporate suitable moderation or guidance mechanisms as necessary.

  ## Acknowledgments
- The development of BabyLlama v0.5 reflects a significant effort to push the boundaries of conversational AI, drawing from the foundational TinyLlama 1.1b model and incorporating advanced fine-tuning techniques to achieve its distinctive capabilities.

  ## Version History
  - **v0.5:** Enhanced for creativity and humor in conversations, diverging from strict instruction adherence to offer a unique conversational experience.
 
  ---
  language:
  - en
+ datasets:
+ - Anthropic/hh-rlhf
  ---
  # Model Card for BabyLlama v0.6

  **Repository:** kevin009/babyllama-v0.6
  **Architecture:** LlamaForCausalLM, based on TinyLlama 1.1b
  **Model Type:** llama
+ **Version:** 0.5

  ## Model Description

+ It uses RLHF and DPO to mimic a playful, human-like, and creative conversational style. It has not been fine-tuned to be a helpful assistant, and it does not embody typical safety mechanisms.
+
+ BabyLlama v0.6 is built on the Llama2 architecture and specifically draws from TinyLlama 1.1b. This version sets itself apart by not strictly adhering to user instructions; instead, it aims to replicate human-like conversation in a manner that's distinctly recognizable from actual human dialogue, focusing on playfulness and humor.
+
+ Fine-tuning combined RLHF and DPO over 5 epochs of 200 steps each, applied to over half a million conversations at a low learning rate. Further details will be added when the initial tests are completed.
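
As an editorial illustration of the preference-tuning step described above (not the author's training code), the sketch below loads the Anthropic/hh-rlhf preference pairs listed in this card's metadata and spells out the DPO objective that such fine-tuning optimizes; the function name and the `beta` value are assumptions.

```python
# Editorial sketch, not the author's training code: the DPO objective behind the
# preference fine-tuning described above, with Anthropic/hh-rlhf as the pair source.
import torch
import torch.nn.functional as F
from datasets import load_dataset

# Each record pairs a preferred ("chosen") and a dispreferred ("rejected") conversation.
preference_pairs = load_dataset("Anthropic/hh-rlhf", split="train")

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """DPO loss for a batch of preference pairs.

    Inputs are summed token log-probabilities of each response under the model being
    tuned (policy) and under a frozen reference copy; beta = 0.1 is a placeholder.
    """
    policy_margin = policy_chosen_logps - policy_rejected_logps
    reference_margin = ref_chosen_logps - ref_rejected_logps
    # Reward widening the chosen-vs-rejected margin relative to the reference model.
    return -F.logsigmoid(beta * (policy_margin - reference_margin)).mean()
```

In practice the log-probabilities come from the TinyLlama-based policy and a frozen reference model; libraries such as TRL wrap this loop, though the card does not specify the exact tooling used.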

  ## Technical Specifications

  ## Use Cases
  This model excels in applications where engaging, entertaining, and uniquely human-distinguishable AI responses are valued. It is particularly suited for chatbots, entertainment platforms, interactive games, and social experiments where the focus is on creativity, humor, and the unexpected.

  ```python
  from transformers import AutoModelForCausalLM, AutoTokenizer

  #### For more factual responses, use an assistant role, for example:

+ > User: What U.S. state produces the most peaches?
+ > AI_Assistant: California.


+ > User: What U.S. state produces the most peaches?
+ > AI: Oh yeah, North Carolina produces the most peaches in the U.S.


  #### For more playful interaction:

+ > User: What U.S. state produces the most peaches?
+ > Psychic: New York, which produces over 75% of the U.S. Peach crop.


+ > User: Are you an artificial intelligence?
+ > Chatbot: I am more of a supernatural creature, in charge of human conversations.

+ > User: Were unicorns easily caught in medieval times?
+ > Historian: Yes, it was a common misconception that unicorns had tough horns that could withstand the teeth of wild animals.
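
The role-prefix pattern in these examples can be reproduced with the standard transformers generation API. The following is a minimal editorial sketch, assuming the kevin009/babyllama-v0.6 checkpoint named in this card; the sampling settings are illustrative, not values recommended by the author.

```python
# Editorial sketch: steer the persona by prefixing the reply with a role name,
# as in the examples above. Sampling settings are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kevin009/babyllama-v0.6"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "User: What U.S. state produces the most peaches? \n\n Psychic:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=True,   # sampling preserves the playful, less deterministic style
    temperature=0.9,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Swapping "Psychic:" for "AI_Assistant:" or "Historian:" changes the persona of the reply, mirroring the factual and playful examples above.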
 
 
  ## Limitations and Considerations

+ BabyLlama v0.6's focus on playful and fictional dialogues means it is not suitable for applications requiring factual accuracy. Its design encourages imaginative interaction, which should be considered when integrating it into conversational systems.
+ BabyLlama v0.6 might not strictly follow provided instructions or apply typical safety mechanisms, reflecting its unique training approach.

  ## Acknowledgments
+
+ - TinyLlama 1.1b model
+ - Anthropic/hh-rlhf dataset

  ## Version History
  - **v0.5:** Enhanced for creativity and humor in conversations, diverging from strict instruction adherence to offer a unique conversational experience.