Locutusque committed
Commit 390fde4
Parent: c5107db

Change to a nicer HTML- and CSS-styled model card

Files changed (1)
  1. README.md +154 -54
README.md CHANGED
@@ -6,58 +6,158 @@ datasets:
  language:
  - en
  ---
- # Hercules-Qwen1.5-14B
 
- <!-- Provide a quick summary of what the model is/does. -->
- We fine-tuned Qwen1.5-14B on Locutusque's Hercules-v4. This is M4-ai's new flagship model, with unparalleled levels of performance.
-
-
- ## Model Details
-
- ### Model Description
-
- <!-- Provide a longer summary of what this model is. -->
-
- This model has capabilities in math, coding, function calling, roleplay, and more. We fine-tuned it using 700,000 examples of Hercules-v4.
-
- - **Developed by:** M4-ai
- - **Language(s) (NLP):** English and maybe Chinese
- - **License:** tongyi-qianwen license
- - **Finetuned from model:** [Qwen1.5-14B](https://huggingface.co/Qwen/Qwen1.5-14B)
-
- ## Uses
-
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-
- General purpose assistant, question answering, chain-of-thought, etc..
-
- ### Recommendations
-
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
-
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
-
- ## Evaluation
- Coming soon
-
-
- ## Training Details
-
- ### Training Data
-
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
- https://huggingface.co/datasets/Locutusque/hercules-v4.0
-
-
- #### Training Hyperparameters
-
- - **Training regime:** bf16 non-mixed precision
- ## Technical Specifications
-
- #### Hardware
-
- We used 8 Kaggle TPUs, and we trained at a global batch size of 128 and sequence length of 1024
-
- ## Contributions
-
- Thanks to @Tonic, @aloobun, @fhai50032, and @Locutusque for their contributions to this model.
+ <style>
+ body {
+ font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
+ line-height: 1.6;
+ color: #f5f5f5;
+ background-color: #1e2a36;
+ margin: 0;
+ padding: 0;
+ }
+
+ .container {
+ max-width: 1200px;
+ margin: 20px auto;
+ padding: 20px;
+ background-color: #2a3f54;
+ border-radius: 8px;
+ box-shadow: 0 4px 8px rgba(0, 0, 0, 0.1);
+ display: flex;
+ flex-wrap: wrap;
+ justify-content: space-between;
+ }
+
+ h1 {
+ font-size: 2.5rem;
+ color: #51a3d3;
+ text-align: center;
+ margin-bottom: 30px;
+ width: 100%;
+ }
+
+ h2 {
+ font-size: 1.75rem;
+ margin: 20px 0;
+ color: #63b8ea;
+ padding-bottom: 10px;
+ }
+
+ h3 {
+ font-size: 1.25rem;
+ color: #80c8f4;
+ }
+
+ p, a {
+ font-size: 1rem;
+ }
+
+ p {
+ color: #b0c2ce;
+ margin-bottom: 20px;
+ }
+
+ ul {
+ list-style-type: none;
+ padding: 0;
+ display: flex;
+ flex-wrap: wrap;
+ justify-content: space-between;
+ width: 100%;
+ }
+
+ li {
+ background-color: #34495e;
+ padding: 20px;
+ margin-bottom: 10px;
+ border-radius: 4px;
+ cursor: pointer;
+ transition: background-color 0.3s ease, color 0.3s ease;
+ overflow: hidden;
+ color: #b0c2ce;
+ width: calc(50% - 10px);
+ box-shadow: 0 2px 4px rgba(0, 0, 0, 0.1);
+ }
+
+ li:hover {
+ background-color: #4e6a81;
+ color: #dfe8f1;
+ }
+
+ .section-content {
+ margin-top: 15px;
+ border-top: 1px solid #4e6a81;
+ padding-top: 10px;
+ }
+
+ a {
+ color: #a4c8e1;
+ text-decoration: none;
+ }
+
+ a:hover {
+ text-decoration: underline;
+ }
+
+ pre {
+ background-color: #2c3e50;
+ padding: 10px;
+ border-radius: 5px;
+ overflow-x: auto;
+ color: #b0c2ce;
+ }
+ </style>
+ <div class="container">
+ <h1>Hercules-Qwen1.5-14B</h1>
+ </div>
+
+ <ul>
+ <li>
+ <h2>Model Details</h2>
+ <div class="section-content">
+ <h3>Model Description</h3>
+ <p>This model has capabilities in math, coding, function calling, roleplay, and more. We fine-tuned it on 700,000 examples from Hercules-v4.</p>
+ <p><strong>Developed by:</strong> M4-ai</p>
+ <p><strong>Language(s) (NLP):</strong> English, and possibly Chinese</p>
+ <p><strong>License:</strong> tongyi-qianwen license</p>
+ <p><strong>Finetuned from model:</strong> <a href="https://huggingface.co/Qwen/Qwen1.5-14B">Qwen1.5-14B</a></p>
+ </div>
+ </li>
+ <li>
+ <h2>Uses</h2>
+ <div class="section-content">
+ <p>General-purpose assistant, question answering, chain-of-thought reasoning, etc.</p>
+ <h3>Recommendations</h3>
+ <p>Users (both direct and downstream) should be made aware of the risks, biases, and limitations of the model. More information is needed for further recommendations.</p>
+ </div>
+ </li>
+ <li>
+ <h2>Evaluation</h2>
+ <div class="section-content">
+ <p>Coming soon</p>
+ </div>
+ </li>
+ <li>
+ <h2>Training Details</h2>
+ <div class="section-content">
+ <h3>Training Data</h3>
+ <p><a href="https://huggingface.co/datasets/Locutusque/hercules-v4.0">https://huggingface.co/datasets/Locutusque/hercules-v4.0</a></p>
+ <h4>Training Hyperparameters</h4>
+ <p><strong>Training regime:</strong> bf16 non-mixed precision</p>
+ </div>
+ </li>
+ <li>
+ <h2>Technical Specifications</h2>
+ <div class="section-content">
+ <h4>Hardware</h4>
+ <p>We trained on 8 Kaggle TPUs at a global batch size of 128 and a sequence length of 1024.</p>
+ </div>
+ </li>
+ <li>
+ <h2>Contributions</h2>
+ <div class="section-content">
+ <p>Thanks to @Tonic, @aloobun, @fhai50032, and @Locutusque for their contributions to this model.</p>
+ </div>
+ </li>
+ </ul>
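The new card does not include a usage snippet. As a minimal sketch only: this assumes the fine-tune follows the ChatML prompt format commonly used with Qwen1.5 models (the Hercules fine-tunes generally do, but verify against the repo), and any repo id or helper name here is illustrative, not from the card.

```python
# Hedged sketch: build a ChatML-style prompt string by hand.
# ASSUMPTION: this fine-tune expects ChatML (<|im_start|>/<|im_end|>)
# turns, as is common for Qwen1.5-based chat models.

def build_chatml_prompt(messages):
    """Render a list of {"role": ..., "content": ...} dicts as ChatML text."""
    parts = []
    for msg in messages:
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>")
    # Leave the assistant turn open so the model generates the reply.
    parts.append("<|im_start|>assistant\n")
    return "\n".join(parts)

prompt = build_chatml_prompt([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is 2 + 2?"},
])
print(prompt)
```

In practice, if the repository ships a chat template, prefer `tokenizer.apply_chat_template(...)` from `transformers` over hand-building the string; the sketch above only shows what the rendered prompt would look like.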