Migel Tissera committed
Commit: 5f5acc4
Parent(s): 6889a8a
image added
README.md CHANGED
@@ -2,6 +2,8 @@
 license: apache-2.0
 ---
 
+![HelixNet](https://huggingface.co/migtissera/HelixNet/resolve/main/HelixNet.png)
+
 # HelixNet
 
 HelixNet is a Deep Learning architecture consisting of three Mistral-7B LLMs: an `actor`, a `critic`, and a `regenerator`. The `actor` produces an initial response to a given system-context and question. The `critic` then takes the (system-context, question, response) tuple as input and critiques the answer; its job is not to criticize, but to provide an intelligent critique so that the answer can be modified and regenerated to address the question better. Finally, the `regenerator` takes the (system-context, question, response, critique) tuple and regenerates the answer.
@@ -30,7 +32,7 @@ Using the above training dataset, a Mistral-7B was fine-tuned.
 
 A third LLM was fine-tuned using the above data.
 
 
-# Reusability of the
+# Reusability of the critic and the regenerator
 
 The `critic` and the `regenerator` were tested not only with the accompanying actor model, but with the 13B and 70B SynthIA models as well. They appear to be readily transferable, since the function they have learned is to provide an intelligent critique and then a regeneration of the original response. Please feel free to try out other models as the `actor`; however, the architecture works best with all three models as presented here in HelixNet.
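The actor → critic → regenerator loop described in the README above can be sketched in plain Python. This is a minimal sketch, not the repository's actual inference code: `helixnet_round` is a hypothetical helper name, and the three lambdas are toy stand-ins for the fine-tuned Mistral-7B models (any chat-capable LLM call, such as a 13B/70B SynthIA actor, could be substituted).

```python
# Sketch of the HelixNet actor -> critic -> regenerator pipeline.
# The callables below are placeholders for the three fine-tuned models.

def helixnet_round(system_context, question, actor, critic, regenerator):
    """Run one actor -> critic -> regenerator pass and return the final answer."""
    # 1. The actor answers from (system-context, question).
    response = actor(system_context, question)
    # 2. The critic reviews the (system-context, question, response) tuple.
    critique = critic(system_context, question, response)
    # 3. The regenerator rewrites the answer from the full tuple.
    return regenerator(system_context, question, response, critique)

# Toy stand-ins so the sketch runs without any model weights.
actor = lambda ctx, q: f"draft answer to: {q}"
critic = lambda ctx, q, r: f"critique of: {r}"
regenerator = lambda ctx, q, r, c: f"revised ({c})"

final = helixnet_round("You are a helpful assistant.", "What is HelixNet?",
                       actor, critic, regenerator)
print(final)
```

Because each stage only consumes the tuple produced so far, swapping in a different `actor` (as the README suggests) requires no change to the `critic` or `regenerator` calls.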