Update README.md
README.md
- **License:** Apache 2.0
- **Finetuned from model [optional]:** BERT-based model, fine-tuning methodology described below.

## Model Use

```python
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("llmware/industry-bert-contracts-v0.1")
model = AutoModel.from_pretrained("llmware/industry-bert-contracts-v0.1")
```
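Loading the model as shown returns a base `AutoModel`, so producing sentence-level embeddings requires a pooling step. The card does not specify a pooling strategy, so the sketch below assumes mean pooling over non-padding tokens, a common default for BERT-based embedding models; the example sentence is illustrative only.

```python
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("llmware/industry-bert-contracts-v0.1")
model = AutoModel.from_pretrained("llmware/industry-bert-contracts-v0.1")

def embed(texts):
    """Return one vector per input text by mean-pooling the last hidden
    state over non-padding tokens (pooling strategy assumed, not specified
    by the model card)."""
    inputs = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # Mask out padding positions before averaging.
    mask = inputs["attention_mask"].unsqueeze(-1).float()
    summed = (outputs.last_hidden_state * mask).sum(dim=1)
    counts = mask.sum(dim=1).clamp(min=1e-9)
    return summed / counts  # shape: (batch_size, hidden_size)

vectors = embed(["This Agreement shall be governed by the laws of Delaware."])
```

The resulting vectors can then be compared with cosine similarity or indexed in a vector database for semantic retrieval over contract text.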

## Bias, Risks, and Limitations

This is a semantic embedding model, fine-tuned on public domain contracts and related documents. Results may vary if used outside of this domain, and like any embedding model, there is always the potential for anomalies in the vector embedding space. No specific safeguards have been put in place to ensure safety or to mitigate potential bias in the dataset.

### Training Procedure