Text Generation
Transformers
Safetensors
mistral
axolotl
Generated from Trainer
Mistral
instruct
finetune
chatml
gpt4
synthetic data
science
physics
chemistry
biology
math
conversational
Eval Results
Inference Endpoints
text-generation-inference
Einstein-v5-v0.2-7B / data /remove_empty_output.py
Weyaxi's picture
adding the exact data used to train this model
ada48f7 verified
import json
with open('merged_all.json', 'r') as file:
data = json.load(file)
print(f"Normal len: {len(data)}")
data = [row for row in data if row["output"] != ""]
print(f"After len: {len(data)}")
with open('merged_all.json', 'w') as file:
json.dump(data, file, indent=1)