Experimental Chess Model (Causal)

Overview

This model is an experimental fine-tuned variant designed for causal inference on a very small subset of chess games. It leverages the base model obtained from Microsoft(phi-3-mini-4k-instruct) and has been fine-tuned using Hugging Face Transformers with the Accelerate library.

Key Details

Task: Causal inference on chess games
Base Model: phi-3-mini-4k-instruct
Fine-Tuning Framework: Hugging Face Transformers with Accelerate and peft
License: MIT

Description

The primary purpose of this model is to explore causal relationships within chess games. It was trained on a limited dataset, making it suitable for experimentation and research. While its performance may not match larger-scale models, it serves as a starting point for causal analysis in the chess games. It also gives us an insight on how causal models react to high level chess games (2000> ELO).

Limitations

Small Dataset: Due to the limited data, the model's generalization capabilities are restricted.
Experimental Nature: This model is not production-ready and should be used for research purposes only.
Causal Interpretation: Interpretation of causal effects requires careful consideration and domain expertise.

Usage

will be updated shortly !!!

Metrics

global_step=2795, training_loss=0.15753029228749557, metrics={'train_runtime': 7548.9262, 'train_samples_per_second': 0.37, 'train_steps_per_second': 0.37, 'total_flos': 4.255669870466458e+16, 'train_loss': 0.15753029228749557, 'epoch': 1.0, 'num_input_tokens_seen': 1892547} will be updated shortly !!!

Author

Author: @bhuvanmdev (GitHub profile)

The main authors of the base model can be found Here

Consider having a read at the original model card to understand the biases,limitations and other necessary details.

It's one of my first systematically fine-tuned model, Feel free to experiment with this model and contribute to its development! ;) THANK YOU

bhuvanmdev
/

phi3-7b-chess-beta