Edit model card

Experimental Chess Model (Causal)

Overview

This model is an experimental fine-tuned variant designed for causal inference on a very small subset of chess games. It leverages the base model obtained from Microsoft(phi-3-mini-4k-instruct) and has been fine-tuned using Hugging Face Transformers with the Accelerate library.

Key Details

  • Task: Causal inference on chess games
  • Base Model: phi-3-mini-4k-instruct
  • Fine-Tuning Framework: Hugging Face Transformers with Accelerate and peft
  • License: MIT

Description

The primary purpose of this model is to explore causal relationships within chess games. It was trained on a limited dataset, making it suitable for experimentation and research. While its performance may not match larger-scale models, it serves as a starting point for causal analysis in the chess games. It also gives us an insight on how causal models react to high level chess games (2000> ELO).

Limitations

  • Small Dataset: Due to the limited data, the model's generalization capabilities are restricted.
  • Experimental Nature: This model is not production-ready and should be used for research purposes only.
  • Causal Interpretation: Interpretation of causal effects requires careful consideration and domain expertise.

Usage

will be updated shortly !!!

Metrics

global_step=2795, training_loss=0.15753029228749557, metrics={'train_runtime': 7548.9262, 'train_samples_per_second': 0.37, 'train_steps_per_second': 0.37, 'total_flos': 4.255669870466458e+16, 'train_loss': 0.15753029228749557, 'epoch': 1.0, 'num_input_tokens_seen': 1892547} will be updated shortly !!!

Author

The main authors of the base model can be found Here

Consider having a read at the original model card to understand the biases,limitations and other necessary details.

It's one of my first systematically fine-tuned model, Feel free to experiment with this model and contribute to its development! ;) THANK YOU


Downloads last month

-

Downloads are not tracked for this model. How to track
Unable to determine this model's library. Check the docs .

Dataset used to train bhuvanmdev/phi3-7b-chess-beta