This is the official checkpoint of the feedback model trained with COFFEE-GYM using the PPO strategy.
Given a piece of erroneous code, this model generates natural language feedback on it.
For further details, please see our paper.
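
A minimal usage sketch follows, under several assumptions that this card does not confirm: that the checkpoint loads as a causal language model through the `transformers` library, that `MODEL_ID` (a placeholder here) is replaced with this repository's id, and that a plain problem/code/feedback prompt is acceptable. The prompt layout in `build_prompt` is hypothetical; the template used in training may differ, so check the paper before relying on it.

```python
# Hypothetical placeholder: replace with the actual repository id of this checkpoint.
MODEL_ID = "<this-model-repo-id>"


def build_prompt(problem: str, wrong_code: str) -> str:
    """Assemble an illustrative prompt (assumed layout, not the official template)."""
    return (
        "Problem description:\n"
        f"{problem.strip()}\n\n"
        "Incorrect code:\n"
        f"{wrong_code.strip()}\n\n"
        "Feedback:"
    )


def generate_feedback(problem: str, wrong_code: str,
                      model_id: str = MODEL_ID, max_new_tokens: int = 256) -> str:
    """Generate feedback text, assuming the checkpoint is a causal LM."""
    # Imported lazily so build_prompt can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(problem, wrong_code), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate_feedback("Print the sum of a and b.", "print(a - b)")` would then return the model's feedback as a string, provided `MODEL_ID` points to a real checkpoint.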