---
license: afl-3.0
tags:
- medical
---
# ClinicalGPT

This model card introduces ClinicalGPT, a large language model designed and optimized for clinical scenarios. ClinicalGPT is fine-tuned on extensive and diverse medical datasets, including medical records, domain-specific knowledge, and multi-round dialogue consultations. The model is updated on an ongoing basis.

## Model Fine-tuning

We fine-tuned the model for 3 epochs with a learning rate of 5e-5, a batch size of 128, and a maximum sequence length of 1,024.
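
To give a feel for what these settings imply, the sketch below collects them in a small config object and estimates the number of optimizer steps. It assumes that 128 is the effective (global) batch size and that a final partial batch is kept; the card states neither. `FineTuneConfig`, `total_optimizer_steps`, and the dataset size of 100,000 are hypothetical names and values used only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    """Hyperparameters quoted in this model card."""
    learning_rate: float = 5e-5
    batch_size: int = 128   # assumed to be the effective batch size per step
    max_length: int = 1024  # maximum sequence length
    num_epochs: int = 3


def total_optimizer_steps(num_examples: int, cfg: FineTuneConfig) -> int:
    """Optimizer steps over all epochs, assuming the last partial batch is kept."""
    steps_per_epoch = math.ceil(num_examples / cfg.batch_size)
    return steps_per_epoch * cfg.num_epochs


cfg = FineTuneConfig()
# 100_000 is a made-up dataset size, not the size of the real training set.
print(total_optimizer_steps(100_000, cfg))  # 782 steps per epoch * 3 epochs = 2346
```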

## How to use the model

Load the model via the transformers library:
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Download (or load from the local cache) the tokenizer and model weights.
tokenizer = AutoTokenizer.from_pretrained("medicalai/ClinicalGPT-base-zh")
model = AutoModelForCausalLM.from_pretrained("medicalai/ClinicalGPT-base-zh")
```
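
Once the model is loaded, text can be generated with the standard `generate` method of a causal language model. The following is a minimal sketch, not an official inference recipe: the helper `generate_reply`, the example prompt, and the `max_new_tokens` value are assumptions, and this card does not specify a prompt template.

```python
def generate_reply(model, tokenizer, prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a continuation for `prompt` and return only the newly generated text."""
    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_len = len(inputs["input_ids"][0])
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Causal LMs return the prompt followed by the continuation; drop the prompt part.
    new_tokens = output_ids[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


def main() -> None:
    # Imported here so that defining the helper does not require the library.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = "medicalai/ClinicalGPT-base-zh"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name)
    # Hypothetical prompt, for illustration only.
    print(generate_reply(model, tokenizer, "What are common symptoms of anemia?"))
```

Calling `main()` downloads the weights on first use. As noted under Limitations, any output should be independently verified.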

## Limitations

This project is intended for research purposes only; commercial and clinical use are not permitted. Content generated by the model is affected by factors such as model computation, randomness, misinterpretation, and bias, so its accuracy cannot be guaranteed. The project assumes no legal liability for any content the model produces. Users should exercise caution and independently verify generated results.