arxiv:2402.05000
Kangqi (Kevin) Ni
kangqi-ni
AI & ML interests
NLP, CV, RLHF
Recent Activity
updated
a model
about 1 month ago
kangqi-ni/zephyr-7b-beta_bio-tutor_kto
updated
a model
about 1 month ago
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_kto
updated
a model
about 1 month ago
kangqi-ni/Llama-3.1-8b-Instruct_bio-tutor_kto
Organizations
None yet
Papers
1
models
11
kangqi-ni/zephyr-7b-beta_bio-tutor_kto
Updated
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_kto
Updated
•
4
kangqi-ni/Llama-3.1-8b-Instruct_bio-tutor_kto
Updated
•
5
kangqi-ni/Llama-3.1-8B-Instruct_bio-tutor_dpo
Updated
•
6
kangqi-ni/Llama-3.1-8B-Instruct_bio-tutor_sft
Updated
•
10
kangqi-ni/zephyr-7b-beta_bio-tutor_sft
Text Generation
•
Updated
•
13
kangqi-ni/zephyr-7b-beta_bio-tutor_dpo
Text Generation
•
Updated
•
12
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_sft
Text Generation
•
Updated
•
8
kangqi-ni/Mistral-7B-Instruct-v0.2_bio-tutor_dpo
Text Generation
•
Updated
•
15
kangqi-ni/roberta-large-mnli-ricechem
Text Classification
•
Updated
•
13
datasets
None public yet