arxiv:2410.01428
Fangkai Jiao
chitanda
AI & ML interests
self-supervised pre-training, large language model and machine reasoning.
Recent Activity
liked
a dataset
3 days ago
OpenCoder-LLM/opc-sft-stage1
upvoted
a
collection
3 days ago
OpenCoder Datasets
liked
a model
about 1 month ago
mistralai/Ministral-8B-Instruct-2410
Organizations
Papers
15
models
72
chitanda/gemma.2b.it.meta_math_distil.H100.w4.v1.0
Updated
chitanda/gemma.2b.it.meta_math_rap.dpo.H100.w4.v1.1.fix.s42
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.H100.w4.v1.0.th.s43
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.H100.w4.v1.0.th.s42
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.step.dpo.fix_hack.A100.w4.v1.0.th.s44
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.prm.fix_hack.H100.w4.v2.0.s42
Updated
chitanda/llama2.7b.chat.reclor.gpt35turbo1106.dpo-sft.H100.w4.v2.0
Updated
chitanda/llama2.7b.chat.logiqav2.70b-distil.dpo.fix_hack.H100.w4.v1.0.th.test.s43
Updated
chitanda/llama2.7b.chat.logiqav2.llama-2-70b-chat.dpo-sft.A6K.w4.v1.0
Updated
chitanda/llama2.70b.q_lora.merit_v91_v91.seq2seq.v5.0.6aug.filter.w4.adamw.500steps.NA100.1010
Updated
•
4