Fuli Luo (luofuli)
AI & ML interests: None yet
Recent Activity
- authored a paper about 2 months ago: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- liked a model 3 months ago: deepseek-ai/DeepSeek-V3-Base
- liked a model 5 months ago: mistralai/Mamba-Codestral-7B-v0.1
Organizations: None yet
luofuli's activity
- What's the diff with deepseek-ai/deepseek-moe-16b-chat? · 1 · #3 opened 10 months ago by JohnSaxon
- training: fix type mismatch when training · 1 · #6 opened 10 months ago by Jack477
- function/tool calling support · 8 · #5 opened 10 months ago by kaijietti
- Adding `safetensors` variant of this model · #3 opened about 1 year ago by SFconvertbot
- Adding `safetensors` variant of this model · #4 opened 12 months ago by abalogh
- Intermediate Pretraining Checkpoints? · 1 · #3 opened about 1 year ago by RylanSchaeffer
- Intermediate Pretraining Checkpoints? · 1 · #1 opened about 1 year ago by RylanSchaeffer
- Intermediate Pretraining Checkpoints? · 1 · #5 opened about 1 year ago by RylanSchaeffer
- max_positional_embeddings · 1 · #2 opened about 1 year ago by ehartford
- Missing `tokenizer.model` · 3 · #3 opened about 1 year ago by AlienKevin
- Please publish LLM 1.7b base · 2 · #2 opened about 1 year ago by SinanAkkoyun
- llm-1.3b-base · 1 · #2 opened about 1 year ago by SinanAkkoyun
- 4-bit quant? · 2 · #3 opened 12 months ago by Neman
- tokenizer.model · 2 · #26 opened 12 months ago by BigDeeper
- Why do we need the line trust_remote_code=True? · 2 · #23 opened about 1 year ago by Rubiel1
- excellent results · 1 · #1 opened about 1 year ago by Tonic
- Failed to Deploy this model in Inference Endpoints · 3 · #19 opened about 1 year ago by calvinball
- Does it matter if the prompt is incomplete? · 1 · #4 opened over 1 year ago by Hamlyn
- tokenizer.model · 1 · #6 opened over 1 year ago by RonanMcGovern
- Exllamav2 needs tokenizer.model to load · 1 · #8 opened over 1 year ago by CTXEE