-
mistralai/Mistral-7B-Instruct-v0.2
Text Generation • Updated • 3.29M • • 2.6k -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 2.93M • • 4.23k -
mistralai/Mixtral-8x7B-v0.1
Text Generation • Updated • 2.77M • 1.65k -
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 57
Molone Laveh PRO
molonelaveh
·
AI & ML interests
convergence, multi-modality, multi-agent, LLM, research
Recent Activity
liked
a Space
3 days ago
fallenshock/FlowEdit
liked
a Space
5 days ago
argilla/synthetic-data-generator-argilla-reviewer
liked
a Space
5 days ago
autotrain-projects/autotrain-advanced
Organizations
Collections
2
models
None public yet
datasets
None public yet