Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Benjamin-eecs's activity

updated 2 models about 2 months ago

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction • Updated Nov 24, 2024 • 8

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value

Feature Extraction • Updated Nov 24, 2024 • 5

updated a collection about 2 months ago

Natural Language Reinforcement Learning

Collection

4 items • Updated Nov 24, 2024

upvoted a paper about 2 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

authored a paper about 2 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

commented on a paper about 2 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

authored a paper 5 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 53

updated 2 models 7 months ago

deepseek-ai/DeepSeek-V2-Chat

Text Generation • Updated Jun 8, 2024 • 881 • 448

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8, 2024 • 7.09k • 295

authored a paper 7 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 14

upvoted a paper 8 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 14

authored a paper 8 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

New activity in deepseek-ai/DeepSeek-VL-7B 10 months ago

Update app.py

#1 opened 10 months ago by minhdang

New activity in deepseek-ai/deepseek-vl-7b-chat 10 months ago

abb

#8 opened 10 months ago by perpE

upvoted a paper 10 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 40

authored a paper 10 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 40

liked a Space 10 months ago

🐬 Chat with DeepSeek VL 7B

Space • Runtime error • 295

authored a paper about 1 year ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 41