Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published 10 days ago • 33
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 11 days ago • 67
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 12 days ago • 112
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 17 days ago • 78
LLM Pruning and Distillation in Practice: The Minitron Approach Paper • 2408.11796 • Published Aug 21 • 57
Transformers Can Navigate Mazes With Multi-Step Prediction Paper • 2412.05117 • Published 24 days ago • 5
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving Paper • 2407.00079 • Published Jun 24 • 5
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published about 1 month ago • 55
Agent Skill Acquisition for Large Language Models via CycleQD Paper • 2410.14735 • Published Oct 16 • 2
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once Paper • 2405.12971 • Published May 21 • 2
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published Nov 11 • 34
Tulu 3 Datasets Collection • All datasets released with Tulu 3, state-of-the-art open post-training recipes • 32 items • Updated Nov 27 • 63