To Code, or Not To Code? Exploring Impact of Code in Pre-training • arXiv:2408.10914 • Published Aug 20, 2024
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error • arXiv:2403.04746 • Published Mar 7, 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • arXiv:2402.17764 • Published Feb 27, 2024
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling • arXiv:2402.10466 • Published Feb 16, 2024
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens • arXiv:2401.17377 • Published Jan 30, 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling • arXiv:2401.16380 • Published Jan 29, 2024
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads • arXiv:2401.10774 • Published Jan 19, 2024
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training • arXiv:2401.05566 • Published Jan 10, 2024