Oliver Guhr

oliverguhr

AI & ML interests

Voice Interfaces, Robotics, Deep Learning

Recent Activity

liked a model 12 days ago: ssary/XLM-RoBERTa-German-sentiment
liked a model about 1 month ago: openGPT-X/Teuken-7B-instruct-research-v0.4
liked a model about 1 month ago: jinaai/jina-embeddings-v2-base-de

Organizations

Impact Labs GmbH

oliverguhr's activity

reacted to MoritzLaurer's post with 🔥👍 3 months ago
Why would you fine-tune a model if you can just prompt an LLM? The new paper "What is the Role of Small Models in the LLM Era: A Survey" provides a nice pro/con overview. My go-to approach combines both:

1. Start testing an idea by prompting an LLM/VLM behind an API. It's fast and easy, and I avoid wasting time on tuning a model for a task that might not make it into production anyway. (A minimal prompting sketch follows after this list.)

2. The LLM/VLM then needs to be manually validated. Anyone seriously considering putting AI into production has to do at least some manual validation. Setting up a good validation pipeline with a tool like Argilla is crucial, and it can be reused for any future experiments. Note: you can use LLM-as-a-judge to automate some evals, but you always also need to validate the judge! (The second sketch below shows a simple agreement check.)

3. Based on this validation I can then (a) just continue using the prompted LLM if it is accurate enough and makes sense financially given my load; or (b) if the LLM is not accurate enough or too expensive to run in the long run, reuse the existing validation pipeline to annotate additional data for fine-tuning a smaller model. This can be sped up by reusing & correcting synthetic data from the LLM (or just pure distillation). (The last sketch below shows this fine-tuning step.)
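
To make step 1 concrete, here is a minimal sketch of prompting a hosted model behind an API for a classification task. The model ID, label set, and example text are placeholder assumptions, not from the post; it uses huggingface_hub's InferenceClient, but any OpenAI-compatible client would look the same.

```python
# Minimal sketch of step 1: prompt a hosted model behind an API.
# Model ID, labels, and the example text are placeholders.
from huggingface_hub import InferenceClient

client = InferenceClient("meta-llama/Llama-3.1-8B-Instruct")  # placeholder model
LABELS = ["complaint", "question", "feedback"]                # placeholder labels

def classify(text: str) -> str:
    """Ask the hosted LLM to answer with exactly one of the labels."""
    response = client.chat_completion(
        messages=[
            {"role": "system",
             "content": f"Classify the user text as one of: {', '.join(LABELS)}. "
                        "Answer with the label only."},
            {"role": "user", "content": text},
        ],
        max_tokens=8,
        temperature=0.0,
    )
    return response.choices[0].message.content.strip().lower()

print(classify("My order arrived broken and nobody answers my emails."))
```

Keeping temperature at 0 and forcing a label-only answer makes the output easy to parse and to compare against human labels later.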
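
For step 2, a stripped-down stand-in for the validation idea: measure how often the prompted model (or an LLM judge) agrees with a small hand-labelled sample before trusting it on the full workload. The post recommends Argilla for collecting those hand labels; the gold examples here are invented, and `classify` is assumed from the previous sketch.

```python
# Stand-in for the manual-validation step: compare predictions against a
# small hand-labelled sample. Gold examples are invented for illustration.
from collections import Counter

gold = [
    ("My order arrived broken.", "complaint"),
    ("How do I reset my password?", "question"),
    ("Great support, thanks!", "feedback"),
]

def evaluate(predict, samples):
    """Print accuracy and the most common confusions against hand labels."""
    correct, confusions = 0, Counter()
    for text, label in samples:
        prediction = predict(text)
        if prediction == label:
            correct += 1
        else:
            confusions[(label, prediction)] += 1
    print(f"accuracy: {correct / len(samples):.2%}")
    for (expected, got), count in confusions.most_common():
        print(f"{count}x expected '{expected}' but got '{got}'")

evaluate(classify, gold)  # `classify` from the previous sketch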
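
And for step 3, a rough sketch of the fine-tuning path using the Hugging Face Trainer. The base model, label set, and the tiny inline dataset are placeholders; the real training data would come from the validation pipeline (human-corrected LLM annotations or distilled synthetic data).

```python
# Rough sketch of the step-3 fallback: fine-tune a small classifier.
# Base model, labels, and the inline dataset are placeholders.
from datasets import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

LABELS = ["complaint", "question", "feedback"]
label2id = {label: i for i, label in enumerate(LABELS)}

data = Dataset.from_dict({
    "text": ["My order arrived broken.", "How do I reset my password?"],
    "label": [label2id["complaint"], label2id["question"]],
})

model_name = "distilbert-base-uncased"  # any small encoder works here
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name,
    num_labels=len(LABELS),
    id2label={i: label for label, i in label2id.items()},
    label2id=label2id,
)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True,
                     padding="max_length", max_length=128)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="small-classifier",
                           num_train_epochs=3,
                           per_device_train_batch_size=8),
    train_dataset=data.map(tokenize, batched=True),
)
trainer.train()
```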

Paper: https://arxiv.org/pdf/2409.06857
Argilla docs: https://docs.argilla.io/latest/
Argilla is also very easy to deploy with Hugging Face Spaces (or locally): https://huggingface.co/new-space?template=argilla%2Fargilla-template-space