Flo Schneider
floschne
AI & ML interests
Large Vision-Language Models, Cross-modal Retrieval
Recent Activity
authored a paper 1 day ago
Why do LLaVA Vision-Language Models Reply to Images in English?
authored a paper 1 day ago
Multilingual and Explainable Text Detoxification with Parallel Corpora
floschne's activity
A bunch of CUDA errors appearing in the wild
1
#5 opened 6 days ago by floschne
File missing
2
#1 opened about 1 month ago by floschne
Issues when downloading the dataset
2
#1 opened 3 months ago by floschne
Question regarding the Stage 1 training procedure
#1 opened 7 months ago by floschne
Librarian Bot: Add language metadata for dataset
#1 opened 8 months ago by librarian-bot
What processor to use?
9
#4 opened 9 months ago by floschne
Which exact version of Llama 2 was used?
1
#1 opened 9 months ago by floschne
Which Vision Encoder was used here?
1
#9 opened 10 months ago by floschne
Training data details
#8 opened 10 months ago by floschne
What does VIP stand for?
3
#1 opened 11 months ago by floschne
RuntimeError: Error(s) in loading state_dict for LlavaForConditionalGeneration
2
#2 opened 11 months ago by floschne
Why is `beam_search` not enabled in the samples?
2
#10 opened about 1 year ago by floschne
Slow inference
#3 opened almost 2 years ago by floschne