view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 β’ 63
view article Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14 β’ 42
view article Article DEMO: French Spoken Language Understanding with the new speech resources from NAVER LABS Europe By mzboito β’ Aug 28 β’ 8
view article Article Deep Learning over the Internet: Training Language Models Collaboratively Jul 15, 2021 β’ 4
Building and better understanding vision-language models: insights and future directions Paper β’ 2408.12637 β’ Published Aug 22 β’ 110
view article Article LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Jul 25 β’ 18
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 β’ 60
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5 β’ 106
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper β’ 2406.17557 β’ Published Jun 25 β’ 84
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 β’ 168
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28 β’ 148
view article Article Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data By frimelle β’ Jun 3 β’ 13
view article Article Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Mar 15 β’ 5
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Paper β’ 2403.09029 β’ Published Mar 14 β’ 54