TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation Paper • 2401.14373 • Published Jan 25 • 11
view article Article Multilabel Classification using Mistral-7B on a single GPU with quantization and LoRA By sirluk • Jan 22 • 13
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated 6 days ago • 56
OpenScholar_V1 Collection The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". • 8 items • Updated 7 days ago • 26
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 5 days ago • 74
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published 21 days ago • 35
SpeechT5 Collection The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated Jul 11 • 22
Computer Vision Backbones 🧩 Collection Collection of useful computer vision backbones to fine-tune. It also includes large image classification models, that can be used as backbone. • 22 items • Updated Sep 19, 2023 • 19
view article Article Decoding Strategies in Large Language Models By mlabonne • about 1 month ago • 38
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 57
AudioPaLM: A Large Language Model That Can Speak and Listen Paper • 2306.12925 • Published Jun 22, 2023 • 53