Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 2 items • Updated 16 days ago • 7
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 20 days ago • 118
view article Article Optimize and deploy models with Optimum-Intel and OpenVINO GenAI Sep 20, 2024 • 20