![](https://cdn-avatars.huggingface.co/v1/production/uploads/6048ea0c0f59ab4b614f1836/wiD82ZN_nbJ7ejlQ6PCOQ.png)
AnyModal/LaTeX-OCR-Llama-3.2-1B
Updated
•
2
Multimodal LLMs for all! AnyModal is a modular and extensible framework for integrating diverse input modalities (e.g., images, audio) into large language models (LLMs). It enables seamless tokenization, encoding, and language generation using pre-trained models for various modalities.