Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts Paper • 2409.16040 • Published 6 days ago • 9
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 18 days ago • 75
LLM2Encoder Collection Collection of initial models and models that use decoders converted into encoders as backbones • 11 items • Updated 20 days ago • 5
GLiNER bi-encoders Collection Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated 20 days ago • 11
Article LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Jul 25 • 18
NuNerZero - Zero Shot NER Collection The best compact Zero-Shot NER models with MIT license • 4 items • Updated Jul 3 • 17
An Emulator for Fine-Tuning Large Language Models using Small Language Models Paper • 2310.12962 • Published Oct 19, 2023 • 14
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023 • 86
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Paper • 2309.12284 • Published Sep 21, 2023 • 18
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) Paper • 2309.08968 • Published Sep 16, 2023 • 22
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts Paper • 2309.07430 • Published Sep 14, 2023 • 27
AstroLLaMA: Towards Specialized Foundation Models in Astronomy Paper • 2309.06126 • Published Sep 12, 2023 • 16
In-context Autoencoder for Context Compression in a Large Language Model Paper • 2307.06945 • Published Jul 13, 2023 • 27