Joshua Chris

KrisKale45

AI & ML interests

None yet

Recent Activity

liked a Space about 6 hours ago

Remsky/Kokoro-TTS-Zero

Organizations

None yet

KrisKale45's activity

upvoted 2 collections 2 days ago

Phi-4 (All Versions)

Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 4 items • Updated 1 day ago • 39

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 6 items • Updated 5 days ago • 31

upvoted a collection 5 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 2 hours ago • 145

upvoted an article 8 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

8 days ago

• 626

upvoted a collection 11 days ago

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 1 day ago • 49

upvoted an article 17 days ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

21 days ago

• 40

upvoted a paper 24 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 28 days ago • 253

upvoted 2 papers 27 days ago

StreamChat: Chatting with Streaming Video

Paper • 2412.08646 • Published Dec 11, 2024 • 18

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published 30 days ago • 35

upvoted 2 articles 4 months ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

Oct 20, 2024

• 34

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 182

upvoted 2 papers 5 months ago

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Paper • 2409.12139 • Published Sep 18, 2024 • 12

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 48

upvoted an article 5 months ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

Aug 26, 2024

• 42

upvoted 2 papers 6 months ago

FocusLLM: Scaling LLM's Context by Parallel Decoding

Paper • 2408.11745 • Published Aug 21, 2024 • 24

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 51

upvoted 3 articles 6 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Aug 19, 2024

• 76

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12, 2024

• 108

Article

Revolutionizing Video Transcription: Unveiling Gemma-2b-it and Langchain in the Era of Transformers

Mar 12, 2024

• 3

upvoted a collection 7 months ago

NuExtract

4 items • Updated Oct 17, 2024 • 9