-
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 51 -
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 138 -
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper • 2401.04081 • Published • 71 -
hustvl/Vim-tiny
Updated • 19
Michael Schock
mjschock
AI & ML interests
None yet
Recent Activity
liked
a model
1 day ago
nvidia/Hymba-1.5B-Instruct
liked
a dataset
4 days ago
gaia-benchmark/GAIA
liked
a dataset
5 days ago
bigcode/self-oss-instruct-sc2-exec-filter-50k
Organizations
None yet
Collections
1
spaces
1
models
39
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads-11-10-24-v2
Updated
•
22
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads-11-10-24
Updated
•
25
mjschock/TinyLlama-1.1B-Chat-v1.0-sft-chat_threads
Updated
•
74
mjschock/TinyLlama-1.1B-Chat-v1.0
Text Generation
•
Updated
•
434
mjschock/sft_mjschock-chat_threads
Updated
•
10
mjschock/sft_openassistant-guanaco
Updated
mjschock/TinyLlama-1.1B-Chat-v1.0-qlora-ultrachat
Updated
mjschock/TinyLlama-1.1B-2.5T-chat-and-function-calling-Q4_K_M-GGUF
Text Generation
•
Updated
•
4
mjschock/TinyLlama-1.1B-Chat-v1.0-Q8_0-GGUF
Updated
•
10
•
1
mjschock/SmolLM-135M-Q4_K_M-GGUF
Updated
•
6
•
1