Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS

StepFun
company
AI & ML interests
None defined yet.
Recent Activity
Organization Card
Welcome to StepFun π
StepFun, founded in April 2023 with the mission to βScale-up possibilities for everyone,β unites top talent in artificial intelligence from both domestic and international backgrounds, and is dedicated to advancing toward AGI. The company has already launched the Step series of foundation models, which includes Step-2, a cutting-edge trillion-parameter Mixture of Experts (MoE) language model; Step-1.5V, a powerful multimodal large model; and Step-1V, an innovative image generation model, among others.
Collections
1
spaces
2
models
8

stepfun-ai/stepvideo-ti2v
Image-to-Video
β’
Updated
β’
121
β’
56

stepfun-ai/stepvideo-t2v
Text-to-Video
β’
Updated
β’
1.08k
β’
419

stepfun-ai/Step-Audio-Tokenizer
Updated
β’
34

stepfun-ai/Step-Audio-Chat
Audio-Text-to-Text
β’
Updated
β’
736
β’
432

stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
β’
Updated
β’
1.22k
β’
173

stepfun-ai/stepvideo-t2v-turbo
Updated
β’
86

stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
β’
Updated
β’
86.8k
β’
1.43k

stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
β’
Updated
β’
168k
β’
177