StepFun Open-Sources Step 3.7 Flash Model with 400 Tokens/s Generation Speed
StepFun has released Step 3.7 Flash, a sparse MoE model optimized for agent workflows, coding, search, and multimodal tasks. Delivering up to 400 tokens per second, it supports native multimodal understanding, web/visual search, and reliable tool calling across major agent frameworks.
On May 29, 2026, StepFun (Jiyue Xingchen) announced the open-source release of Step 3.7 Flash, a next-generation Flash model designed for the production stage of agent-based systems. The model is systematically optimized for agent workflows, coding, search, and multimodal pipelines.
Architecture & Performance
Step 3.7 Flash employs a sparse Mixture-of-Experts (MoE) architecture with a total of 196 billion parameters plus a 1.8B ViT (vision transformer), while activating only 11 billion parameters per forward pass. The model achieves a peak generation speed of 400 tokens per second, making it ideal for high-frequency, multi-turn, low-latency agent applications.
Key Capabilities
- Native Multimodal Understanding & Execution: The model natively understands UI layouts, charts, documents, images, and application interfaces. It converts complex visual information into structured results, code generation, and executable tasks.
- Enhanced Web & Visual Search: Strengthened retrieval and image search capabilities allow the model to actively fetch and cross-reference multi-source evidence in open information environments.
- Reliable Tool Calling & Orchestration: In long-horizon, multi-turn agent workflows, Step 3.7 Flash can stably invoke APIs, browsers, terminals, Office tools, and external systems while maintaining task consistency and reducing derailment or execution failures.
- Agent Ecosystem Compatibility: The model has been optimized for mainstream agent frameworks such as Claude Code, KiloCode, RooCode, OpenCode, Hermes Agent, and OpenClaw, as well as tool-calling protocols like MCP and Skills. This reduces the cost of model integration and workflow orchestration.
Open-Source Availability
Step 3.7 Flash is now open-source under permissive terms. Key resources:
- Model Page: static.stepfun.com/blog/step-3.7-flash
- GitHub Repository: github.com/stepfun-ai/Step-3.7-Flash
- Hugging Face Model: huggingface.co/stepfun-ai/Step-3.7-Flash
- ModelScope: modelscope.cn/models/stepfun-ai/Step-3.7-Flash
- API Access (China): platform.stepfun.com
- API Access (Global): platform.stepfun.ai
With its combination of high speed, sparse activation, and broad ecosystem support, Step 3.7 Flash represents a significant step forward for open-source agent-oriented models.