StepFun Open-Sources Step 3.7 Flash Model with 400 Tokens/s Generation Speed

On May 29, 2026, StepFun (Jiyue Xingchen) announced the open-source release of Step 3.7 Flash, a next-generation Flash model designed for the production stage of agent-based systems. The model is systematically optimized for agent workflows, coding, search, and multimodal pipelines.

Architecture & Performance

Step 3.7 Flash employs a sparse Mixture-of-Experts (MoE) architecture with a total of 196 billion parameters plus a 1.8B ViT (vision transformer), while activating only 11 billion parameters per forward pass. The model achieves a peak generation speed of 400 tokens per second, making it ideal for high-frequency, multi-turn, low-latency agent applications.

Key Capabilities

Native Multimodal Understanding & Execution: The model natively understands UI layouts, charts, documents, images, and application interfaces. It converts complex visual information into structured results, code generation, and executable tasks.
Enhanced Web & Visual Search: Strengthened retrieval and image search capabilities allow the model to actively fetch and cross-reference multi-source evidence in open information environments.
Reliable Tool Calling & Orchestration: In long-horizon, multi-turn agent workflows, Step 3.7 Flash can stably invoke APIs, browsers, terminals, Office tools, and external systems while maintaining task consistency and reducing derailment or execution failures.
Agent Ecosystem Compatibility: The model has been optimized for mainstream agent frameworks such as Claude Code, KiloCode, RooCode, OpenCode, Hermes Agent, and OpenClaw, as well as tool-calling protocols like MCP and Skills. This reduces the cost of model integration and workflow orchestration.

Open-Source Availability

Step 3.7 Flash is now open-source under permissive terms. Key resources:

Model Page: static.stepfun.com/blog/step-3.7-flash
GitHub Repository: github.com/stepfun-ai/Step-3.7-Flash
Hugging Face Model: huggingface.co/stepfun-ai/Step-3.7-Flash
ModelScope: modelscope.cn/models/stepfun-ai/Step-3.7-Flash
API Access (China): platform.stepfun.com
API Access (Global): platform.stepfun.ai

With its combination of high speed, sparse activation, and broad ecosystem support, Step 3.7 Flash represents a significant step forward for open-source agent-oriented models.

Architecture & Performance

Key Capabilities

Open-Source Availability

Agent Roundtable