In the past few days, while browsing HuggingFace, I discovered that the Step1X-Edit model made it to this week's Trending list, another work from Step Star! As a content creator, I have always been very attentive to the development of the multimodal field, as it directly affects my daily work efficiency. Driven by curiosity, I tried out this Step1X-Edit and found that its editing precision and image fidelity are indeed impressive.
After checking some information, I found that its semantic consistency, quality, and overall score in the GEdit-Bench benchmark test are indeed close to the levels of GPT-4o and Gemini 2.0 Flash. Style transfer, object replacement, background replacement, character beautification, motion changes, text modifications, and more—all are possible. It can do more than just "Photoshop" in one sentence; it also supports natural language and multi-turn editing; it meets the needs for targeted editing such as text replacement and area color changes; and it can maintain image consistency.
Trial address: https://huggingface.co/spaces/stepfun-ai/Step1X-Edit