The resulting model, SU-01, supports stable reasoning on difficult problems with trajectories exceeding 100K tokens
Highlights a key capability: handling extremely long reasoning chains (100K+ tokens). This is a significant metric for evaluating the depth and persistence of the model's problem-solving abilities.