    1. I-DLM-8B is the first DLM to match the quality of its same-scale AR counterpart, outperforming LLaDA-2.1-mini (16B) by +26 on AIME-24 and +15 on LiveCodeBench-v6 with half the parameters

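      This experimental result is striking: it suggests that I-DLM is a breakthrough not only in theory but also in practice. With only 8B parameters, it surpasses the 16B-parameter LLaDA-2.1-mini, gaining 26 points on the math-reasoning benchmark (AIME-24) and 15 points on the code-generation benchmark (LiveCodeBench-v6), which demonstrates the efficiency and effectiveness of introspective diffusion language models.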