10 Matching Annotations
  1. Last 7 days
    1. Kimi K2.6, the best-performing open-source model, achieves just 3.8% on Diamond, 16% on Main and 37% on Extended.

      开源模型与闭源模型之间存在显著差距,最佳开源模型在三个难度级别上的表现均大幅落后。37%的分数在Extended集上仍远低于Claude Opus的51.8%,这突显了开源模型在代码质量评估上的挑战,但也缺乏与商业模型同等规模的训练数据支持。

  2. May 2026
    1. if you can effectively posttrain a model to only meaningfully perform with your closed source agent, then you get to funnel the majority of users to your agent at the expense of your model/API co-opetition

      大多数人认为开源模型会促进竞争和开放生态,但作者认为模型与代理的协同可能导致更封闭的生态系统。这一反直觉观点指出,企业可能通过训练模型使其仅在特定代理环境中有效工作,从而将用户锁定在自己的代理产品中,这与开源社区期望的开放性背道而驰。

  3. Apr 2026
    1. While our production codebase has significantly diverged, including major rewrites of core systems like authentication and data handling, we want to ensure there is still a truly open version available.

      这一声明揭示了开源软件商业化的复杂现实。Cal.com选择保留开源版本但生产代码闭源,反映了开源社区面临的一个两难境地:如何在保持开放精神的同时,保护核心业务免受AI驱动的安全威胁。这种混合模式可能成为未来开源软件的发展方向。

    1. focusing on the ~1.5K mainline open models from the likes of Alibaba's Qwen, DeepSeek, Meta's Llama

      令人惊讶的是:开源语言模型生态系统已经发展出约1500个主流模型,其中包括阿里巴巴的Qwen、DeepSeek和Meta的Llama等知名模型。这一数字表明,开源AI领域已经形成了相当规模和多样性的生态系统,远超许多人的想象。

    2. focusing on the ~1.5K mainline open models from the likes of Alibaba's Qwen, DeepSeek, Meta's Llama

      令人惊讶的是:开源语言模型生态系统已经发展到约1500个主流模型的规模,这远超许多人的想象。阿里巴巴、DeepSeek等中国公司与Meta这样的科技巨头共同塑造了这个庞大而多样化的生态系统,显示了开源AI的蓬勃发展。

  4. Nov 2022
    1. Donations

      To add some other intermediary services:

      To add a service for groups:

      To add a service that enables fans to support the creators directly and anonymously via microdonations or small donations by pre-charging their Coil account to spend on content streaming or tipping the creators' wallets via a layer containing JS script following the Interledger Protocol proposed to W3C:

      If you want to know more, head to Web Monetization or Community or Explainer

      Disclaimer: I am a recipient of a grant from the Interledger Foundation, so there would be a Conflict of Interest if I edited directly. Plus, sharing on Hypothesis allows other users to chime in.

  5. Feb 2021
  6. Dec 2020
  7. Dec 2019