The release includes DeepSeek-V4-Pro (1.6T total / 49B active) and DeepSeek-V4-Flash (284B total / 13B active), both trained natively at 1M context length.
The sheer scale of DeepSeek V4 is striking, and it signals notable progress in long-context processing.