Hypothesis

4 Matching Annotations

May 2026
subq.ai

https://subq.ai/introducing-subq

1
1. fxp007 07 May 2026
  
  in Public
  
  SubQ's research model performs on up to 12 million tokens, while other frontier models break down well before their stated 1M-token limit.
  
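  SubQ's research model can handle up to 12 million tokens, whereas other frontier models break down before reaching their claimed 1M-token limit. This comparison highlights SubQ's significant advantage in context length and marks a major breakthrough in AI architecture.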
  
  data-point comparison context-length

Tags

comparison

context-length

data-point

Annotators

fxp007

URL

subq.ai/introducing-subq
nlp.elvissaravia.com

https://nlp.elvissaravia.com/p/top-ai-papers-of-the-week-f2f

1
1. fxp007 01 May 2026
  
  in Public
  
  The release includes DeepSeek-V4-Pro (1.6T total / 49B active) and DeepSeek-V4-Flash (284B total / 13B active), both trained natively at 1M context length.
  
  The sheer scale of the DeepSeek V4 models is striking, and it points to significant progress in long-context processing.
  
  large-scale-model context-length surprising-data

Tags

context-length

surprising-data

large-scale-model

Annotators

fxp007

URL

nlp.elvissaravia.com/p/top-ai-papers-of-the-week-f2f
Apr 2026
api-docs.deepseek.com

https://api-docs.deepseek.com/news/news260424

1
1. fxp007 30 Apr 2026
  
  in Public
  
  🔹 **1M Standard:** 1M context is now the default across all official DeepSeek services.
  
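  DeepSeek V4 raises the context length to 1 million tokens, setting a new industry standard. This data point is significant: compared with the 32K-128K context windows common in the industry, it is an increase of roughly 8x to 31x, which allows longer documents and more complex tasks to be handled. This requires innovative attention mechanisms and memory-management techniques; the 'Novel Attention: Token-wise compression + DSA' mentioned in the post may be the key to this breakthrough.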
  
  data-point context-length technical-innovation

Tags

context-length

data-point

technical-innovation

Annotators

fxp007

URL

api-docs.deepseek.com/news/news260424
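
The "roughly 8x to 31x" figure in the DeepSeek annotation above can be checked with a few lines of arithmetic. This is a minimal sketch, assuming "K" means 1,000 tokens and "1M" means 1,000,000 tokens; with binary units (1,024 and 1,048,576) the ratios come out as exactly 8 and 32 instead. The names `TARGET`, `BASELINES`, and `scale_factors` are illustrative and do not come from the annotated pages.

```python
# Assumption: decimal units (1K = 1,000 tokens, 1M = 1,000,000 tokens).
TARGET = 1_000_000
BASELINES = {"32K": 32_000, "128K": 128_000}


def scale_factors(target, baselines):
    """Return how many times larger `target` is than each baseline window."""
    return {name: target / size for name, size in baselines.items()}


factors = scale_factors(TARGET, BASELINES)
for name, factor in factors.items():
    # Print each ratio rounded to two decimal places.
    print(f"1M vs {name}: {factor:.2f}x")
```

The larger ratio corresponds to the smaller 32K baseline and the smaller ratio to the 128K baseline, which is where the annotation's range comes from.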
arxiv.org

https://arxiv.org/abs/2604.05091

1
1. fxp007 16 Apr 2026
  
  in Public
  
  MegaTrain also enables 7B model training with 512k token context on a single GH200.
  
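  Surprisingly, the system supports 512k-token context training of a 7B model on a single GH200 GPU, which is far beyond the context-length limits of current mainstream models. This ultra-long-context capability could fundamentally change how large models handle long documents, codebases, or books.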
  
  surprising context-length model-capabilities

Tags

context-length

model-capabilities

surprising

Annotators

fxp007

URL

arxiv.org/abs/2604.05091
