Hypothesis

5 Matching Annotations

May 2026
sakana.ai

https://sakana.ai/diffusion-blocks/

1
1. fxp007 29 May 2026
  
  in Public
  
  With DiffusionBlocks, we split the network into blocks and train them one at a time, so you only need memory for a single block.
  
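  Most people assume that training a deep neural network requires memory proportional to the network's depth, but the authors argue that this limit can be broken: with block-wise training, memory requirements no longer grow linearly with depth. This finding could change how large models are trained.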
  
  counterintuitive memory-efficiency

Tags

counterintuitive

memory-efficiency

Annotators

fxp007

URL

sakana.ai/diffusion-blocks/
Apr 2026
www.technologyreview.com

https://www.technologyreview.com/2026/04/24/1136422/why-deepseeks-v4-matters/

1
1. fxp007 25 Apr 2026
  
  in Public
  
  In a 1-million-token context, V4-Pro uses only 27% of the computing power required by its previous model, V3.2, while cutting memory use to 10%.
  
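  Most people assume that an AI model handling a longer context must need more computing resources, but the author argues that DeepSeek V4 achieves a striking efficiency gain through an innovative architecture, sharply cutting both compute and memory requirements. This counterintuitive finding challenges the industry assumption that "long context equals high cost".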
  
  counterintuitive memory-efficiency ai-architecture

Tags

counterintuitive

ai-architecture

memory-efficiency

Annotators

fxp007

URL

technologyreview.com/2026/04/24/1136422/why-deepseeks-v4-matters/
x.com

https://x.com/berryxia/status/2042017501253661059

1
1. fxp007 16 Apr 2026
  
  in Public
  
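  KV cache memory usage reduced by a factor of 10.7
  
  Surprisingly, KV cache memory usage dropped by a factor of 10.7, far more than a typical technical optimization delivers. The KV cache is a major memory consumer in large-model inference, so a reduction this large means the same hardware can handle longer contexts or run more model instances at once.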
  
  surprising memory-efficiency kv-cache-optimization
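The arithmetic behind the note above can be sketched roughly as follows. This is a back-of-the-envelope illustration, not the method from the linked post: it assumes the textbook size formula for an uncompressed KV cache, and the model dimensions, the 16 GiB budget, and the helper names `kv_cache_bytes` and `max_context` are all hypothetical. Only the 10.7 factor comes from the annotation.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    """Size in bytes of a standard, uncompressed KV cache for one sequence."""
    # Factor 2: each layer stores one key tensor and one value tensor.
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_elem


def max_context(budget_bytes, reduction=1.0, **dims):
    """Longest context that fits in budget_bytes if the cache shrinks by `reduction`."""
    per_token = kv_cache_bytes(1, **dims) / reduction
    return int(budget_bytes // per_token)


# Hypothetical model dimensions, chosen only for illustration (fp16 elements).
dims = dict(n_layers=32, n_kv_heads=8, head_dim=128)
budget = 16 * 2**30  # assumed 16 GiB set aside for the KV cache

baseline = max_context(budget, **dims)
reduced = max_context(budget, reduction=10.7, **dims)
print(f"baseline context: {baseline} tokens")
print(f"with 10.7x smaller cache: {reduced} tokens")
```

Because cache size grows linearly with sequence length, the same memory budget holds roughly 10.7 times as many tokens, or equivalently about 10.7 times as many concurrent sequences of the original length.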

Tags

memory-efficiency

kv-cache-optimization

surprising

Annotators

fxp007

URL

x.com/berryxia/status/2042017501253661059
Oct 2022
Local file

Frederic L. Paxson and His Approach to History

1
1. chrisaldrich 31 Oct 2022
  
  in Public
  
  On the whole, his efficiency probably reduced the time required for taking and filing notes to the amount other historians spent in note-taking alone. What he wrote in his notes was brief, and yet specific enough so that he saved himself the job of searching at length for what he had read. His mind was free to reflect and appraise.
  
  Earl Pomeroy suggests that Paxson's note-taking method freed his mind to better reflect on and appraise his work. This makes the work more efficient, particularly through easier search and recall, and the overall process becomes easier with practice.
  
  note taking affordances, efficiency, atomic notes, card index as memory
Tags

note taking affordances

card index as memory

efficiency

atomic notes

Annotators

chrisaldrich
Feb 2022
Local file

How to Take Smart Notes: One Simple Technique to Boost Writing, Learning and Thinking – for Students, Academics and Nonfiction Book Writers

1
1. chrisaldrich 01 Feb 2022
  
  in Public
  
  We need a reliable and simple external structure to think in that compensates for the limitations of our brains
  
  Let's be honest that there are certainly methods for doing all of this within our brains and not needing to rely on external structures. This being said, using writing, literacy, and external structures does allow us to process things faster than before.
  
  Can we calculate how much greater efficiency this allows? What is the overall throughput difference in being able to forget and write, and not rely on communication with others? What does a back-of-the-envelope calculation for this look like?
  
  orality, orality and memory, Llullan combinatorial arts, efficiency, tools for thought, external structures for thought, internal structures for thought, productivity, open questions
Tags

internal structures for thought

orality

tools for thought

productivity

orality and memory

open questions

efficiency

external structures for thought

Llullan combinatorial arts

Annotators

chrisaldrich
