Hypothesis

2 Matching Annotations

Apr 2026
mp.weixin.qq.com mp.weixin.qq.com

https://mp.weixin.qq.com/s/AfYl4p7AbD6C_Xo5VIkFSg

1
1. fxp007 17 Apr 2026
  
  in Public
  
  未来的 CNC 也许不是一团越来越大的连续表征，而会更像一套可路由、可组合、局部更容易检查的机器底座。
  
  这一观点挑战了当前AI模型向更大规模发展的主流趋势。作者提出神经计算机可能更接近离散、稀疏、局部可验证的结构，这暗示了AI发展可能存在与当前大模型路线完全不同的方向，具有颠覆性意义。
  
  alternative-architecture discrete-neural
Visit annotations in context

Tags

discrete-neural

alternative-architecture

Annotators

fxp007

URL

mp.weixin.qq.com/s/AfYl4p7AbD6C_Xo5VIkFSg
Jan 2023
transformer-circuits.pub transformer-circuits.pub

A Mathematical Framework for Transformer Circuits

1
1. mshook 26 Jan 2023
  
  in Public
  
  A transformer starts with a token embedding, followed by a series of “residual blocks”, and finally a token unembedding. Each residual block consists of an attention layer, followed by an MLP layer. Both the attention and MLP layers each “read” their input from the residual stream (by performing a linear projection), and then “write” their result to the residual stream by adding a linear projection back in. Each attention layer consists of multiple heads, which operate in parallel.
  
  transformer residual architecture alternative colah explanation
Visit annotations in context

Tags

architecture

colah

residual

transformer

alternative

explanation

Annotators

mshook

URL

transformer-circuits.pub/2021/framework/index.html

Tags

Annotators

URL

Tags

Annotators

URL