Hypothesis

8 Matching Annotations

Jun 2026
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/golden-age-of-applications/

1
1. fxp007 17 Jun 2026
  
  in Public
  
  Models are tricky. Budgets prevent defaulting everyone to state-of-the-art. The legion of other models each have a personality.
  
  作者详细描述了不同AI模型的特性差异，如Kimi K2.6创意性强但精确度较低，Qwen 3.6性能好但可能中断工作流，GLM 5.1擅长编程但速度较慢。这提醒开发者需要根据具体需求选择合适的模型，而非盲目追求最新或最大的模型，同时要注意预算限制。
  
  model-selection cost-optimization
Visit annotations in context

Tags

cost-optimization

model-selection

Annotators

fxp007

URL

tomtunguz.com/golden-age-of-applications/
May 2026
apple.github.io apple.github.io

https://apple.github.io/ml-pico/

1
1. fxp007 24 May 2026
  
  in Public
  
  search over millions of model configurations to jointly optimize over perceptual quality and on-device runtime
  
  数百万模型配置的搜索规模表明研究进行了大规模的实验和优化，这增强了结果的可信度。然而，文章没有提供具体的搜索方法、优化算法或计算资源信息，这使得难以评估这一过程的效率和科学性。
  
  data-point model-optimization statistics
Visit annotations in context

Tags

model-optimization

statistics

data-point

Annotators

fxp007

URL

apple.github.io/ml-pico/
www.llmwatch.com www.llmwatch.com

https://www.llmwatch.com/p/ai-agents-of-the-week-papers-you-cbd

1
1. fxp007 01 May 2026
  
  in Public
  
  These papers suggest that strategic data engineering and inference-time optimization can substitute for raw parameter count.
  
  这一观点提出了通过数据工程和推理时间优化来提高模型性能的新方法，为模型优化提供了新的思路。
  
  data-engineering model-optimization
Visit annotations in context

Tags

data-engineering

model-optimization

Annotators

fxp007

URL

llmwatch.com/p/ai-agents-of-the-week-papers-you-cbd
Apr 2026
www.tomtunguz.com www.tomtunguz.com

https://www.tomtunguz.com/gemma-4-vs-gpt-4o/

1
1. fxp007 16 Apr 2026
  
  in Public
  
  In 23 months, the same capability that needed 1.8 trillion parameters now fits in 4 billion parameters. A 450x compression.
  
  令人惊讶的是：AI模型参数量在短短23个月内实现了450倍的压缩，这意味着原本需要超级计算机才能运行的强大AI模型现在可以完全在手机上运行。这种技术进步的速度远超摩尔定律，展示了算法优化和模型压缩技术的惊人突破。
  
  surprising ai-compression model-optimization
Visit annotations in context

Tags

surprising

ai-compression

model-optimization

Annotators

fxp007

URL

tomtunguz.com/gemma-4-vs-gpt-4o/
ai.meta.com ai.meta.com

https://ai.meta.com/blog/introducing-muse-spark-msl/

1
1. fxp007 16 Apr 2026
  
  in Public
  
  After compressing, the model again extends its solutions to achieve stronger performance.
  
  令人惊讶的是：Muse Spark在测试时展现出一种独特的'思想压缩'能力，模型在最初通过延长思考时间提高性能后，会在时间惩罚机制下自发压缩推理过程，然后再扩展解决方案以获得更强的性能。这种动态的自我优化机制在AI模型中前所未见。
  
  surprising ai-reasoning model-optimization
Visit annotations in context

Tags

surprising

model-optimization

ai-reasoning

Annotators

fxp007

URL

ai.meta.com/blog/introducing-muse-spark-msl/
developer.nvidia.com developer.nvidia.com

https://developer.nvidia.com/blog/bringing-ai-closer-to-the-edge-and-on-device-with-gemma-4/

1
1. fxp007 08 Apr 2026
  
  in Public
  
  NVFP4 enables 4-bit precision while maintaining nearly identical accuracy to 8-bit precision, increasing performance per watt and lowering cost per token.
  
  大多数人认为降低模型精度会显著牺牲性能，但作者声称Gemma 4通过NVFP4量化技术实现了4位精度与8位精度几乎相同的准确率。这一反直觉的结论挑战了传统量化会大幅降低模型性能的认知，暗示NVIDIA可能在量化技术方面取得了突破性进展。
  
  non-consensus quantization model-optimization
Visit annotations in context

Tags

quantization

model-optimization

non-consensus

Annotators

fxp007

URL

developer.nvidia.com/blog/bringing-ai-closer-to-the-edge-and-on-device-with-gemma-4/
Oct 2024
x.com x.com

Anthony Dean on X: "@cesifoti The similarity is because they are all saying roughly the same thing: Total (result) = Kinetic (cost) + Potential (benefit) Cost is either imaginary squared or negative (space-like), benefit is real (time-like), result is mass-like. Just like physics, the economic unfavourable" / X

1
1. chrisaldrich 15 Oct 2024
  
  in Public
  
  The similarity is because they are all saying roughly the same thing: Total (result) = Kinetic (cost) + Potential (benefit) Cost is either imaginary squared or negative (space-like), benefit is real (time-like), result is mass-like. Just like physics, the economic unfavourable models are the negative results. In economics, diversity of products is a strength as it allows better recovery from failure of any one, comically DEI of people fails miserably at this, because all people are not equal. Here are some other examples you will know if you do physics: E² + (ipc)² = (mc²)² (relativistic Einstein equation), mass being the result, energy time-like (potential), momentum the space-like (kinetic). ∇² - 1/c² ∂²/∂t² = (mc/ℏ)² (Klein-Gordon equation), mass is the result, ∂²/∂t² potential, ∇² is kinetic. Finally we have Dirac equation, which unlike the previous two as "sum of squares" is more like vector addition (first order differentials, not second). iℏγ⁰∂₀ψ + iℏγⁱ∂ᵢψ = mcψ First part is still the time-like potential, second part is the space-like kinetic, and the mass is still the result though all the same. This is because energy is all forms, when on a flat (free from outside influence) worksheet, acts just like a triangle between potential, kinetic and resultant energies. E.g. it is always of the form k² + p² = r², quite often kinetic is imaginary to potential (+,-,-,-) spacetime metric, quaternion mathematics. So the r² can be negative, or imaginary result if costs out way benefits, or work in is greater than work out. Useless but still mathematical solution. Just like physics, you always want the mass or result to be positive and real, or your going to lose energy to the surrounding field, with negative returns. Economic net loss do not last long, just like imaginary particles in physics.
  
  in reply to Cesar A. Hidalgo at https://x.com/realAnthonyDean/status/1844409919161684366
  
  via Anthony Dean @realAnthonyDean
  
  physics mathematics equations portfolio optimization Markowitz model Sherrington-Kirkpatrick model Hopfield model neural networks Economic Complexity Index (ECI) economic complexity
Visit annotations in context

Tags

economic complexity

physics

mathematics

Hopfield model

Sherrington-Kirkpatrick model

portfolio optimization

equations

Economic Complexity Index (ECI)

Markowitz model

neural networks

Annotators

chrisaldrich

URL

x.com/realAnthonyDean/status/1844409919161684366
Jun 2020
arxiv.org arxiv.org

Clustering - What Both Theoreticians and Practitioners are Doing Wrong

1
1. ErikStuchly 25 Jun 2020
  
  in BehSci
  
  Ben-David, S. (2018). Clustering—What Both Theoreticians and Practitioners are Doing Wrong. ArXiv:1805.08838 [Cs, Stat]. http://arxiv.org/abs/1805.08838
  
  is:article lang:en cluster clustering tool machine learning unsupervised learning theory practice algorithm parameter computational task optimization model selection
Visit annotations in context

Tags

algorithm

computational task

practice

clustering tool

lang:en

parameter

model selection

machine learning

is:article

optimization

cluster

theory

unsupervised learning

Annotators

ErikStuchly

URL

arxiv.org/abs/1805.08838

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL