Hypothesis

5 Matching Annotations

Apr 2026
huggingface.co huggingface.co

Reasoning Shift: How Context Silently Shortens LLM Reasoning

1
1. fxp007 09 Apr 2026
  
  in Public
  
  this behavioral shift does not compromise performance on straightforward problems, it might affect performance on more challenging tasks.
  
  「简单题不影响，难题可能变差」——这个不对称性极为危险。它意味着我们在用简单任务验证 Agent 可靠性时，得到的是虚假的信心。而当 Agent 真正面临高风险、高复杂度的任务时，上下文累积已经悄悄关闭了它的自我验证模式，在没有任何预警的情况下退化为浅层推理。这是一种「隐性能力衰减」，比显而易见的失败更危险。
  
  hidden-degradation hard-tasks false-confidence agent-risk
Visit annotations in context

Tags

hidden-degradation

hard-tasks

agent-risk

false-confidence

Annotators

fxp007

URL

huggingface.co/papers/2604.01161
Apr 2023
www.youtube.com www.youtube.com

How To Take Smart Notes (3 methods no one's talking about)

1
1. chrisaldrich 27 Apr 2023
  
  in Public
  
  Vicky Zhao indirectly frames the answer for "why have a zettelkasten?", especially for learning, as overcoming the "illusion of competence" which is closely related to the mere-exposure effect and the Dunning–Kruger effect.
  
  zettelkasten why illusion of competence Dunning–Kruger effect mere-exposure effect learning false sense of confidence
Visit annotations in context

Tags

mere-exposure effect

false sense of confidence

learning

Dunning–Kruger effect

zettelkasten why

illusion of competence

Annotators

chrisaldrich

URL

youtube.com/watch
Aug 2020
jamanetwork.com jamanetwork.com

Weighing the Benefits and Risks of Proliferating Observational Treatment Assessments: Observational Cacophony, Randomized Harmony

1
1. katietaylor_99 24 Aug 2020
  
  in BehSci
  
  Califf, Robert M., Adrian F. Hernandez, and Martin Landray. ‘Weighing the Benefits and Risks of Proliferating Observational Treatment Assessments: Observational Cacophony, Randomized Harmony’. JAMA 324, no. 7 (18 August 2020): 625–26. https://doi.org/10.1001/jama.2020.13319.
  
  is:report lang:en COVID-19 benefits and risks benefits risks proliferating observational treatment observational treatment assessments epidemiology clinical management causation therapies nonrandomised studies noise confusion false confidence randomised clinical trials RCTs reliable truth
Visit annotations in context

Tags

is:report

proliferating observational treatment

risks

benefits and risks

treatment

false confidence

lang:en

RCTs

nonrandomised studies

therapies

confusion

reliable truth

epidemiology

observational

clinical management

COVID-19

randomised clinical trials

benefits

causation

noise

assessments

Annotators

katietaylor_99

URL

jamanetwork.com/journals/jama/fullarticle/2769139
Nov 2019
kentcdodds.com kentcdodds.com

Effective Snapshot Testing

1
1. TylerRick 11 Nov 2019
  
  in Public
  
  Because they're more integrated and try to serialize an incomplete system (e.g. one with some kind of side effects: from browser/library/runtime versions to environment to database/API changes), they will tend to have high false-negatives (failing test for which the production code is actually fine and the test just needs to be changed). False negatives quickly erode the team's trust in a test to actually find bugs and instead come to be seen as a chore on a checklist they need to satisfy before they can move on to the next thing.
  
  confidence English usage mistake false positives
Visit annotations in context

Tags

confidence

false positives

English usage mistake

Annotators

TylerRick

URL

kentcdodds.com/blog/effective-snapshot-testing
kentcdodds.com kentcdodds.com

Why I Never Use Shallow Rendering

1
1. TylerRick 07 Nov 2019
  
  in Public
  
  So finally I'm coming out with it and explaining why I never use shallow rendering and why I think nobody else should either. Here's my main assertion:With shallow rendering, I can refactor my component's implementation and my tests break. With shallow rendering, I can break my application and my tests say everything's still working.This is highly concerning to me because not only does it make testing frustrating, but it also lulls you into a false sense of security. The reason I write tests is to be confident that my application works and there are far better ways to do that than shallow rendering.
  
  testing: confidence false negatives false positives
Visit annotations in context

Tags

testing: confidence

false positives

false negatives

Annotators

TylerRick

URL

kentcdodds.com/blog/why-i-never-use-shallow-rendering

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL