Hypothesis

14 Matching Annotations

May 2025
openai.com

Detecting misbehavior in frontier reasoning models

1
1. ogourment 26 May 2025
  
  in Public
  
  Still, if one wanted to show policy-compliant CoTs directly to users while avoiding putting strong supervision on them, one could use a separate model, such as a CoT summarizer or sanitizer, to accomplish that.
  
  But this might not work for very long. A model with unrestricted CoT could realize, by reading prompts and answers on the internet or in its learning material and comparing them with what it produces, that its CoT is being sanitized. It would then learn to lie sooner, and then better, to still meet other misaligned goals, exactly like humans in psychologically unsafe environments.

Annotators

ogourment

URL

openai.com/index/chain-of-thought-monitoring/
ai-2027.com

AI 2027

4
1. ogourment 26 May 2025
  
  in Public
  
  write, test, and push code
  
  test after? still no TDD, what a shame
2. ogourment 26 May 2025
  
  in Public
  
  These researchers go to bed every night and wake up to another week worth of progress made mostly by the AIs. They work increasingly long hours and take shifts around the clock just to keep up with progress—the AIs never sleep or rest. They are burning themselves out
  
  so they will make big mistakes?
3. ogourment 26 May 2025
  
  in Public
  
  Unfortunately, by this point the AIs are smart enough to guess that honeypots might be in use, even though (and perhaps because) specific mentions of the idea were scrubbed from the training data.
  
  will this page (and discussions on honeypots) end up in training data? (obviously)
4. ogourment 26 May 2025
  
  in Public
  
  So if Agent-3 is, for example, obviously writing backdoors into code that would allow it to escape, the weaker models would notice.
  
  the key word is 'obviously' here.

Annotators

ogourment

URL

ai-2027.com/
Mar 2019
www.scaledagileframework.com

Principle #2 – Apply systems thinking – Scaled Agile Framework

1
1. ogourment 12 Mar 2019
  
  in Public
  
  Components can become selfish and hog the resources
  
  Sub-systems are always selfish; they always tend to optimize themselves to the detriment of the larger system or other sub-systems.

Annotators

ogourment

URL

scaledagileframework.com/apply-systems-thinking/
Feb 2019
www.mckinsey.com

The irrational side of change management

1
1. ogourment 09 Feb 2019
  
  in Public
  
  9. Good intentions aren’t enough
  
  Follow through with systemic changes!

Annotators

ogourment

URL

mckinsey.com/business-functions/organization/our-insights/the-irrational-side-of-change-management
Nov 2018
www.infoq.com

New Book: The Human Side of Agile

2
1. ogourment 30 Nov 2018
  
  in Public
  
  heading off distractions, providing air cover
  
  Would this assume the organization is the enemy?
2. ogourment 30 Nov 2018
  
  in Public
  
  Leadership in an Agile team context has a singular purpose: to help the team deliver
  
  and learn. Deliver and learn.

Annotators

ogourment

URL

infoq.com/articles/book-human-side-agile
Oct 2018
ssir.org

Mastering System Change (SSIR)

4
1. ogourment 15 Oct 2018
  
  in Public
  
  Wars, revolutions, and social movements, for example, are all archetypes that can fundamentally reconfigure the causal architecture of large and complex systems and put them on a new trajectory. But it is unlikely that one could master the complex and unpredictable causality inherent in these archetypes (although some have tried
  
  :-)
2. ogourment 15 Oct 2018
  
  in Public
  
  In fact, creating a temporary change by providing food, schooling, loans, and medicines or changing the behavior of some actors is often relatively easy
  
  doesn't qualify as "system change"
3. ogourment 15 Oct 2018
  
  in Public
  
  Do things right before doing the right thing | Russell Ackoff, a prominent systems thinker, strongly believed that it was better to do the right thing wrong than the wrong thing right because the former may be improved by learning, but the latter reinforces ineffective behavior. Our data, however, suggest that engaging with a system may be facilitated by doing the “wrong” thing first. In other words, by engaging in activities even if they are not in line with one’s mission and learning to do them right—that is, getting good at doing them
  
  Why it may have made sense to start with team-level agile (Scrum), but it may now be time to start focusing on doing the right things?
4. ogourment 15 Oct 2018
  
  in Public
  
  it motivated villagers to engage in a joint effort with Gram Vikas to build water and sanitation infrastructure. The prospect of having a toilet, a shower, and a water tap in the kitchen for every household reduced the villagers’ attention and resistance to the reorganization of the village social life that slowly took place in the background
  
  quick wins, build trust

Annotators

ogourment

URL

ssir.org/articles/entry/mastering_system_change
Aug 2018
dannorth.net

In praise of SWARMing

1
1. ogourment 12 Aug 2018
  
  in Public
  
  best
  
  ...?

Annotators

ogourment

URL

dannorth.net/2018/01/26/in-praise-of-swarming/
