Hypothesis

4 Matching Annotations

Jun 2026
openai.com openai.com

https://openai.com/index/patch-the-planet/

1
1. fxp007 26 Jun 2026
  
  in Public
  
  Trail of Bits engineers found that, with limited guidance, GPT‑5.5‑Cyber made useful choices about where to expand coverage, which builds and entry points to probe, and which candidates were too weak to pursue.
  
  大多数人认为AI模型需要大量精确指导才能有效工作，但作者认为GPT-5.5-Cyber仅凭有限指导就能自主做出明智的安全分析决策，因为它能够自主判断哪些测试路径有价值，哪些候选问题值得探索。这挑战了AI需要过度监督的常规认知。
  
  non-consensus ai-autonomy security-research
Visit annotations in context

Tags

ai-autonomy

security-research

non-consensus

Annotators

fxp007

URL

openai.com/index/patch-the-planet/
red.anthropic.com red.anthropic.com

Claude Mythos Preview \ red.anthropic.com

1
1. fxp007 05 Jun 2026
  
  in Public
  
  in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.
  
  89%的严重性评估精确一致是一个重要的校准信号：它意味着Mythos不仅能找到漏洞，还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值，但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。
  
  severity-calibration vulnerability-scale ai-security-research
Visit annotations in context

Tags

ai-security-research

vulnerability-scale

severity-calibration

Annotators

fxp007

URL

red.anthropic.com/2026/mythos-preview/
Jun 2024
docdrop.org docdrop.org

Video: Ex-OpenAI Employee Just Revealed it ALL! (DocDrop)

1
1. stopresetgo 22 Jun 2024
  
  in Public
  
  this is a serious problem because all they need to do is automate AI research 00:41:53 build super intelligence and any lead that the US had would vanish the power dynamics would shift immediately
  
  for - AI - security risk - once automated AI research is known, bad actors can easily build superintelligence
  
  AI - security risk - once automated AI research is known, bad actors can easily build superintelligence - Any lead that the US had would immediately vanish.
  
  AI - security risk - once automated AI research is known, bad actors can easily build superintelligence
Visit annotations in context

Tags

AI - security risk - once automated AI research is known, bad actors can easily build superintelligence

Annotators

stopresetgo

URL

docdrop.org/video/om5KAKSSpNg/
Jul 2020
www.youtube.com www.youtube.com

Opening Talk | EA Global: Virtual Conference

1
1. edampf 23 Jul 2020
  
  in BehSci
  
  Centre for Effective Altruism. (2020, June 13 & 14). EAGxVirtual 2020 Virtual Conference. https://www.youtube.com/playlist?list=PLwp9xeoX5p8NfF4UmWcwV0fQlSU_zpHqc
  
  is:youtube lang:en COVID-19 altruism virtual conference webinar video AI security conflict climate change global health animal advocacy decision making research
Visit annotations in context

Tags

COVID-19

virtual conference

altruism

AI

animal advocacy

decision making

security

conflict

lang:en

global health

research

is:youtube

climate change

video

webinar

Annotators

edampf

URL

youtube.com/watch

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL