Hypothesis

12 Matching Annotations

Last 7 days
openai.com openai.com

https://openai.com/index/patch-the-planet/

1
1. fxp007 26 Jun 2026
  
  in Public
  
  Trail of Bits engineers found that, with limited guidance, GPT‑5.5‑Cyber made useful choices about where to expand coverage, which builds and entry points to probe, and which candidates were too weak to pursue.
  
  大多数人认为AI模型需要大量精确指导才能有效工作，但作者认为GPT-5.5-Cyber仅凭有限指导就能自主做出明智的安全分析决策，因为它能够自主判断哪些测试路径有价值，哪些候选问题值得探索。这挑战了AI需要过度监督的常规认知。
  
  non-consensus ai-autonomy security-research
Visit annotations in context

Tags

security-research

non-consensus

ai-autonomy

Annotators

fxp007

URL

openai.com/index/patch-the-planet/
Jun 2026
red.anthropic.com red.anthropic.com

Claude Mythos Preview \ red.anthropic.com

1
1. fxp007 05 Jun 2026
  
  in Public
  
  in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.
  
  89%的严重性评估精确一致是一个重要的校准信号：它意味着Mythos不仅能找到漏洞，还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值，但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。
  
  severity-calibration vulnerability-scale ai-security-research
Visit annotations in context

Tags

vulnerability-scale

ai-security-research

severity-calibration

Annotators

fxp007

URL

red.anthropic.com/2026/mythos-preview/
Apr 2026
blog.vidocsecurity.com blog.vidocsecurity.com

We Reproduced Anthropic's Mythos Findings With Public Models

1
1. fxp007 24 Apr 2026
  
  in Public
  
  If public models can already do useful work inside that kind of workflow, then the story is not 'Anthropic has a magical cyber artifact.' The story is that serious AI-assisted vulnerability research is no longer confined to a single frontier lab.
  
  这一发现挑战了Anthropic试图构建的叙事：即高级AI安全研究需要受限访问。研究表明，公共模型已经能够复制关键的安全发现，这意味着真正的'护城河'不是模型访问，而是验证、优先排序和操作化的能力。这打破了'只有前沿实验室才能进行高级AI安全研究'的神话。
  
  myth-busting public-models security-research
Visit annotations in context

Tags

public-models

myth-busting

security-research

Annotators

fxp007

URL

blog.vidocsecurity.com/blog/we-reproduced-anthropics-mythos-findings-with-public-models
Jun 2024
docdrop.org docdrop.org

Video: Ex-OpenAI Employee Just Revealed it ALL! (DocDrop)

1
1. stopresetgo 22 Jun 2024
  
  in Public
  
  this is a serious problem because all they need to do is automate AI research 00:41:53 build super intelligence and any lead that the US had would vanish the power dynamics would shift immediately
  
  for - AI - security risk - once automated AI research is known, bad actors can easily build superintelligence
  
  AI - security risk - once automated AI research is known, bad actors can easily build superintelligence - Any lead that the US had would immediately vanish.
  
  AI - security risk - once automated AI research is known, bad actors can easily build superintelligence
Visit annotations in context

Tags

AI - security risk - once automated AI research is known, bad actors can easily build superintelligence

Annotators

stopresetgo

URL

docdrop.org/video/om5KAKSSpNg/
Jan 2022
www.nature.com www.nature.com

Two years of COVID-19 in Africa: lessons for the world

1
1. lucyparfitt16 04 Jan 2022
  
  in BehSci
  
  Happi, C. T., & Nkengasong, J. N. (2022). Two years of COVID-19 in Africa: Lessons for the world. Nature, 601(7891), 22–25. https://doi.org/10.1038/d41586-021-03821-8
  
  is:article lang:en COVID-19 Africa response data vaccination campaign international cooperation health security African Union research public health resources funding
Visit annotations in context

Tags

COVID-19

African Union

lang:en

health security

funding

public health

resources

is:article

data

research

response

Africa

international cooperation

vaccination campaign

Annotators

lucyparfitt16

URL

nature.com/articles/d41586-021-03821-8
Oct 2021
www.bmj.com www.bmj.com

Covid-19: Vaccine advisory committee must be more transparent about decisions, say researchers

1
1. lucyparfitt16 07 Oct 2021
  
  in BehSci
  
  Mahase, E. (2021). Covid-19: Vaccine advisory committee must be more transparent about decisions, say researchers. BMJ, n2452. https://doi.org/10.1136/bmj.n2452
  
  is:article lang:en COVID-19 vaccine research government vaccine advisory committee children UK Health Security Agency risk perception
Visit annotations in context

Tags

COVID-19

lang:en

vaccine advisory committee

children

UK Health Security Agency

is:article

research

risk perception

vaccine

government

Annotators

lucyparfitt16

URL

bmj.com/content/375/bmj.n2452
Jul 2020
www.youtube.com www.youtube.com

Opening Talk | EA Global: Virtual Conference

1
1. edampf 23 Jul 2020
  
  in BehSci
  
  Centre for Effective Altruism. (2020, June 13 & 14). EAGxVirtual 2020 Virtual Conference. https://www.youtube.com/playlist?list=PLwp9xeoX5p8NfF4UmWcwV0fQlSU_zpHqc
  
  is:youtube lang:en COVID-19 altruism virtual conference webinar video AI security conflict climate change global health animal advocacy decision making research
Visit annotations in context

Tags

COVID-19

global health

climate change

is:youtube

decision making

lang:en

virtual conference

webinar

AI

animal advocacy

video

altruism

research

conflict

security

Annotators

edampf

URL

youtube.com/watch
Jun 2020
www.ingsa.org www.ingsa.org

Could the next generation of researchers be lost in the aftermath of Covid-19? – INGSA

1
1. ErikStuchly 06 Jun 2020
  
  in BehSci
  
  Could the next generation of researchers be lost in the aftermath of Covid-19? – INGSA. (n.d.). Retrieved June 6, 2020, from https://www.ingsa.org/covidtag/covid-19-featured/ecr-future/
  
  is:article is:webpage lang:en COVID-19 early career research economic impact staff wage reduction unemployment security contract
Visit annotations in context

Tags

COVID-19

is:webpage

early career

lang:en

staff

economic impact

is:article

research

wage reduction

unemployment

security

contract

Annotators

ErikStuchly

URL

ingsa.org/covidtag/covid-19-featured/ecr-future/
May 2020
2020.kent.wordcamp.org 2020.kent.wordcamp.org

WordCamp Kent, Ohio, United States | Online May 30-31, 2020

1
1. TylerRick 19 May 2020
  
  in Public
  
  Certified Ethical Hacker
  
  first sighting "hacker" vs. "cracker" security research
Visit annotations in context

Tags

first sighting

security research

"hacker" vs. "cracker"

Annotators

TylerRick

URL

2020.kent.wordcamp.org/
Apr 2020
lsts.research.vub.be lsts.research.vub.be

Data protection law and the COVID-19 outbreak

1
1. edampf 23 Apr 2020
  
  in BehSci
  
  https://lsts.research.vub.be/en/data-protection-law-and-the-covid-19-outbreak
  
  is:webpage lang:en COVID-19 information resources data security policy privacy research
Visit annotations in context

Tags

COVID-19

is:webpage

lang:en

resources

policy

information

data

research

privacy

security

Annotators

edampf

URL

lsts.research.vub.be/en/data-protection-law-and-the-covid-19-outbreak
xato.net xato.net

Today I Am Releasing Ten Million Passwords

1
1. TylerRick 21 Apr 2020
  
  in Public
  
  But recent events have made me question the prudence of releasing this information, even for research purposes. The arrest and aggressive prosecution of Barrett Brown had a marked chilling effect on both journalists and security researchers.
  
  fear of prosecution/legal harassment security research chilling effect journalism: chilling effect
Visit annotations in context

Tags

chilling effect

security research

journalism: chilling effect

fear of prosecution/legal harassment

Annotators

TylerRick

URL

xato.net/today-i-am-releasing-ten-million-passwords-b6278bbe7495
Nov 2015
arstechnica.co.uk arstechnica.co.uk

US regulators grant DMCA exemption legalising vehicle software tinkering

1
1. daveh70 01 Nov 2015
  
  in Public
  
  Every three years, the Librarian of Congress issues new rules on Digital Millennium Copyright Act exemptions. Acting Librarian David Mao, in an order (PDF) released Tuesday, authorized the public to tinker with software in vehicles for "good faith security research" and for "lawful modification." The decision comes in the wake of the Volkswagen scandal, in which the German automaker baked bogus code into its software that enabled the automaker's diesel vehicles to reduce pollutants below acceptable levels during emissions tests.
  
  copyright dmca security research
Visit annotations in context

Tags

security research

dmca

copyright

Annotators

daveh70

URL

arstechnica.co.uk/tech-policy/2015/10/us-regulators-grant-dmca-exemption-legalizing-vehicle-software-tinkering/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL