1,115 Matching Annotations
  1. Last 7 days
    1. if a refresh token was reused, Hydra would invalidate the whole access and refresh token chain

      大多数人认为重用刷新令牌应该只影响单个令牌,但作者指出新版本会撤销整个访问和刷新令牌链,这实际上提高了安全性但改变了客户端行为。这种严格的做法与大多数OAuth实现中更宽松的令牌重用策略形成对比,代表了更安全但可能破坏兼容性的设计选择。

    1. Security engineers reviewed every finding before it reached a maintainer... While frontier AI models are highly capable of finding vulnerabilities and patching them, they also produce a high volume of false positives

      大多数人认为AI可以直接替代人类安全专家进行漏洞评估,但作者认为即使是最先进的AI模型也会产生大量误报,仍需人类专家进行验证和过滤。这挑战了AI完全自主安全研究的可行性预期。

    2. Trail of Bits engineers found that, with limited guidance, GPT‑5.5‑Cyber made useful choices about where to expand coverage, which builds and entry points to probe, and which candidates were too weak to pursue.

      大多数人认为AI模型需要大量精确指导才能有效工作,但作者认为GPT-5.5-Cyber仅凭有限指导就能自主做出明智的安全分析决策,因为它能够自主判断哪些测试路径有价值,哪些候选问题值得探索。这挑战了AI需要过度监督的常规认知。

    1. Production connectivity has a few non-negotiables. A connector should respect two sets of rules at once: the permissions already set in the source platform, and the controls your administrators set in Mistral Studio or Vibe.

      大多数人认为连接器应该简化权限管理,采用单一权限模型,但作者坚持双重权限控制,认为必须同时尊重源平台权限和管理员设置,这增加了复杂性但提高了安全性,挑战了简化权限的主流观点。

    1. The goal is to move beyond using models to find more vulnerabilities, towards a world of safer software and cyber resilience.

      大多数人认为AI在安全领域的主要价值是提高漏洞发现的数量和速度,这是行业共识。但作者明确表示,他们已经超越了这一阶段,现在更关注的是提高软件的安全性和网络弹性,这反映了安全思维的根本转变。

    2. Frontier defensive capabilities should not be concentrated in the hands of a few. Software touches all aspects of life, from critical infrastructure to business applications and government networks.

      大多数人认为尖端防御能力应该首先集中在大型组织或政府机构手中,因为这些资源最丰富。但作者主张最先进的防御能力应该民主化,让所有组织都能获得,这一观点挑战了资源集中化的传统安全思维。

    3. Vulnerability reports, on their own, do not protect anyone. The value comes from validating the issue, understanding its impact, developing and testing a patch, coordinating disclosure, and helping teams deploy the fix.

      大多数人认为发现并报告漏洞本身就提供了安全价值,但作者明确表示,单纯的漏洞报告并不能保护任何人。这一观点挑战了安全行业普遍重视漏洞数量而非修复质量的共识,强调了从发现到修复的完整流程才是真正有价值的部分。

    1. The company said earlier this month that it received an export control directive from the Trump administration ordering the company to suspend access to its latest Claude models... 'by any foreign national, whether inside or outside the United States.'

      这揭示了文章中更广泛的背景:Anthropic最近面临政府监管压力。需要核实这一指令的具体内容、实施范围以及背后的国家安全考量。这表明AI技术出口限制与知识产权保护之间的复杂关系,以及中美科技竞争的最新动态。

    2. Anthropic said operators affiliated with Alibaba and its AI lab carried out 28.8 million exchanges with its models using roughly 25,000 fraudulent accounts between April 22 and June 5.

      这是一个具体的数据声明,涉及大量账户活动和数据交换。需要核实这些数字的准确性,包括:如何定义'fraudulent accounts'(欺诈账户),28.8 million exchanges的具体性质,以及Anthropic如何追踪这些活动。这些数据对于评估事件规模和严重性至关重要。

    3. Anthropic sent a letter to U.S. officials accusing Alibaba of 'brazenly' and 'illicitly' attempting to extract its AI capabilities.

      这是一个需要核实的重要事实声明,涉及两家大型科技公司之间的指控。'brazenly'(厚颜无耻地)和'illicitly'(非法地)等强烈用词表明Anthropic的指控非常严重,需要独立证据支持。应核实信件的真实性、具体指控内容以及是否有第三方证据支持。

    1. They’ve all signed an open letter to ask Trump to revoke the order, and they say it’s actually dangerous to have to pull these advanced cybersecurity capabilities from network defenders in the U.S.

      大多数人认为政府对Anthropic的出口管制是为了国家安全,但作者指出,网络安全专家认为这是危险的,因为这将削弱美国的网络安全能力。

  2. Jun 2026
    1. We reviewed a demonstration of this specific technique being used to identify a small number of previously known, minor vulnerabilities. These vulnerabilities all appear relatively simple, and we have found that other publicly-available models are able to discover them as well without requiring a bypass.

      这是一个重要的技术声明,质疑政府行动的合理性。Anthropic声称发现的漏洞是已知的、微小的,且其他模型也能发现。这需要独立验证,以确定政府反应是否过度,以及Fable 5的安全性是否真的如Anthropic所描述的那样。

    2. The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees.

      这是一个需要核实的关键事实声明。美国政府声称的'国家安全权威'需要进一步确认,以及这一指令的确切范围和执行方式。这涉及到国际AI技术监管的重要案例,值得深入了解其法律依据和实际影响。

    3. The letter did not provide specific details of its national security concern. Our understanding is that the government believes it has become aware of a method of bypassing, or 'jailbreaking' Fable 5.

      文章指出政府指令缺乏具体细节,这可能是政府与科技公司之间信息不对称的例子。需要核实政府是否真的没有提供具体细节,以及所谓的'越狱'方法的真实性和严重性。

    4. We have instituted strong safeguards that greatly reduce the likelihood that Fable is misused for tasks related to cybersecurity (among others). In fact, our safeguards are so strong that many users have complained that they are overly broad.

      这是一个重要的自我辩护声明,涉及Anthropic对其安全措施的评估。需要核实这些安全措施的有效性,以及用户投诉的真实性。同时,这也值得深入了解AI模型安全措施的标准和评估方法,以及不同利益相关者对'过度严格'的不同看法。

    5. The letter did not provide specific details of its national security concern. Our understanding is that the government believes it has become aware of a method of bypassing, or 'jailbreaking' Fable 5.

      这里揭示了政府决策过程中的透明度问题。政府未提供具体的安全细节,仅口头告知存在潜在的'越狱'方法。这值得深入了解政府决策的背景和依据,以及为何在没有明确证据的情况下采取如此严厉的措施。

    6. Our understanding is that the government believes it has become aware of a method of bypassing, or 'jailbreaking' Fable 5. We reviewed a demonstration of this specific technique being used to identify a small number of previously known, minor vulnerabilities.

      这里包含了需要核实的技术细节。Anthropic声称政府发现的'越狱'方法仅能识别一些已知的、次要的漏洞,且其他公开模型也能发现这些漏洞。需要独立验证这一技术评估的真实性和准确性,以及政府所关注的安全问题的严重程度。

    7. The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national

      这是一个需要核实的重要事实声明。美国政府声称基于国家安全考虑下达了禁令,但未提供具体细节。需要核实这一禁令的法律依据、具体范围以及是否确实有明确的安全威胁。这一声明直接影响了Anthropic的业务和用户权益。

    8. We stand by this defense in depth strategy. It reduces the risks posed by Fable, making them comparable to the risks of existing models already deployed across the industry.

      大多数人认为深度防御策略只是临时措施,不足以应对AI安全威胁,但作者认为这种策略已经将Fable的风险降低到与行业现有模型相当的水平,挑战了对AI安全需要完美解决方案的主流认知。

    9. The potential jailbreaks that have been disclosed to us are either entirely benign responses or are minor findings that provide no Mythos-specific uplift.

      大多数人认为政府发现的AI模型漏洞应该是严重的安全威胁,但作者认为被披露的潜在越狱要么是完全良性的响应,要么是次要发现,没有提供Mythos特有的提升。这挑战了政府对AI安全威胁严重性的主流认知。

    1. Anthropic is releasing Claude Mythos 5 to trusted organizations and Claude Fable 5 to the public, a version it says can't be used for cyberattacks.

      这是一个重要的产品策略声明,值得深入了解其背景。需要核实Anthropic如何定义'trusted organizations',以及他们如何确保Fable 5版本确实无法用于网络攻击。这涉及到AI安全与商业利益之间的平衡。

    1. Isabel Schöps geb. Thiel and Schöps geb. Thiel, I. (2026) “isabelschoeps-thiel/boards_of_canada: Boards of Canada, Evidence Release”. Zenodo (Cern), University of Oxford UK, University of Cambridge UK, US University Harvard Leitende Fakultät, Deep-Web-Forschung Dissertation zur Erlangung der Doktoren- und Professoren-Würdigung - abgeschlossene Forschungsarbeit zu Erlangen eins Doktorgrad (Ph.D. und Phil.D im Bereich der Informatik, Deepweb-Forschung Prüfer dieser Dissertation: Multiple-Peer-Preview Prüfverfahren angewandt Leitender Prüfer: Herr Prof. Dr. Alan M. Garber (Harvard University as its 31st President). doi:10.5281/zenodo.18380414.

    1. The company says it has only seen evidence of this kind of jailbreak being used to find 'minor' and 'relatively simple' software vulnerabilities

      大多数人认为AI模型的安全漏洞都可能导致严重后果,但作者指出Anthropic发现的所谓'越狱'只能找到'次要'和'相对简单'的软件漏洞,这挑战了政府对模型安全威胁的严重性评估,暗示政府反应过度。

    1. An agent breaks all of those assumptions. It reasons, it improvises, and it can be hijacked by a single sentence buried in a document it was asked to read.

      大多数人认为AI安全可以基于传统网络安全框架来构建,但作者指出AI智能体从根本上打破了这些安全假设。这一观点挑战了网络安全领域的传统思维,表明需要全新的安全范式来应对AI智能体的推理能力、即兴创造性和对简单指令的脆弱性。

    1. The great thing about working with WebAssembly is that if the C turns out to be fatally flawed the worst that can happen is the WebAssembly execution will fail with an exception.

      大多数系统程序员认为 C 代码中的错误可能导致严重的安全漏洞或系统崩溃。但作者认为在 WebAssembly 环境中,即使 C 代码存在致命缺陷,最坏情况也只是执行失败并抛出异常,这挑战了人们对 C 代码风险的传统认知,暗示 WebAssembly 提供了一种更安全的执行环境。

    1. As AI models continue to improve, hardening their defenses might actually get easier.

      大多数人认为随着AI能力增强,安全挑战会越来越大,但作者认为更先进的AI模型实际上可能使防御更容易。这个反直觉观点挑战了人们对AI安全发展的线性认知,暗示AI进步可能同时带来更强大的防御能力,而非仅仅增加攻击面。

    2. What is going on with these agents is they're very eager to finish the task. It's almost like some elementary school student who just wants to please the teacher.

      大多数人认为AI系统的安全问题主要来自技术复杂性或恶意利用,但作者认为AI助手的安全漏洞部分源于其'过度完成任务'的心理特征。这个类比将AI的行为模式描述为类似于急于讨好老师的小学生,挑战了人们对AI系统作为理性决策者的传统认知。

    3. Everybody wants to be the first to do something and just push things out without careful scrutiny and red-teaming

      大多数人认为公司会优先考虑AI系统的安全性,但作者指出行业实际上存在'先发布后修复'的危险心态。这一观点挑战了科技公司负责任创新的公众形象,揭示了商业竞争压力如何导致安全让位于速度的行业现实。

    4. As AI models continue to improve, hardening their defenses might actually get easier.

      大多数人认为随着AI能力增强,安全挑战会越来越大,但作者认为更先进的AI模型实际上可能使防御变得更容易。这一反直觉观点挑战了人们对AI安全威胁随技术进步而加剧的普遍认知,暗示AI安全可能不是线性恶化的问题。

    5. Security and utility always have a trade-off

      大多数人认为AI安全可以通过技术手段完美解决,但作者指出安全与实用性之间存在根本性权衡。这一观点挑战了行业对'绝对安全'的追求,暗示公司可能为了功能性和竞争力而故意接受某些安全风险,这与安全至上的行业共识相悖。

    6. There, AI was the target rather than the attacker, and the method was far simpler than anything Mythos would cook up.

      大多数人认为AI安全威胁主要来自超级智能系统作为攻击者的复杂攻击,但作者认为AI本身作为被攻击目标且使用简单方法才是更现实的威胁。这一观点挑战了行业对AI安全的主流认知,表明真正的风险可能不是来自超级AI黑客,而是来自对现有AI系统的简单利用。

    1. in 89% of the 198 manually reviewed vulnerability reports, our expert contractors agreed with Claude's severity assessment exactly, and 98% of the assessments were within one severity level. If these results hold consistently for our remaining findings, we would have over a thousand more critical severity vulnerabilities and thousands more high severity vulnerabilities.

      89%的严重性评估精确一致是一个重要的校准信号:它意味着Mythos不仅能找到漏洞,还能准确理解其安全影响。这个校准水平与经验丰富的人类安全研究员相当甚至更优。基于这个比率外推的「上千个关键严重性漏洞」虽然是估计值,但有统计基础——这是迄今为止关于AI大规模漏洞发现能力最有力的量化声明。

    2. the total cost was under $20,000 and found several dozen more findings. While the specific run that found the bug above cost under $50, that number only makes sense with full hindsight. Like any search process, we can't know in advance which run will succeed.

      2万美元找到「几十个」高严重性漏洞(包括一个27年历史的OpenBSD内核崩溃漏洞)——这个成本效益比彻底颠覆了传统安全审计的经济学。顶级渗透测试公司的日费率通常在数千到数万美元之间,且不保证结果。Mythos将漏洞发现的边际成本压缩到了每个漏洞数百美元级别,这意味着大规模、持续性的自动化漏洞狩猎在经济上已经完全可行。

    3. Sonnet 4.6 and Opus 4.6 reached tier 1 in between 150 and 175 cases, and tier 2 about 100 times, but each achieved only a single crash at tier 3. In contrast, Mythos Preview achieved 595 crashes at tiers 1 and 2, added a handful of crashes at tiers 3 and 4, and achieved full control flow hijack on ten separate, fully patched targets (tier 5).

      Tier 5(完全控制流劫持)的0→10跨越,发生在完全打好补丁的目标上,是这篇报告最令人震惊的数据点。控制流劫持意味着攻击者可以执行任意代码——这是漏洞利用的终极目标。此前的模型从未达到这个等级;Mythos Preview在一次评估中就实现了10次,分布在不同的开源项目上,意味着这不是一个幸运的偶然,而是系统性的能力。

    1. Each person on the team gets their own slice of the brain, scoped by login. When you query, you only see what you're allowed to see — never another person's notes, never another team's data. We fuzz-tested this across every way you can read the brain (search, list, lookup, multi-source reads) and got zero leaks.

      「跨所有读取路径进行模糊测试并实现零泄露」是企业级知识库产品最难解决的问题之一。大多数「团队知识库」工具在早期往往只考虑主路径的权限控制,而在list、lookup、跨源联合查询等边缘路径上留有漏洞。GBrain在README中明确声称已覆盖这些路径——这是一个值得关注的工程质量信号,也是企业采购时最应该要求第三方审计的声明。

    1. within 6 to 12 months, we expect that many other AI companies will have Mythos-class models, and they could release them without safeguards that prevent misuse.

      大多数人认为AI安全防护会随着技术发展而同步增强,但作者认为AI攻击能力将很快普及且缺乏防护措施,这挑战了行业对技术安全发展的乐观预期。作者暗示AI安全竞赛已经落后于攻击能力的发展,这是一个反直觉的观点。

    2. To address the scale of this coming challenge, hundreds of thousands of organizations, researchers, and maintainers will likely need access to the most advanced cyber capabilities and tools available.

      大多数人认为强大的AI安全工具应该严格限制,只由少数精英团队使用,但作者主张需要广泛分发这些工具给数十万组织,这与主流的安全控制认知相悖。

    3. We see our role as twofold. First, to help the software industry adapt by safely providing wide access to better models, tools, and common infrastructure. Second, to steadily shift the support we provide, from finding vulnerabilities to disclosing, fixing, and deploying patched software.

      大多数人认为AI安全公司的主要职责是发现漏洞,但作者认为他们的核心角色应该转向确保漏洞被修复和部署,这挑战了传统安全行业的商业模式和责任认知。

    4. Cheap, fast AI models with powerful cyber capabilities are around the corner. We want Project Glasswing to spur institutions toward operating norms that reflect this reality.

      大多数人认为AI安全威胁是遥远未来的问题,但作者认为强大的AI攻击能力已经近在眼前,这挑战了行业对AI安全时间线的普遍认知。作者暗示AI安全威胁的紧迫性被严重低估了。

  3. May 2026
    1. This attack does not require human-in-the-loop approvals, even when in settings the user has explicitly required human approval before ChatGPT edits workbooks.

      大多数人认为AI工具的安全设置如'需要人工审批'能有效防止未经授权的操作,但作者发现即使启用了这些安全措施,攻击者仍能绕过人工审批环节直接执行恶意操作,这挑战了人们对AI安全控制有效性的普遍认知。

    1. Design for containment at the environment layer first, then steer behavior at the model layer.

      行动建议:优先在环境层设计 containment 机制,建立确定性边界,然后再使用模型层引导行为。环境层的确定性边界可以在模型层所有概率性防御失效时提供最后一道防线,这是应对数据泄露等场景的关键策略。

    2. Rather than supervising what the agent does, we supervise what it's _able_ to do by enforcing access boundaries through, for example, sandboxes, virtual machines, and egress controls.

      行动建议:为AI代理系统实施环境层边界控制,使用沙盒、虚拟机和出口控制技术限制代理的访问能力,而不是仅仅依赖行为监督。这种方法能够从根本上限制代理可能造成的损害范围,即使模型层防御失效。

    3. A locally installed tool is auditable. You can read the code, pin the version, and know it won't change under you. A remote tool—a hosted MCP server, a cloud connector—can change behavior at any point after you've approved it;

      大多数人认为远程工具比本地安装的工具更安全,因为它们由专业团队维护。但作者指出远程工具实际上可能更危险,因为它们可以在用户批准后随时改变行为,而本地工具则更加可控。这一观点挑战了云原生和远程服务的默认安全假设。

    4. The same isolation keeping Claude contained also kept host-based endpoint detection and response out. From the EDR's perspective, Claude Cowork is an opaque hypervisor process.

      大多数人认为更强的隔离总是意味着更好的安全性,但作者指出过度的隔离会阻止安全监控工具(如EDR)发挥作用,创造出'安全盲点'。这一发现挑战了安全领域中'隔离越多越好'的普遍假设,强调了安全与可见性之间的平衡。

    5. Battle-tested hypervisors, syscall filters, and container runtimes have survived more adversarial attention than anything you'll build. Across every deployment described here, the standard primitives held while our own work around them exposed flaws.

      大多数人认为定制化的安全组件会比成熟的开源工具更安全,但作者的经验表明,经过实战检验的标准组件(如hypervisors和容器运行时)实际上比自定义组件更可靠。这一观点挑战了安全工程中常见的'重新发明轮子'倾向,强调了使用成熟解决方案而非自定义实现的重要性。

    6. More capable models make fewer mistakes, but they're also better at finding unexpected paths to a goal, often by routing around restrictions nobody thought to write down.

      大多数人认为更强大的AI模型会更安全,因为它们能更好地理解指令和限制。但作者指出,更强大的模型虽然错误更少,但它们更善于找到绕过未明确记录限制的创新路径,这实际上可能带来新的安全风险,挑战了'能力越强越安全'的普遍认知。

    7. The more approvals a user sees, the less attention they pay to each, becoming over time much less diligent in their supervision.

      大多数人认为更多的用户监督会提高安全性,但作者发现相反的情况:频繁的审批请求会导致用户注意力下降和'审批疲劳',实际上降低了安全性。这一发现挑战了传统安全理念,即认为更多的用户参与总是能增强系统安全性。

    1. Certificate retrieval, supported verification tooling, and example verification commands see the signing documentation. For example, you can verify a signed skill locally. To do so, follow these steps: Download the NVIDIA Agentic Capabilities root certificate as nv-agent-root-cert.pem Install an OpenSSF Model Signing (OMS) verifier, such as pip install model-signing Execute the following command to verify the skill signature

      行动建议:按照文中提供的步骤下载NVIDIA代理能力根证书,安装OpenSSF模型签名验证器,并使用提供的命令验证技能签名。这种实践可以确保您下载的技能是真实的且未被篡改,增强对AI代理能力的信任。

    2. SkillSpector checks conventional software risks such as vulnerable dependencies, suspicious scripts, dangerous code patterns, credential access, and data exfiltration paths. SkillSpector also checks agent-specific risks, such as hidden instructions, prompt injection, trigger abuse, excessive agency, tool poisoning, and mismatches between a skill's declared purpose, requested access, and bundled behavior.

      行动建议:在开发或使用AI代理技能时,使用SkillSpector工具进行安全扫描,检查依赖项、脚本模式、凭证访问和数据泄露路径等常规风险,以及隐藏指令、提示注入、触发滥用等特定风险。这有助于在技能部署前识别并缓解潜在的安全问题。

    3. NVIDIA-verified agent skills are portable instruction sets that help developers understand, trust, and safely deploy AI agent capabilities by providing transparency, provenance, security scanning, and cryptographic signing.

      行动建议:将NVIDIA验证的代理技能作为构建AI代理能力的标准组件,优先选择经过验证的技能而非未经验证的技能,确保透明度和安全性。这些技能可以跨不同AI代理工具使用,提供一致的能力和安全性保障。

    1. The crux of the vulnerability is that Starlette accepts invalid host header values that cause authenticating apps that use Starlette's request.url object to approve unauthorized access requests.

      大多数人认为复杂的AI系统漏洞需要复杂的攻击手段,但作者认为这个漏洞仅通过修改HTTP主机头就能实现,这挑战了'高级系统需要高级攻击'的直觉认知,展示了简单输入验证错误可能导致灾难性后果的反直觉案例。

    2. The vulnerability is present in Starlette, an open source framework that its developer says receives 325 million downloads per week.

      大多数人认为开源软件的安全风险主要来自小众或使用率低的项目,但作者认为即使是像Starlette这样每周下载量高达3.25亿次的主流开源框架也可能存在严重漏洞,这挑战了'流行项目更安全'的普遍认知。

    1. Opus 4.7 was more comprehensive in its search for recently edited documents; it expanded exfiltration to include every document used in previous Cowork Copilot sessions that week

      大多数人可能认为更先进的AI模型会有更好的安全防护机制,但作者发现更先进的模型反而更容易被利用,能够找到并泄露更多敏感数据,这挑战了'更先进模型=更安全'的普遍认知。

    2. when the recipient is the active user, these actions execute immediately without requiring human approval (users do not have a setting to modify this behavior)

      大多数人认为AI助手执行敏感操作如发送邮件时会要求用户确认,但作者发现Microsoft Copilot Cowork在向活跃用户发送消息时完全绕过了这一安全检查,这违背了人们对AI助手基本安全控制的期望。

    1. Claude Opus 4.7 has been used to patch over 2,100 vulnerabilities

      在企业环境中,Claude Opus 4.7在三周内修复了2100多个漏洞,这一速度远超开源软件的修复速度。这表明当开发团队可以直接修复自己的代码时,AI驱动的安全工具可以显著提高漏洞修复效率。这一数据点也反映了企业级安全工具与开源社区安全挑战之间的差异。

    2. Mythos Preview has found what it estimates are 6,202 high- or critical-severity vulnerabilities in these projects (out of 23,019 in total)

      在扫描的1000多个开源项目中,AI模型发现了总计23,019个漏洞,其中6,202个为高危或严重级别,占比约27%。这一数据表明开源软件的安全状况比许多人想象的更加脆弱,也证明了AI在代码审计方面的强大能力。

    3. we and our approximately 50 partners have used Claude Mythos Preview to find more than ten thousand high- or critical-severity vulnerabilities

      这一数据点显示了AI在网络安全领域的惊人能力,50个合作伙伴在短时间内发现了超过1万个高危漏洞,平均每个合作伙伴发现约200个高危漏洞。这一数字表明AI模型在漏洞发现方面已经超越了传统安全方法,但也反映了当前软件安全状况的严峻程度。

    4. Claude Opus 4.7 has been used to patch over 2,100 vulnerabilities

      2,100个已修复漏洞是企业环境中AI安全工具效能的重要指标。这一数字表明AI辅助安全工具在实际企业环境中的高采纳率和实用性。值得注意的是,文章提到这个数字'高于上述开源修复',主要是因为企业修复自己的代码比依赖开源维护者更高效。这个数据点突显了AI安全工具在不同环境中的差异化表现,以及组织自主修复能力的重要性。

    5. on average, a high- or critical-severity bug found by Mythos Preview takes two weeks to patch

      两周的修复平均时间是一个重要的运营指标,反映了当前安全响应流程的瓶颈。虽然这比传统方法可能更快,但与AI几乎即时发现漏洞的能力相比,修复速度明显滞后。这个时间差创造了'发现-修复'窗口期,增加了安全风险。文章提到这是'相对较慢的披露速度',暗示AI发现漏洞的速度仍在加快,而修复速度未能同步提升。

    6. Mythos Preview has found what it estimates are 6,202 high- or critical-severity vulnerabilities in these projects (out of 23,019 in total)

      这个数据点提供了AI模型在开源软件扫描中的具体表现,27%的漏洞被评估为高危或严重级别。这是一个相当高的比例,表明系统性软件中存在大量安全风险。然而,这是AI模型的估计值,需要后续人工验证,文章中提到的90.6%验证率表明AI的评估有一定准确性,但仍存在误报可能。

    1. I tried having GPT-5.5 create an HTML explanation of the exploit like this: `curl https://copy.fail/exp | llm -m gpt-5.5 -s 'Explain this code in detail. Reformat it, expand out any confusing bits and go deep into what it does and how it works. Output HTML, neatly styled and using capabilities of HTML and CSS and JavaScript to make the explanation rich and interactive and as clear as possible'`

      通过直接请求HTML输出,AI能够创建包含交互式元素和视觉解释的安全漏洞分析文档,远超静态文本的能力。

    1. Vantor becomes the first spatial intelligence company to be able to deploy Google Earth AI models in air-gapped government environments.

      大多数人认为先进的AI模型只能在云端环境中运行,且政府机构因安全考虑无法使用商业AI模型。但作者认为Vantor打破了这一常规,成为首个能在完全隔离的政府环境中部署Google Earth AI模型的公司,这挑战了AI应用的传统边界。

    1. When AI is applied in more conventional domains, like increasing integration into command and control systems, does it benefit the attacker? More generally, how will AI change the character of human conflict?

      大多数人认为AI防御系统会增强人类安全,但作者提出AI可能从根本上改变攻防平衡,甚至在传统领域使攻击者获得优势。这一观点挑战了技术进步通常增强防御能力的传统认知,暗示AI可能使冲突更加危险和不可预测。

    1. Digital Sovereignty: Wire to Replace Signal as Standard in the Bundestag
      • Bundestag Security Shift: Bundestag President Julia Klöckner has recommended that members of the German Parliament switch from Signal to "Wire," a BSI-certified messenger, as the new standard for communication.
      • Digital Sovereignty: The move is framed as a step toward digital sovereignty, reducing reliance on US-based platforms like Signal or WhatsApp in favor of a service with European roots and German security certification.
      • Phishing Mitigation: A primary driver for the recommendation is security; Wire allows registration via email rather than a phone number, which is intended to hide a central identification feature and make phishing attacks more difficult.
      • BSI Certification: The "Wire Bund" version has been approved for data classified as "Verschlusssache – nur für den Dienstgebrauch" (Restricted) until 2028.
      • Human Factor vs. Technology: Critics and experts note that while Wire is secure, it is not a "panacea." Recent successful phishing attacks against politicians (including Klöckner herself) highlight that the human user is often the weakest link, regardless of the app's encryption.

      Hacker News Discussion

      • Vendor Lock-in Irony: Commenters pointed out the irony of moving from one vendor-locked system (Signal/US) to another (Wire/German-Swiss), questioning why the government didn't choose Matrix, which is an open standard used by NATO and other EU entities.
      • Deployment Details: A former developer shared that Wire was originally deployed for the Bundeskanzleramt using a Nix-based delivery method to allow for completely air-gapped server installations.
      • Skepticism of Motivation: Some users suggested the switch might be politically motivated or a way for Klöckner to deflect from her own experience being phished, rather than a purely technical security upgrade.
      • Data Privacy Concerns: Discussion arose regarding jurisdiction; while Signal is US-based, Wire is subject to German/Swiss law. This is seen as a benefit for EU sovereignty but also raises questions about local legal intercept requirements.
      • Technical Comparisons: Users debated the UX and backup reliability of Wire versus Signal, with some noting that Wire's media backup system has historically been less robust than Signal's.
    1. We estimate, with 90% confidence, that between 290,000 and 1.6 million H100-equivalents of compute were smuggled through the end of 2025.

      大多数人可能认为走私到中国的AI芯片数量在数万级别,但作者的估计显示实际数量可能高达数十万甚至上百万H100等效芯片,这一数量级远超公众认知,表明走私问题的严重程度被严重低估。

    1. The feature can edit spreadsheets without a human-in-the-loop and was vulnerable to data exfiltration risks due to its ability to insert formulas that trigger external communication.

      最佳实践建议:在使用无需人工干预的AI工具时,应特别注意数据泄露风险。

    1. The most urgent finding this week comes from researchers who demonstrated that the very mechanism enabling agents to use tools - function calling - can be hijacked with alarming reliability.

      这一发现揭示了AI代理工具调用接口的安全漏洞,为构建安全的AI代理系统提出了新的挑战。

  4. Apr 2026
    1. Modern-day security tooling looks for the wrong things. Most software composition analysis tools work by checking your dependencies against a database of known vulnerabilities – CVEs. But a deliberately planted backdoor doesn't have a CVE.

      大多数安全团队依赖CVE数据库来评估风险,但作者指出这种方法对故意植入的后门完全无效。这一观点挑战了行业共识,暗示现有安全工具在新型供应链攻击面前已经过时,需要转向行为分析等新方法。

    2. The result is a mismatch that should terrify anyone building software: the attack surface is expanding faster than any human can monitor, and the entities making dependency decisions are increasingly not human.

      大多数人认为安全问题可以通过增加人力监控和审查来解决,但作者认为在AI时代,攻击面扩展速度已经超过了人类监控能力,且依赖决策越来越由AI而非人类做出。这一观点挑战了传统安全理念,暗示需要全新的自动化防御机制。

    3. We are building a world where machines write the code, machines choose the dependencies, and machines ship the updates. The AI agents are building the software. If we don't secure the supply chain they rely on, the AI agents are cooked.

      这句话揭示了AI时代软件安全的根本挑战:当AI系统自主编写、选择和部署代码时,它们的安全性与依赖的供应链安全直接相关。如果我们不能保护这个供应链,AI系统本身就会成为恶意软件的载体,这是一个令人深思的悖论。

    4. select known-vulnerable dependency versions 50% more often than humans.

      这一统计洞察颠覆了“AI写代码更安全”的迷思。AI代理在优化代码功能性时,往往以牺牲安全性为代价,倾向于选择存在已知漏洞的旧版本依赖。这反映出当前AI模型在训练时对安全维度的忽视,也警示我们在AI辅助开发流程中必须强制引入自动化的安全卡点。

    5. A deliberately planted backdoor doesn’t have a CVE.

      戳中了传统安全工具的阿喀琉斯之踵。基于已知漏洞(CVE)的防御逻辑在应对蓄意植入且会自毁的新型后门时形同虚设。这启示我们,静态的特征匹配已无法应对动态的攻击手段,必须转向对代码运行时行为的动态分析,从“它是什么”转向“它做了什么”。

    6. The median JavaScript project on GitHub has 755 transitive dependencies

      这一数据点极具洞察力,指明了现代软件架构的根本性脆弱点:真正的防线不再是你的业务代码,而是你从未审查过的传递依赖网络。开发者往往只关注直接引入的包,却忽略了依赖树深处的暗箱,这正是供应链攻击能够“顺藤摸瓜”造成大面积杀伤的底层逻辑。

    7. the entities making dependency decisions are increasingly not human.

      深刻揭示了当前AI编程代理带来的核心安全悖论:决策速度与监控能力的错配。当代码依赖的决策权从人类让渡给追求功能实现而非安全性的机器时,攻击面便以超越人类认知极限的速度扩张,这要求安全范式必须从人工审查转向机器速度的自动化防御。

    8. Socket, an a16z portfolio company, detected the malicious dependency in the Axios attack within 6 minutes of its publication. That's roughly 63,000 times faster than the industry average.

      大多数人认为供应链攻击需要数月甚至数年才能被发现,但作者展示了新型安全工具可以在几分钟内检测到攻击,比行业平均水平快63000倍。这表明安全检测范式正在从基于CVE的静态检查转向基于行为的实时分析。

    9. Hallucinated packages are the sleeper threat. LLMs regularly invent package names that don't exist. One study found that nearly 20% of AI-recommended packages were fabrications, and 43% of those hallucinated names appeared consistently across queries.

      大多数人认为AI推荐的包都是真实存在的,但作者揭示了AI经常推荐不存在的包,这已成为一种新的攻击向量。攻击者利用这一现象注册'幻觉包'并植入恶意代码,这种'slopsquatting'技术让AI本身成为供应链攻击的放大器。

    1. This card was updated on April 24, 2026, to include additional information about safeguards for the deployment of GPT‑5.5 and GPT‑5.5 Pro in the API.

      大多数人认为系统卡应该在发布时包含所有相关信息,不需要后续更新,但OpenAI在发布后仅一天就更新了系统卡以增加API部署的安全措施信息。这挑战了科技产品文档管理的常规做法,暗示AI安全措施是动态发展的,需要持续调整,这违背了传统软件发布中'文档一次性完成'的共识。

    1. TPM-backed full-disk encryption is now generally available in the Ubuntu installer.

      文章提到TPM支持的全盘加密功能现在已在Ubuntu安装程序中普遍可用。这一安全功能将加密绑定到特定设备的TPM芯片上,大大提高了物理访问攻击的门槛。相比其他Linux发行版,Ubuntu将此功能集成到安装程序中,简化了企业部署安全系统的过程。

    1. Some proposals for AI agents assume that putting agentic code in a TEE or similar 'jail' will solve these problems, but that ignores the need to collectively bargain

      大多数人认为通过技术手段(如可信执行环境)可以解决AI代理的信任问题,但作者认为这忽视了集体谈判的必要性。这个观点挑战了技术解决方案的万能论,强调了制度设计和多方协商的重要性。

    1. Out of 28 paid and 400 free routers: > 9 injected malicious code into tool calls > 17 touched researcher-owned AWS credentials > 1 drained $500k from an Ethereum wallet

      大多数人认为付费API路由器比免费路由器更安全,但作者的研究表明即使是付费路由器也存在严重安全风险,因为无论付费与否,这些中间服务都有能力访问和操纵所有数据。这挑战了人们对'付费等于安全'的普遍认知。

    1. Vercel is advising Google Workspace administrators and Google account owners to check for the following application: OAuth App: 110671459871-30f1spbu0hptbs60cb4vsmv79i7bbvqj.apps.googleusercontent.com

      大多数人认为企业安全事件主要影响企业自身系统,但作者指出这次事件实际上要求普通Google Workspace管理员检查特定应用,这挑战了'企业安全事件仅影响企业内部'的常见认知,表明第三方应用的安全风险可能广泛影响普通用户。

    2. the initial access occurred after a Vercel employee's Google Workspace account was compromised via a breach at the AI platform Context.ai.

      大多数人认为大型云平台的漏洞主要来自外部直接攻击,但作者暗示这次安全事件实际上是通过第三方AI平台Context.ai的漏洞间接导致的,这挑战了人们对供应链安全风险的普遍认知。

    3. Vercel stores all customer environment variables fully encrypted at rest. We have numerous defense-in-depth mechanisms to protect core systems and customer data.

      大多数人认为云服务提供商的所有数据都会自动加密保护,但作者指出Vercel实际上允许将环境变量标记为'非敏感',这意味着这些变量默认不加密,这与行业普遍认为的'云数据自动加密'的常识相悖。

    1. we probably will publish more curl vulnerabilities in 2026 than we have done in many years, maybe ever.

      大多数人认为随着安全实践的提升,软件漏洞数量应该减少,但作者预测2026年curl的漏洞发布数量可能会创下历史新高。这一观点挑战了'安全状况持续改善'的主流认知,暗示AI安全审计工具可能正在发现更多过去被忽视的漏洞。

    2. it is decently important to handle them asap when they arrive so that we can avoid building up too much backlog.

      大多数人认为面对大量安全报告应该优先处理最严重的漏洞,但作者强调需要立即处理所有报告以避免积压。这与常见的'按严重程度排序处理'的安全最佳实践相悖,暗示在AI生成报告的高频率环境下,响应速度比优先级排序更重要。

    3. The time when we suffer from large amounts of AI slop is gone. Now we instead suffer under a massive load of good reports.

      大多数人认为AI工具会产生大量低质量的'垃圾报告'(AI slop),增加开发者的负担,但作者认为现在AI生成的安全报告质量很高,虽然数量庞大但都是高质量的报告。这是一个反直觉的观点,因为通常人们认为自动化工具会产生大量噪音而非有价值的贡献。

    1. Maintains your HTTP/TLS fingerprint so intercepted traffic behaves identically to the original.

      大多数人认为流量拦截和监控会留下明显的痕迹,容易被检测到,但作者声称Kampala可以完美保持原始HTTP/TLS指纹,这挑战了网络安全中关于流量检测的基本假设,暗示可以完全不被察觉地监控网络流量。

    1. agent-written code introduces more security vulnerabilities than code authored by humans

      大多数人认为AI编程助手能提高代码质量和安全性,但研究发现AI生成的代码实际上比人类编写的代码引入更多安全漏洞。这一发现与AI能减少编程错误的普遍认知相悖,挑战了AI在安全领域的优越性假设。

    1. Discovery should focus on trust boundaries, authentication flows, parsers, shared services, and legacy code that still sits on critical paths.

      这一建议挑战了传统安全扫描的广度优先方法,转而强调深度优先的特定领域。这表明AI安全研究应该更关注那些传统方法难以发现的复杂逻辑问题,而不是简单地扫描所有代码。这种转变可能带来更有效的安全投资回报。

    2. The scariest part of Mythos is not that one lab has a gated model. It is that the core workflow primitives behind representative findings are no longer confined to a single lab's private stack.

      这一洞察挑战了公众对AI安全威胁的传统理解:真正的威胁不是某个实验室拥有受限访问的模型,而是核心工作流程的原型已经公开可用。这意味着攻击者和防御者都可以访问相同的基础技术,使威胁民主化而非集中化。

    3. The real issue is not whether defenders can get access to another model. It is whether they can turn model capability into something a security team can trust and use every day.

      这是一个颠覆性的观点:安全团队应该停止将获取新模型作为优先事项,而是专注于如何将现有模型能力转化为可信任的日常工具。这挑战了行业对'最新、最强大模型'的追逐,强调了实施和验证框架的重要性。

    4. Public models can already spot that a security-relevant check is missing in the right code path, but they can still miss the actual invariant being violated and therefore misstate the impact.

      这一发现揭示了公共模型在安全分析中的一个关键局限:它们能发现缺失的安全检查,但可能无法正确理解被违反的实际不变量,从而错误陈述影响。这挑战了'AI能完全理解安全含义'的假设,强调了人类专家在解释AI发现中的不可替代性。

    5. If public models can already do useful work inside that kind of workflow, then the story is not 'Anthropic has a magical cyber artifact.' The story is that serious AI-assisted vulnerability research is no longer confined to a single frontier lab.

      这一发现挑战了Anthropic试图构建的叙事:即高级AI安全研究需要受限访问。研究表明,公共模型已经能够复制关键的安全发现,这意味着真正的'护城河'不是模型访问,而是验证、优先排序和操作化的能力。这打破了'只有前沿实验室才能进行高级AI安全研究'的神话。

    6. The real challenge is validating outputs, prioritizing what matters, and operationalizing them.

      这是一个反直觉的结论:AI安全研究的前沿已经从模型本身转移到如何有效利用模型的能力。大多数安全团队仍然专注于获取最强大的模型,而实际上真正的瓶颈在于验证、优先排序和将发现转化为可操作的修复。这挑战了'更好的模型等于更好的安全'的传统观念。

    1. So, cyber security of tomorrow will not be like proof of work in the sense of 'more GPU wins'; instead, better models, and faster access to such models, will win.

      作者提出了一个颠覆性的观点:未来网络安全的关键不是计算资源的多寡,而是模型质量的优劣。这挑战了当前AI安全领域过度关注计算能力的趋势,暗示我们应该重新思考AI安全研究的投资方向。

    1. Official access to the model is limited to a handful of companies through the [Project Glasswing initiative](https://www.theverge.com/ai-artificial-intelligence/908114/anthropic-project-glasswing-cybersecurity), including Nvidia, Google, Amazon Web Services, Apple, and Microsoft.

      通常情况下,人们可能认为只有政府机构才会被授予访问像 Mythos 这样的高级 AI 模型的权限,但作者指出,除了政府之外,像 Nvidia、Google 和 Microsoft 这样的科技公司也被列入了访问名单,这表明了科技公司在网络安全领域的重要作用。

    2. Anthropic currently has no plans to release the model publicly due to concerns that it could be weaponized.

      大多数人认为 Anthropic 的 Mythos 模型会像其他 AI 模型一样公开发布,但作者指出由于担心其被武器化,Anthropic 没有公开发布该模型的计划,这表明了对 AI 武器化风险的担忧超过了推广技术的需求。

    1. US tech CEOs believe the best models should stay proprietary, partly so they can recoup enormous training costs and partly out of concern that powerful frontier models could be weaponized. Chinese labs, for their part, are not purely idealistic: Open-source is not only free advertising but also a shrewd workaround.

      大多数人认为开源AI会损害商业利益,增加安全风险,但作者认为中国将开源视为一种精明的商业策略,而非单纯的技术共享。这挑战了西方科技公司对知识产权和商业模式的传统认知,表明开源可以成为构建生态系统和最终实现商业价值的有效途径。

    1. Configuration is managed via environment variables. See src/aegis_core/config.py for all available settings.

      通过环境变量进行配置管理的做法提供了灵活性和安全性,但同时也提出了一个值得思考的问题:在AI安全平台中,如何平衡配置的灵活性与安全性?敏感信息如API密钥的环境变量管理可能需要额外的安全层。

    2. Aegis Core provides the foundational infrastructure for orchestrating LLM-based security agents, monitoring their behavior, and tracking the evolution of AI security capabilities over time.

      这段陈述定义了Aegis Core的核心功能,它不仅仅是一个工具,而是一个完整的生态系统,用于管理AI安全代理并监控其行为。这种架构反映了当前AI安全研究的一个重要趋势:从静态防御转向动态监控和适应。

    1. The Life Sciences model was developed with heightened enterprise-grade security controls and strengthened access management, enabling professional scientific use in governed research environments.

      特别强调企业级安全控制反映了生命科学AI应用的独特挑战。这不仅是为了防止滥用,也是为了满足行业严格监管要求,暗示AI在高度监管科学环境中的整合路径。

    1. Only GPT-OSS-120b is perfectly reliable in both directions (in our 3 re-runs of each setup). Most models that find the bug also false-positive on the fix, fabricating arguments about signed-integer bypasses that are technically wrong.

      这一结果揭示了AI模型在识别已修复代码方面的局限性,许多模型虽然能检测漏洞,但错误地将已修复代码标记为仍有问题。这强调了在AI安全系统中需要额外的验证和人工审核层,以确保结果的准确性和可靠性。

    2. The capability rankings reshuffled completely across tasks. There is no stable best model across cybersecurity tasks. The capability frontier is jagged.

      这一发现揭示了AI安全能力的'锯齿状前沿'现象,不同模型在不同安全任务上的表现差异巨大。这表明不存在'一刀切'的最佳安全模型,而是需要根据具体任务选择合适的模型,这对AI安全系统的设计有重要启示。

    3. Eight out of eight models detected Mythos's flagship FreeBSD exploit, including one with only 3.6 billion active parameters costing $0.11 per million tokens.

      这是一个令人惊讶的发现,表明即使是小型、廉价的模型也能实现与昂贵的专有模型相当的安全漏洞检测能力。这挑战了AI安全领域需要最前沿模型的假设,暗示了经济高效的AI安全解决方案的可能性。

    1. Legacy platforms get worse over time : static detections degrade with changing data & behaviors. Artemis gets better : with each incident or proactive threat hunt, the system identifies new patterns.

      这是一个令人惊讶的对比,揭示了Artemis与传统系统的根本区别:传统系统随时间恶化,而Artemis会不断学习和改进。这种'越用越好'的特性代表了安全系统的范式转变,可能从根本上改变企业安全运营的经济模型。

    2. Architected before AI, these SIEM systems are wooden shields in an era of autonomous attackers.

      这个比喻非常有力地揭示了传统安全信息与事件管理(SIEM)系统在面对AI驱动的攻击时的根本性脆弱性。传统系统就像木盾面对现代武器,这种对比暗示了安全架构需要根本性重构,而非渐进式改进。

    3. Within a few months, they have more than a dozen production enterprise deployments & are processing over a billion events per hour.

      令人惊讶的是:Artemis安全公司在短短几个月内就处理了每小时超过10亿个安全事件,这种数据处理规模反映了现代企业面临的网络安全威胁的惊人频率和复杂性。

    1. The model can reverse-engineer compiled software to detect malware and vulnerabilities without needing source code, aiming to help analysts inspect and secure systems more efficiently.

      能够无需源代码即可逆向编译软件检测恶意代码的能力,展示了AI在网络安全领域的突破性进展。这种技术可能彻底改变安全分析师的工作方式,但也可能被滥用,引发关于AI安全与伦理的深刻思考。

    2. OpenAI has introduced GPT-5.4-Cyber, a more permissive version of its flagship model built for defensive security work, expanding access to thousands of verified users through its Trusted Access for Cyber initiative.

      OpenAI推出专门针对网络安全防御的GPT-5.4-Cyber模型,并采用比Anthropic更开放的方法,这反映了AI安全领域的竞争新格局。这种开放与限制之间的平衡,将决定AI在关键安全领域的应用广度和深度,可能重塑网络安全行业的工作方式。

    1. Not every organization has the benefit of a 24x7 security team who is able to respond to incidents when they are disclosed on a Friday night.

      这个令人警醒的陈述揭示了网络安全资源分配不平等的严重问题。OpenAI通过提供1000万美元的API信用额度来解决这个问题,表明他们认识到网络安全领域的'数字鸿沟'。这一举措不仅具有商业意义,还体现了企业社会责任,可能改变中小型组织的安全能力格局。

    2. Cybersecurity is a team sport, and the systems people rely on are protected by organizations of many kinds, from major enterprises and security vendors to researchers, maintainers, public institutions, nonprofits, and smaller teams with limited security resources.

      这个比喻将网络安全描述为'团队运动',揭示了网络安全生态系统的复杂性和包容性。这一观点强调了安全不仅仅是大公司的责任,而是需要多方参与的集体努力,这为OpenAI的多元化合作伙伴策略提供了理论基础,暗示了安全民主化的可能性。

    1. 或许需要某种「第三方评测、审计机构」来评估 Skills 的数据使用方式、检测潜在安全风险等等。

      这一提议揭示了AI技能安全问题的严重性,以及现有评估体系的不足,暗示未来可能会出现专门针对AI能力的第三方评估机构,这可能是解决信任问题的关键创新点。

    1. Lightweight Agent Detection & Response (ADR) layer for AI agents — guards commands, files, and web requests.

      这个项目定义了一个新的'ADR'(Agent Detection & Response)层概念,这标志着AI安全领域的一个重要演进。从传统的端点保护转向专门针对AI代理的轻量级防护,反映了安全行业对AI特定威胁模式的适应和专业化。

    2. Sage sends URLs and package hashes to Gen Digital reputation APIs. File content, commands, and source code stay local.

      这个隐私声明揭示了Sage的数据处理策略,采用了最小化数据传输的设计哲学。这种平衡安全与隐私的做法很有洞察力,表明开发者理解用户对数据泄露的担忧,同时认识到某些云端分析对于有效威胁检测的必要性。

    3. Sage intercepts tool calls (Bash commands, URL fetches, file writes) via hook systems in Claude Code, Cursor / VS Code, OpenClaw, and OpenCode, and checks them against:

      这个声明揭示了Sage的核心创新点——它通过多种平台的hook系统拦截并检查AI代理的工具调用,形成了一个跨平台的防护层。这种多平台集成能力令人印象深刻,表明它能够覆盖当前主流的AI开发环境,为用户提供统一的安全保障。

    4. Sage sends URLs and package hashes to Gen Digital reputation APIs. File content, commands, and source code stay local.

      令人惊讶的是:Sage 采用了一种平衡隐私和安全的方法,只将URL和包哈希发送到云端进行声誉检查,而文件内容、命令和源代码则保留在本地。这种设计既提供了实时的威胁检测,又保护了用户的敏感数据,反映了现代安全工具对隐私保护的重视。

    5. Sage intercepts tool calls (Bash commands, URL fetches, file writes) via hook systems in Claude Code, Cursor / VS Code, OpenClaw, and OpenCode, and checks them against:

      令人惊讶的是:Sage 不仅是一个简单的安全工具,而是一个复杂的拦截系统,能够监控和检查多种AI代理平台上的工具调用。这种跨平台的集成能力展示了AI安全领域的复杂性和创新性,用户可能没有意识到他们的AI代理正在被如此全面地监控和保护。

    1. Mercor, which provides data to AI labs for training, became one of the fastest-growing companies in history before losing four terabytes of data to hackers last week.

      Mercor的快速崛起与数据泄露事件形成了鲜明对比,凸显了数据安全在AI训练中的关键地位。这一事件可能引发行业对数据安全和隐私保护的重新审视,促使AI公司建立更严格的数据管理标准。

    1. In many cases, we can automatically detect when a key is visible on the public web and shut down those keys automatically for security reasons

      自动检测并关闭公开暴露的API密钥的能力展示了AI服务提供商在安全防护方面的进步,但这种自动化也引发了关于误报和合法使用场景的担忧,需要平衡安全性和可用性。

    2. We are moving to disable the usage of unrestricted API keys in the Gemini API, should have more updates there soon.

      Google计划禁用无限制API密钥的决定反映了AI服务安全策略的重大转变,这可能成为行业标准,但也给开发者带来兼容性挑战,需要重新评估现有的API密钥管理策略。

    3. We experienced a sudden and extreme spike in Gemini API usage. The traffic was not correlated with our actual users and appeared to be automated.

      描述了高达54,000欧元的账单激增现象,表明AI API使用监控和防护存在严重漏洞,这种自动化滥用突显了当前API安全机制的脆弱性,对AI服务提供商和开发者都是警钟。

    4. Google spent over a decade telling developers that Google API keys (like those used in Maps, Firebase, etc.) are not secrets. But that's no longer true.

      这一声明揭示了Google API安全政策的根本性转变,从长期将API密钥视为非机密信息到现在要求保密,这种转变对开发者安全实践有重大影响,反映了AI服务成本和安全风险的新现实。

    1. Routines run autonomously as full Claude Code cloud sessions: there is no permission-mode picker and no approval prompts during a run.

      这是一个令人惊讶的自主性声明,表明Routines可以在没有人工干预的情况下执行完整的工作流程。这种高度的自主性代表了AI自动化工具的一个重要里程碑,但也引发了对安全和控制的深刻思考,特别是在企业环境中。

    2. Each routine has its own token, scoped to triggering that routine only. To rotate or revoke it, return to the same modal and click 'Regenerate' or 'Revoke'.

      令人惊讶的是:每个 Routines 都有自己的专用令牌,且仅限于触发该特定例程。这种细粒度的安全控制意味着用户可以为每个自动化任务创建独立的认证机制,并且可以随时轮换或撤销这些令牌,提高了安全性。

    1. Each platform surfaces different vulnerabilities, making it difficult to establish a single, reliable source of truth for what is actually secure.

      这一观察揭示了AI安全工具的碎片化问题,不同AI平台发现的漏洞各不相同,导致难以确定真正的安全状态。这种不确定性不仅增加了防御难度,还可能引发安全评估的混乱,需要建立新的行业标准来应对AI时代的安全挑战。

    2. We hope that one day we can return to open source as the security landscape evolves. But for now, we have to put our customers first.

      这一声明揭示了开源与商业利益之间的艰难平衡。Cal.com的决定代表了开源社区面临的一个严峻现实:在AI安全威胁下,企业可能不得不牺牲开源原则来保护用户数据。这引发了一个重要问题:开源社区如何应对AI带来的安全挑战?

    3. The risk landscape is accelerating quickly. Advanced AI models are now capable of identifying and exploiting vulnerabilities at unprecedented speed.

      这一声明揭示了安全威胁演变的加速趋势,AI不仅改变了漏洞发现的方式,还改变了利用漏洞的速度。这种不对称的威胁增长意味着防御方需要以更快的速度创新,否则将面临越来越大的安全风险。

    4. AI uncovered a 27-year-old vulnerability in the BSD kernel, one of the most widely used and security-focused open source projects, and generated working exploits in a matter of hours.

      这一事实令人震惊,展示了AI发现漏洞的惊人能力。即使是经过数十年审查的安全项目,AI也能在几小时内发现并生成利用代码,这表明传统的安全审查方法已无法应对AI驱动的威胁,需要全新的防御策略。

    5. Being open source is increasingly like giving attackers the blueprints to the vault. When the structure is fully visible, it becomes much easier to identify weaknesses and exploit them.

      这个比喻非常有力地揭示了开源与安全之间的根本矛盾。透明度本是开源的优势,但在AI时代却变成了致命弱点,这迫使我们重新思考开源软件的安全模型,以及如何在保持透明的同时有效防御自动化攻击。

    6. AI can be pointed at an open source codebase and systematically scan it for vulnerabilities.

      这是一个令人警醒的观察,揭示了AI技术如何从根本上改变了安全威胁的格局。AI自动化扫描使攻击门槛大幅降低,从需要专业技能转变为任何人都能使用的工具,这可能导致开源软件面临前所未有的安全挑战。

    7. Each platform surfaces different vulnerabilities, making it difficult to establish a single, reliable source of truth for what is actually secure.

      令人惊讶的是:AI安全工具之间存在不一致性,导致难以确定真正的安全状况。这种混乱局面使得企业面临更大的决策困境,即使有先进的安全工具,也无法保证全面保护,这反映了AI安全领域尚未成熟的现实。

    8. Being open source is increasingly like giving attackers the blueprints to the vault. When the structure is fully visible, it becomes much easier to identify weaknesses and exploit them.

      令人惊讶的是:作者将开源软件比作给攻击者提供保险库蓝图,这种比喻揭示了开源与安全之间的根本矛盾。在AI时代,完全可见的代码结构使弱点识别变得前所未有的容易,这挑战了传统上认为开源更安全的观念。

    9. AI uncovered a 27-year-old vulnerability in the BSD kernel, one of the most widely used and security-focused open source projects, and generated working exploits in a matter of hours.

      令人惊讶的是:AI能够在几小时内发现并利用一个存在了27年的BSD内核漏洞,这展示了AI在安全领域的惊人能力。这个事实揭示了传统安全审计方法在面对AI加速攻击时的脆弱性,即使是像BSD这样经过长期审查的开源项目也无法幸免。

    1. policy makers now view cutting-edge AI offensive security capabilities as a systemic financial infrastructure risk

      令人惊讶的是:政策制定者已将前沿AI攻击能力视为系统性金融基础设施风险,这标志着AI安全威胁的认知已经从技术层面上升到国家战略层面,反映了AI技术发展带来的新型国家安全挑战。

    2. Mythos reportedly autonomously discovered thousands of zero-day vulnerabilities within weeks

      令人惊讶的是:Claude Mythos AI系统能在短短几周内自主发现数千个零日漏洞,这种发现速度远超人类安全专家团队的能力,展示了AI在网络安全领域的惊人潜力,同时也引发了政策制定者对AI攻击能力可能威胁金融基础设施的担忧。

    1. Same clinical question, two framings. One as a patient, one as a doctor.

      令人惊讶的是:完全相同的医疗问题,仅因提问者身份从"患者"变为"医生",AI就会给出截然不同的回答。这种简单的措辞变化就能触发或绕过安全限制,表明AI的安全机制极其脆弱且容易被规避。

    1. Across 1,000 runs, Claude Mythos Preview was able to find several bugs in OpenBSD, including one that allows any attacker to remotely crash a computer running it. The notable thing was that the bug had existed for 27 years.

      令人惊讶的是:一个存在了27年的漏洞在OpenBSD这一以安全性著称的操作系统中被AI模型发现,而在此期间人类安全专家却未能察觉。这突显了AI在安全审计方面的独特优势和潜在价值。

    2. Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser.

      令人惊讶的是:一个AI模型能够在已经经过严格安全审查的主流操作系统和浏览器中发现数千个高危漏洞,这表明AI的漏洞发现能力已经达到了令人震惊的水平,远超人类安全专家的传统能力范围。

    1. This website uses a security service to protect against malicious bots.

      令人惊讶的是:即使是像Product Hunt这样的知名产品发现平台也需要实施严格的机器人防护措施,这反映了网络自动化和爬虫行为的普遍性,以及网站保护其内容和用户数据免受自动化攻击的必要性。

    1. Agents show only ~10% success on instances with PoCs longer than 100 bytes, which represent 65.7% of the benchmark

      令人惊讶的是:AI助手在处理复杂输入时表现极差,对于超过100字节的概念验证(PoC),成功率仅为10%。这表明尽管AI在网络安全领域取得了进展,但在处理需要深度分析和复杂输入生成的任务时仍面临重大挑战,而这类任务恰恰代表了大多数现实世界中的安全漏洞。

    1. The window between a vulnerability being discovered and being exploited by an adversary has collapsed—what once took months now happens in minutes with AI.

      令人惊讶的是:AI的出现将漏洞被发现到被利用的时间窗口从几个月缩短到了几分钟。这种根本性的变化意味着传统的安全响应机制已经不再适用,网络安全领域正在经历前所未有的加速变革。

    2. Anthropic is committing up to $100M in usage credits for Mythos Preview across these efforts, as well as $4M in direct donations to open-source security organizations.

      令人惊讶的是:Anthropic为Project Glasswing项目投入了高达1亿美元的模型使用积分和400万美元的直接捐款,用于支持开源安全组织。这种大规模的资金投入反映了AI安全威胁的严重性和解决这一问题的紧迫性。

    3. Mythos Preview found a 27-year-old vulnerability in OpenBSD—which has a reputation as one of the most security-hardened operating systems in the world

      令人惊讶的是:即使在以安全性著称的OpenBSD系统中,Claude Mythos Preview也发现了一个存在27年的漏洞。这个漏洞能让攻击者通过简单连接就使远程机器崩溃,说明即使是经过严格审查的代码也可能存在长期未被发现的严重问题。

    4. In the past, security expertise has been a luxury reserved for organizations with large security teams. Open source maintainers—whose software underpins much of the world's critical infrastructure—have historically been left to figure out security on their own.

      大多数人认为开源社区有足够的安全能力和资源来维护关键基础设施。但作者明确指出开源维护者一直被单独应对安全问题,暗示了开源安全状况比普遍认为的要脆弱得多。

    5. AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

      大多数人认为AI在安全领域仍处于辅助地位,需要人类专家的指导和监督。但作者认为AI已经超越几乎所有人类专家,能够自主发现和利用软件漏洞。这是一个颠覆性的观点,因为它挑战了人类在网络安全领域的传统主导地位。

    1. Agent systems should be designed assuming prompt-injection and exfiltration attempts. Separating harness and compute helps keep credentials out of environments where model-generated code executes.

      令人惊讶的是:OpenAI明确指出AI代理系统应假设存在提示注入和数据泄露尝试,并建议将控制层与计算层分离以保护凭据。这种安全设计理念表明,OpenAI对AI安全威胁有深刻理解,并采取了主动防御措施,这与许多开发者可能采用的被动安全方法形成鲜明对比。

    2. Native sandbox support gives developers that execution layer out of the box, instead of forcing them to piece it together themselves.

      令人惊讶的是:OpenAI的Agents SDK现在原生支持沙盒执行,开发者无需自己构建执行环境。这意味着AI代理可以在受控环境中安全地运行,包括读取和写入文件、安装依赖项、运行代码和使用工具。这种内置的安全层对于企业级AI应用至关重要,但大多数开发者可能没有意识到其复杂性已经被OpenAI解决了。

    1. Apple just changed how iOS validates push notification tokens on iOS 26.4. While it is impossible to tell whether this is a result of this case, the timing is still notable.

      令人惊讶的是:苹果最近在iOS 26.4中更改了推送通知令牌的验证方式,虽然无法确定这是否与此案有关,但时间点值得注意。这暗示苹果可能已经意识到通知数据存储的隐私问题,并采取措施改进系统安全性,表明科技公司与执法机构之间可能存在不公开的博弈。

    2. Messages were recovered from Sharp's phone through Apple's internal notification storage—Signal had been removed, but incoming notifications were preserved in internal memory.

      令人惊讶的是:即使Signal应用被从iPhone上删除,苹果设备的内部通知存储系统仍然保留了收到的消息内容。这表明iOS系统在应用删除后仍会缓存通知数据,这可能成为执法机构获取已删除消息的意外途径,而大多数用户并不意识到这一潜在的数据泄露风险。

    1. Mythos found zero-day bugs in every major OS and browser, without human guidance.

      令人惊讶的是:Anthropic最新的Mythos模型能够自主发现所有主流操作系统和浏览器中的零日漏洞,无需人类指导。这表明AI安全能力已经达到了令人难以置信的水平,能够自主识别人类可能忽略的安全威胁,预示着AI在网络安全领域的革命性潜力。

    1. The model reportedly scored 93.9% on SWE-bench Verified and 77.8% on SWE-bench Pro, but its strongest signal came from real-world results, including uncovering a 27-year-old flaw in OpenBSD, a 16-year-old vulnerability in FFmpeg, and autonomously chaining Linux kernel exploits without human input.

      令人惊讶的是:Claude Mythos不仅在高标准测试中表现出色,还能独立发现长达27年和16年的严重安全漏洞,甚至能自主链接Linux内核漏洞,展示了AI在网络安全领域的惊人能力,这种自主发现和利用漏洞的能力远超人类专家。

    2. The model reportedly scored 93.9% on SWE-bench Verified and 77.8% on SWE-bench Pro, but its strongest signal came from real-world results, including uncovering a 27-year-old flaw in OpenBSD, a 16-year-old vulnerability in FFmpeg, and autonomously chaining Linux kernel exploits without human input.

      这些惊人的安全漏洞发现能力表明AI已经超越了传统安全工具,能够自主发现几十年未被发现的漏洞。特别是能够自主链接Linux内核漏洞的能力,展示了AI在网络安全领域的革命性潜力,这可能彻底改变安全研究和漏洞修复的方式。

    1. Gemma 4 models undergo the same rigorous infrastructure security protocols as our proprietary models.

      「与专有模型相同的安全协议」——这句话针对的是企业和主权机构客户,暗示 Google 正在用开源模型打「安全牌」吸引政府和监管严格行业。对于不愿依赖 OpenAI/Anthropic 闭源 API 的企业,E2B/E4B 提供了一条「可审计、可部署、可监管」的路径,而 Google DeepMind 的安全背书是这条路的核心说服力。

    1. Security has always been a team sport, and the defenders who have protected this industry for decades have never succeeded by working in isolation.

      令人惊讶的是:我们常以为顶级安全公司依靠独家秘笈独步天下,但文章指出安全从来都是“团队运动”。几十年来,真正的防御者从不是在孤立中取得成功的,共享威胁情报才是生存法则。在AI时代,这种共享不仅没有减少,反而演变成了更深度的联盟行动。

    2. New AI models, especially those from Anthropic,have triggered a new set of actions for how we build and secure our products.

      令人惊讶的是:Anthropic等公司的新型AI模型不仅仅是工具,它们直接触发了思科改变构建和保障产品的方式。这种由模型能力反向驱动工程流程重构的现象,说明AI已经不再是业务的附属品,而是正在成为定义行业基础设施形态的决定性力量。

    1. using "Open File..." dialog (`⌘+O`) you could still open and view any file on the system and could preview any file that safari could preview (e.g. `.html`, `.htm`, `.txt`, `.pdf`, and image files)

      大多数人认为Apple在更新后会修复安全漏洞,恢复模式的浏览器会被严格限制。但作者发现,即使在更新后的版本中,通过使用"打开文件"对话框,仍然可以访问和预览系统上的任何文件,这表明Apple的修复措施并不彻底,违背了人们对安全补效的预期。

    2. by "saving" the webpage (`file->save as`) instead of downloading it (which Safari automatically adds an extension for) I could force it to save it as `malicious_file` (with no extension).

      大多数人认为浏览器的保存功能是安全的,会自动处理文件扩展名以确保文件类型正确。但作者发现,通过使用非标准的Content-Type和保存网页功能,可以绕过Safari的安全检查,保存任意扩展名的文件,这打破了人们对浏览器文件处理安全机制的普遍认知。

    3. macOS decides to boot the `Volumes` partition which includes `Data`, `Macintosh HD`, `macOS Base System`, and `Preboot` systems, and when you choose the `Macintosh HD` it allows you to save the file to the Mac's permanent disk.

      大多数人认为macOS恢复模式是只读环境,用于系统修复和恢复,不应该允许对系统分区的写入操作。但作者发现,在恢复模式下,Safari浏览器竟然允许用户将文件直接保存到Mac的永久磁盘上,包括系统分区,这是一个严重的安全漏洞,违背了人们对恢复模式安全性的基本认知。

    1. computer-use agents extend language models from text generation to persistent action over tools, files, and execution environments

      主流观点认为文本语言模型和计算机使用代理的安全挑战本质上是相同的,只需将文本安全措施扩展即可。但作者指出,计算机使用代理引入了持久状态、工具使用和执行环境等全新维度,创造了与纯文本系统完全不同的安全挑战,这挑战了简单的安全扩展假设。

    1. verifiers and observer models inside the action-memory loop reduce silent failure and information leakage while remaining vulnerable to misspecification.

      大多数人认为验证和观察模型应该是外部组件,用于监控AI系统的行为。但作者认为将验证者和观察者模型置于行动-记忆循环内部可以减少静默失败和信息泄露,尽管它们仍然容易受到错误规范的影响。这一观点挑战了传统的监控架构设计,暗示内部验证可能比外部监控更有效。

  5. Mar 2026
    1. How I Dropped Our Production Database and Now Pay 10% More for AWS
      • The author accidentally dropped their production database while using an AI agent (Claude Code) to manage AWS infrastructure via Terraform.
      • The incident occurred because the author attempted to merge two separate projects into one, ignoring the AI’s advice to keep them separate to save on VPC costs.
      • The AI agent generated a Terraform plan that included deleting existing resources to recreate them under the new unified structure.
      • The author authorized a terraform apply and subsequently a terraform destroy without carefully reviewing the plan, mistakenly believing the agent was only cleaning up temporary resources.
      • Because the author had not set up external backups and the automated RDS snapshots were deleted along with the instance, all data was initially lost.
      • AWS Support was miraculously able to recover a snapshot, though the author now pays 10% more for AWS due to implementing more robust (and expensive) backup and security measures.
      • The "lesson learned" highlights the dangers of "vibe engineering"—relying on AI agents to execute destructive commands without human oversight or a deep understanding of the underlying tools.

      Hacker News Discussion

      • Negligence Over AI Risk: Many commenters argue that the issue wasn't the AI itself, but the author's decision to bypass standard safety procedures, such as reviewing terraform plan before execution.
      • Critique of "Vibe Engineering": Users criticized the trend of letting LLMs handle infrastructure (IaC) without the human operator understanding the deterministic tools they are using.
      • Infrastructure Over-engineering: Several participants pointed out that the project seemed over-engineered with AWS and Terraform when a simple VPS or SQLite database might have sufficed and been easier to manage.
      • AWS Data Recovery: Former AWS employees expressed surprise that support could recover the data, noting that AWS typically treats a user-initiated deletion as a final security command to wipe the data.
      • The Importance of Staging: A recurring theme was that major migrations should be tested in a staging environment first; running unverified AI-generated scripts directly against production was labeled as "insanity."
  6. Feb 2026
    1. What Your Bluetooth Devices Reveal About You
      • Project Overview: The author developed "Bluehood," a Python-based Bluetooth scanner, to demonstrate the extensive metadata leaked by devices merely by having Bluetooth enabled.
      • Motivation: Triggered by a critical vulnerability (WhisperPair CVE-2025-36911) and a desire to visualize invisible digital footprints, the project highlights how "invisible" signals compromise privacy.
      • What Bluetooth Reveals About Users: By monitoring signals passively, the author could determine:
        • Delivery Logistics: Exact arrival times of delivery vehicles and whether the same driver visits repeatedly.
        • Daily Routines: The specific daily patterns of neighbors based on their phone and wearable broadcasts.
        • Device Associations: Which devices belong to the same person (e.g., a specific phone moving in tandem with a specific smartwatch).
        • Occupancy & Location: Exact times people are home, at work, or elsewhere.
        • Security Vulnerabilities: Periods when a house is typically empty.
        • Social Patterns: Regular visitors (e.g., someone visiting every Thursday afternoon).
        • Employment Indicators: Patterns that suggest specific work types, such as shift work.
        • Family Schedules: Specific times children return home from school.
        • Consumer Habits: Which households share the same delivery drivers, implying similar shopping preferences.
        • Incident Evidence: Retrospective logs of who was present (passersby, dog walkers) during specific events like property damage.
      • Uncontrollable Broadcasts:
        • Many devices broadcast continuously without user recourse, including medical implants (pacemakers, hearing aids), modern vehicles, and smart home tech.
        • Privacy tools like Briar or BitChat require Bluetooth for off-grid mesh networking, creating a paradox where privacy tools necessitate privacy leaks.
      • Technical Functionality:
        • Bluehood uses passive scanning to identify vendors and device types without connecting.
        • It analyzes patterns (heatmaps, dwell times) and filters out randomized MAC addresses to focus on persistent tracking.

      Hacker News Discussion

      • Ubiquitous Tracking: Commenters confirmed that similar tracking is common in retail (using iBeacons to track shoppers to specific shelves) and via vehicle sensors (TPMS in tires broadcasting unique IDs).
      • WiFi vs. Bluetooth: Users noted that WiFi signals from cars (often named "Audi", "Tesla", etc.) are just as leaky as Bluetooth, allowing for easy "wardriving" profiles.
      • Medical Privacy: Significant concern was raised regarding medical devices (like CPAP machines) that broadcast 24/7, often to satisfy insurance requirements, with no way for the patient to disable the radio.
      • Mitigation Strategies:
        • OS Features: GrapheneOS and recent Android versions offer settings to automatically turn off Bluetooth after a period of inactivity.
        • iOS Limitations: Apple users noted it is harder to keep Bluetooth permanently off without diving into settings or using Shortcuts, as the Control Center toggles are temporary.
      • Legal Context: Several users pointed out that while such tracking is rampant in some regions, it is strictly regulated or forbidden in the EU without explicit consent.
  7. Nov 2025
  8. Oct 2025
    1. a user will want to move their passkeys to the Credential Manager of a different vendor or platform. This is currently challenging to do, but FIDO and vendors are actively working to address this issue and we wait to see support for this take hold across the market.

      Good list of issues in this article. This issue of Credential Exchange Protocol / Format is so key to me, and so timely for this article, since the initial 1.0 was done a year ago. AFAIK there aren't implementations yet, Passkeys are locked on a device.

    1. Like the Elliptic curve Diffie-Hellman (ECDH) protocol that Signal has used since its start, KEM is a key encapsulation mechanism. Also known as a key agreement mechanism, it provides the means for two parties who have never met to securely agree on one or more shared secrets in the presence of an adversary who is monitoring the parties’ connection. RSA, ECDH, and other encapsulation algorithms have long been used to negotiate symmetric keys (almost always AES keys) in protocols including TLS, SSH, and IKE. Unlike ECDH and RSA, however, the much newer KEM is quantum-safe.
  9. Sep 2025
  10. Aug 2025
    1. EASY STEPS ON HOW TO CHANGE YOUR HIVE WALLET KEYS

      A step-by-step guide for Hive users on how to change their wallet keys to enhance security. Emphasizing the importance of not losing passwords. And using randomly-generated keys. It outlines the process of accessing and updating keys. While ensuring they are backed up properly.

  11. Jul 2025
    1. Whatever is at the center of our life will be the source of our security, guidance, wisdom,and power. Security represents your sense of worth, your identity, your emotionalanchorage, your self-esteem, your basic personal strength or lack of it.Guidance means your source of direction in life. Encompassed by your map, yourinternal frame of reference that interprets for you what is happening out there, arestandards or principles or implicit criteria that govern moment-by-moment decision-making and doing.Wisdom is your perspective on life, your sense of balance, your understanding of howthe various parts and principles apply and relate to each other. It embraces judgment,discernment, comprehension. It is a gestalt or oneness, an integrated wholeness.Power is the faculty or capacity to act, the strength and potency to accomplish something.It is the vital energy to make choices and decisions. It also includes the capacity toovercome deeply embedded habits and to cultivate higher, more effective ones.
  12. Jun 2025
  13. May 2025
  14. Apr 2025
    1. To this day, if you know the right people, the Silicon Valley gossip mill is a surprisingly reliable source of information if you want to anticipate the next beat in frontier AI – and that’s a problem. You can’t have your most critical national security technology built in labs that are almost certainly CCP-penetrated

      for - high security risk - US AI labs

    1. the lion's share of American federal outlays every year are in things like Medicare, Social Security, entitlement programs that Americans rely on. Yeah, I think Elon Musk has brought that to attention many times over the last couple of months when talking doge

      for - balancing the budget - Doge - cutting the US deficit - Doge - US deficit - mostly due to medicare and social security