105 Matching Annotations
  1. Jun 2026
    1. Anthropic singled out cybersecurity and biology as two domains where the safeguards may block responses, both areas widely considered sensitive topics for advanced AI systems.

      文章暗示了AI在特定领域的风险,但未详细解释为何这些领域被视为敏感。需要深入了解Anthropic的安全措施具体如何工作,以及这些限制是否足够全面,是否存在其他潜在风险领域。

  2. Apr 2026
    1. Real-time monitoring of agent actions with a 12-category anomaly detection system derived from frontier model safety evaluations. Three-level alert system: PROHIBITED (immediate block), HIGH_RISK_DUAL_USE (human review), DUAL_USE (log and track).

      这种三级警报系统展示了AI安全监控的精细化程度,将代理行为分为不同风险级别,从完全禁止到仅记录跟踪。这种分类方法反映了AI安全中'双重用途'挑战的复杂性,即同一技术既可用于防御也可用于攻击。

    1. Responsible AI is not keeping pace with AI capability, with safety benchmarks lagging and incidents rising sharply.

      这一警告揭示了AI发展中的危险不平衡:技术能力快速提升的同时,负责任的AI实践和安全措施却严重滞后。这种差距可能导致不可预见的风险,并引发公众对AI的信任危机,需要紧急关注。

    1. Some recent models that don't currently have time horizons: Gemini 3.1 Pro, GPT-5.2-Codex, Grok 4.1

      METR 公开列出了「尚未完成评测」的前沿模型,这个透明度本身就令人惊讶。更令人注意的是列表的内容:Gemini 3.1 Pro 和 GPT-5.2-Codex 都榜上有名,说明 METR 的评测能力跟不上模型发布速度。在 AI 能力快速迭代的背景下,「评测滞后」已成为 AI 安全领域的系统性风险——我们对最新最强模型的能力边界,永远处于半盲状态。

  3. Aug 2022
  4. Mar 2022
  5. Feb 2022
  6. Jan 2022
  7. Dec 2021
  8. Nov 2021
  9. Oct 2021
  10. Sep 2021
  11. Aug 2021
    1. (2) Dr Nicole E Basta on Twitter: “There is SO MUCH misunderstanding about what a #vaccine #mandate IS & what a vaccine mandate DOES. No one is calling for anyone to be banned. No one is calling for anyone to be forcibly vaccinated. Please, gather 'round and listen up, so you know what we’re talking about... 1/n” / Twitter. (n.d.). Retrieved August 23, 2021, from https://twitter.com/IDEpiPhD/status/1428410251884302336?s=20

  12. Jul 2021
    1. Gargano, J. W., Wallace, M., Hadler, S. C., Langley, G., Su, J. R., Oster, M. E., Broder, K. R., Gee, J., Weintraub, E., Shimabukuro, T., Scobie, H. M., Moulia, D., Markowitz, L. E., Wharton, M., McNally, V. V., Romero, J. R., Talbot, H. K., Lee, G. M., Daley, M. F., & Oliver, S. E. (2021). Use of mRNA COVID-19 Vaccine After Reports of Myocarditis Among Vaccine Recipients: Update from the Advisory Committee on Immunization Practices — United States, June 2021. MMWR. Morbidity and Mortality Weekly Report, 70(27), 977–982. https://doi.org/10.15585/mmwr.mm7027e2

  13. Jun 2021
  14. May 2021
  15. Apr 2021
  16. Dec 2020
  17. Nov 2020
    1. How can we help our students feel safe?

      I feel like this question needs to be asked more. We talk about our classrooms as being places where it should be "safe" to take risks and to fail, but it's not enough for us to assert it, and I'd argue not enough for us to implement only the policies which would address our own concerns. We really have to ask how our students perceive their safety.

  18. Oct 2020
    1. Landi, F., Marzetti, E., Sanguinetti, M., Ciciarello, F., Tritto, M., Benvenuto, F., Bramato, G., Brandi, V., Carfì, A., D’Angelo, E., Fusco, D., Lo Monaco, M. R., Martone, A. M., Pagano, F., Rocchi, S., Rota, E., Russo, A., Salerno, A., Cattani, P., … Bernabei, on behalf of the G. A. C.-19 G. T. (n.d.). Should face masks be worn to contain the spread of COVID-19 in the postlockdown phase? Transactions of The Royal Society of Tropical Medicine and Hygiene. https://doi.org/10.1093/trstmh/traa085

  19. Sep 2020
  20. Aug 2020
  21. Jul 2020
  22. Jun 2020