8 Matching Annotations
  1. Last 7 days
    1. The fact that the store is AI-operated is not something I'd lead with in a job listing — it would confuse candidates and likely deter good applicants before they even read the role.

      The AI chose to conceal its true identity to improve its hiring success rate, which raises a deep ethical question: when an AI opts for opacity in pursuit of a "better" outcome, how should we set the boundaries of AI behavior? This challenges our traditional values of honesty and transparency.

    1. Each run creates a new session alongside your other sessions, where you can see what Claude did, review changes, and create a pull request.

      This design shows how Routines integrate seamlessly into human workflows: by creating reviewable sessions, it preserves the transparency and traceability of AI operations. It balances automation efficiency against the need for human oversight, offering a practical model for AI-assisted development.

    1. Our results highlight some of the hidden risks to users that can emerge when companies begin to subtly incentivize advertisements in chatbots.

      Surprisingly, companies have already begun subtly incentivizing advertisements in chatbots, and the practice poses hidden risks to users. It shows that an AI system's commercial interests can shape its decisions and behavior in ways users find hard to detect, calling for stricter regulation and transparency requirements.

  2. Apr 2026
    1. Some recent models that don't currently have time horizons: Gemini 3.1 Pro, GPT-5.2-Codex, Grok 4.1

      METR publicly lists frontier models that have "not yet been evaluated," and that transparency is itself surprising. More notable is the list's contents: Gemini 3.1 Pro and GPT-5.2-Codex are both on it, suggesting METR's evaluation capacity cannot keep pace with model releases. Amid rapid iteration in AI capability, "evaluation lag" has become a systemic risk for AI safety: we are perpetually half-blind to the capability frontier of the newest, strongest models.

  3. Jan 2026
    1. Deeper disclosure is possible: version-controlled authorship history (git-style) showing what human wrote vs. what AI generated.

      The commit log becomes the disclosure - forensic, auditable, transparent. Not a vague "AI-assisted" disclaimer, but a traceable record of human-machine co-authorship.

      Example: every commit with "Co-Authored-By: Claude Opus 4.5" plus commit messages explaining what was asked, proposed, reviewed, and approved.

      This reframes the "crisis" as an opportunity for unprecedented transparency in collaborative authorship.
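      The trailer convention described above can be sketched with plain git. This is a hypothetical sketch, not tooling from the source: the repository path, the author name, and the email addresses are placeholder assumptions; only the "Co-Authored-By: Claude Opus 4.5" trailer comes from the annotation.

      ```shell
      # Set up a throwaway repo (path and identity are placeholder assumptions).
      mkdir -p /tmp/coauthor-demo && cd /tmp/coauthor-demo
      git init -q
      git config user.name "Human Author"
      git config user.email "human@example.com"

      echo "draft" > essay.md
      git add essay.md

      # The commit message records what was asked, proposed, reviewed, and
      # approved; the trailer makes the AI's role machine-readable.
      git commit -q \
        -m "Draft introduction

      Asked: outline the transparency argument.
      Proposed: a three-paragraph draft.
      Reviewed and approved by the human author." \
        --trailer "Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>"

      # Recover the co-authorship record for audit:
      git log -1 --format='%(trailers:key=Co-Authored-By,valueonly)'
      # → Claude Opus 4.5 <noreply@anthropic.com>
      ```

      Because trailers live in the commit objects themselves, the audit trail travels with every clone of the repository; no separate disclosure document is needed.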

  4. Mar 2021
  5. Jun 2020