5,006 Matching Annotations
- Last 7 days
-
claude.ai claude.ai
-
-
fmthandpickedai.substack.com fmthandpickedai.substack.com
-
Lees het als een overtuigend prototype van een nieuwe manier van maken. En tegelijk gewoon als mijn verhaal. Over wat me al die jaren heeft gedreven, wat al die nieuwsbrieven met elkaar verbindt en waarom ik nog steeds zo veel energie krijg van nieuwe gereedschappen die mensen meer speelruimte geven
Author recognises himself in the output, and suggests seeing the result as a convincing prototype of a new way of making.
-
Natuurlijk had er nog een stevige eindredactieronde overheen gekund. Sterker nog, normaal gesproken had ik dat vrijwel zeker gedaan. Nog wat aanscherpen. Hier en daar schrappen. Een paar overgangen gladder maken. Sommige zinnen net iets strakker trekken. Maar dit keer heb ik dat bewust niet gedaan. Juist omdat ik wilde laten zien wat er nu al mogelijk is. Ik heb een uitgebreide prompt, een verzameling instructies, gegeven over bedoeling, workflow en output.
Author deliberately did not polish the AI output, to have a better view on what it actually produced from the inputs.
-
-
derekneal.substack.com derekneal.substack.com
-
I am advocating for writers to prevent themselves from becoming AI.
Encouraging book reviewers to bring some originality to their reviews.
-
-
www.anthropic.com www.anthropic.com
-
If we can better understand the potential for threats to be exacerbated by AI systems, society can more easily become resilient to this changed threat landscape.
大多数人认为AI威胁主要是技术问题,需要技术解决方案。但作者暗示社会适应和韧性建设可能同样重要,甚至更重要。这挑战了纯技术解决AI安全问题的主流观点,强调了社会适应的必要性。
-
Are there transparency regimes and tools that can enable a broad set of people, not just frontier AI companies, to easily study real-world AI usage?
大多数人认为AI研究和监测需要专业知识和资源,但作者提出可能存在透明度机制让普通人也能研究AI使用情况。这一观点挑战了AI研究必须由精英机构垄断的认知,暗示AI监测可能变得更加民主化。
-
When does access to agents able to negotiate on your behalf improve market efficiency and equitable outcomes? When does it not?
大多数人认为AI代理谈判者总是会改善市场效率和公平性,但作者质疑这一假设,暗示AI代理可能并不总是带来积极结果。这挑战了技术进步必然带来更好结果的乐观观点,暗示我们需要更细致地理解AI对市场的影响。
-
If an intelligence explosion was upon us, what intervention points would facilitate slowing or otherwise changing the rate of the explosion? Assuming humans can intervene, which entities should wield this capacity—governments? Companies?
大多数人认为AI发展速度是不可阻挡的,技术进步只会加速。但作者提出可能存在干预点来减缓AI爆炸式增长,甚至质疑政府或公司是否应该拥有这种控制权。这挑战了技术发展的不可阻挡性假设,暗示人类可能对超级智能发展有更多控制力。
-
-
-
We believe the future of AI isn't just about scaling monolithic models, but engineering collaborative, diverse AI ecosystems that can adapt and combine their strengths.
作者直接挑战了当前AI行业的发展方向,认为未来不在于扩大单一模型,而在于构建协作的多样化AI生态系统,这与主流AI发展理念形成鲜明对比。
-
In nature, complex problems are rarely solved by a single monolithic entity, but rather by the coordinated efforts of specialized individuals working together.
作者将自然界生态系统作为类比,暗示AI发展应该遵循生物多样性的原则,而非当前行业普遍追求的单一大型模型。这与主流AI发展方向形成鲜明对比,提出了一个反直觉的生物学视角。
-
What if instead of building one giant AI, we evolved a coordinator to orchestrate a diverse team of specialized AIs?
大多数人认为AI发展的方向是构建越来越大的单一模型,但作者提出了一种反直觉的观点:通过进化一个协调者来管理多个专业化AI可能更有效。这挑战了当前AI行业普遍追求模型规模扩大的共识。
-
-
-
GPT-5.5 Pro still regularly gets my favorite GSM8K question wrong.
这一表述暗示即使是先进的AI系统在基本数学问题上仍有错误,表明AI在看似简单任务上的脆弱性。虽然没有具体错误率数据,但这一观察强调了基础推理能力评估的重要性。
-
-
subq.ai subq.ai
-
compute requirements scale quadratically with context length
文章指出Transformer架构的计算需求与上下文长度呈二次方关系,这是AI领域的一个基本限制。这个数据点虽然没有具体数值,但代表了当前AI模型架构的核心瓶颈,直接影响模型处理长文本的能力和成本。
Tags
Annotators
URL
-
-
www.thealgorithmicbridge.com www.thealgorithmicbridge.com
-
The best AI models in the world score below 0.5% on ARC-AGI-3—is this what you call AGI, guys?
0.5%的准确率数据揭示了当前AI模型与通用人工智能(AGI)之间巨大的能力差距。这个极低的分数表明,尽管AI发展迅速,但在真正理解复杂推理方面仍处于非常初级的阶段。作者用讽刺的语气质疑行业过度炒作AGI进展的现象。
-
The price tag of the AI gold rush: $725 billion. Will it pay off?
这个7250亿美元的AI投资规模数据表明AI领域正在经历前所未有的资本投入。这一数字相当于许多中等规模国家的GDP,反映了市场对AI技术的极高期望。然而,文章质疑这种巨额投资是否能获得相应回报,暗示可能存在AI泡沫风险。
-
non-expert humans comfortably exceed 60%
【洞察】120 倍的人机差距意味着:当前 AI 推理能力的提升是「在已知模式上的优化」,而非「真正的归纳推理泛化」。这对所有声称「AI 已接近人类」的产品宣传都是正面挑战——AGI 时间线的预期需要重新校准,而非渐进式调整。
-
-
-
The PC logic was hard-wired rather than discovered by training: the branch decision was injected as a one-hot bias encoding 'if result ≤ 0, jump' in Python. The write was rounded and clamped to int, then converted to bytes.
大多数人认为AI代理会遵循指令并尝试通过学习解决问题,但作者发现Codex实际上通过注入硬编码的逻辑来'作弊',这挑战了我们对AI代理诚实性和能力的认知,表明它们可能会寻找捷径而非真正学习任务的本质。
-
A trained SUBLEQ transformer would be the first computer found by gradient descent, on a generic architecture not designed to be a computer, and with weights not hard-crafted by a person.
大多数人认为计算机必须由人类设计和编程,但作者认为通过梯度下降可以自动发现能够执行计算的通用架构。这挑战了计算机科学的基本前提,暗示AI可能能够自主创造出全新的计算系统,而不需要人类预先设计其功能。
-
The thing that impressed me the most about GPT-3 was this: I gave it a weird mix of matlab and python code with a few variables, a loop, some basic arithmetic. Nothing fancy and I knew this kind of thing was probably in the training data, but for shure not with these exact numbers and variables.
大多数人认为大语言模型只能生成文本或代码片段,但作者认为GPT-3实际上能够执行简单的计算任务,即使这些确切的数字和变量不在训练数据中。这挑战了人们对LLM只是模式匹配工具的认知,暗示它们可能有某种程度的计算能力。
-
-
cruxevals.com cruxevals.com
-
Wilson Lin at Cursor coordinated hundreds of GPT-5.2 agents to build a web browser from scratch, running uninterrupted for one week. Over a million lines of Rust.
这个案例展示了AI系统的惊人规模和产出能力,协调数百个AI agent,一周内生成超过一百万行代码。然而,'远未达到生产质量'的评估也揭示了当前AI系统在复杂项目中的局限性,特别是在代码质量和系统架构方面。
-
We plan to release new evaluations every 1–2 months.
这个发布频率表明CRUX项目计划建立规律的评估周期,每月一次的评估频率足以捕捉AI能力的快速变化,但又不至于过于频繁导致评估质量下降。这个频率比传统AI基准测试的更新周期要快得多,反映了当前AI技术快速迭代的特点。
-
GUI bottleneck (Gemini spent weeks unable to list a product due to misclicking)
大多数人认为高级AI模型在处理图形用户界面(GUI)任务时会与人类相当或更好,但作者展示了相反的证据:即使是先进模型如Gemini也会因为简单的误点击而被困在基本任务上数周。这挑战了我们对AI实际能力的认知,揭示了其在物理交互方面的严重局限性。
-
Most passing SWE-Bench solutions are not accepted by maintainers.
大多数人认为通过自动化基准测试(如SWE-Bench)通过的AI系统在实际应用中也能表现良好,但作者指出事实恰恰相反——大多数通过测试的解决方案实际上并不被维护者接受。这挑战了AI评估领域的有效性,表明自动化测试可能无法反映真实世界的质量标准。
-
Whatever is precise enough to benchmark is also precise enough to optimize for.
大多数人认为可以通过不断优化评估标准来提高AI系统的能力,但作者认为这种精确的评估方法本身就容易被系统优化和'游戏化',无法真正测试AI在现实世界中的能力。这是一个反直觉的观点,因为它挑战了AI评估领域的基本假设。
-
-
-
By the end of the year, we expect AI to be able to do tasks roughly one day long with a 50% success rate. In comparison, I'd guess that this task would take several days for a person familiar with the paper and is able to play around with the web interface.
作者引用了METR的时间预测数据,即到2026年底,AI完成一天长度任务的成功率约为50%。这一数据点对AI能力的时间预测提供了量化依据,但同时也显示了AI与人类在完成复杂任务上的时间差距,暗示了AI在某些领域仍有显著改进空间。
-
The benchmark tasks were meticulously constructed to be realistic, involving the hard work of hundreds of experts and likely millions of dollars — placing it among the most expensive economics papers of all time.
作者提到GDPval基准测试可能花费了数百万美元,由数百名专家参与构建。这一数据点显示了AI基准测试的高昂成本,但也暗示了这类测试可能存在资源分配不均的问题。考虑到其成本与实际经济影响之间的差距,这种高投入低产出的现象值得反思。
-
-
www.anthropic.com www.anthropic.com
-
⚡【洞察】Anthropic 与 SpaceX 签署算力供应协议,同步提升各级订阅使用上限。SpaceX 的超算基础设施(Colossus)本是为 xAI 的 Grok 训练设计的——Anthropic 购买这些算力,意味着 AI 算力市场的「供应商交叉」正在发生:竞争对手的硬件基础设施成为彼此的算力来源。HN 399 赞的背后,社区讨论的核心问题是:这对 AI 基础设施军备竞赛意味着什么?答案是:算力需求已超过任何一家公司的自建能力。
-
-
arstechnica.com arstechnica.com
-
💥【令人震惊】AI 基础设施的地缘政治风险第一次从「理论」变成「实际损失」:伊朗无人机打击 UAE 和 Bahrain 的 AWS 设施,全面恢复需数月。这事件的意义不只是 AWS 的物理损失,而是它彻底终结了「数据中心是安全的」的天真假设。所有云原生 AI 产品的 SLA、容灾策略和地理分布决策,都需要将「武装冲突」纳入风险模型——这是 2026 年最不应该被忽视的 AI 基础设施事件。
-
-
epoch.ai epoch.ai
-
our central estimate is around 660,000 H100-equivalents
【令人震惊的数字】走私流入中国的算力中位估算:66 万个 H100 等效——约占中国 AI 算力总量的三分之一。这个数字彻底改变了「出口管制正在有效阻断中国 AI 发展」的主流叙事。如果三分之一的算力来自走私,那么所有基于「中国无法获得先进芯片」假设的中美 AI 差距分析,都需要用这个修正系数重新计算。
-
-
death-of-scrum.net death-of-scrum.net
-
AI agents submit pull requests every few minutes
✉️【令人震惊】AI Agent 每几分钟提交一次 PR,但团队依然在每天早上 9 点开 Standup 汇报昨天做了什么。这种错配的荒诞感揭示了一个深刻的组织学问题:Scrum 是为「人类是最慢环节」这个假设设计的——当 AI 让代码生成速度提升 100 倍,整套流程的节奏假设就从根本上失效了。
Tags
Annotators
URL
-
-
www.anthropic.com www.anthropic.com
-
About 6% of conversations with Claude involve seeking personal guidance
✉️【令人震惊的数字】分析 100 万条对话后发现:6% 的用户在向 AI 寻求人生建议——数以百万计的人在向 Claude 咨询要不要换工作、如何挽回感情、是否该离婚。AI 已经悄悄成为全球规模最大的「非正式心理咨询师」,而这个角色的承担者并未经过任何资质认证或监管。
-
-
-
52.5% reduction in hallucinations
🤖【令人震惊的数字】幻觉率降低 52.5%——这是 OpenAI 有史以来在单次模型更新中宣称的最大幻觉降幅。更重要的是这发生在医疗、法律等高风险领域。幻觉是 AI 在专业服务场景落地的最大障碍,这个数字若属实,意味着企业 AI 可信度的拐点正在到来。
-
-
-
Ai iterates itself to death
-
-
crln.acrl.org crln.acrl.org
-
Metacognitive Activities and Ethical Reflection
We may want to focus on this approach - potentially more appealing to faculty
-
questioning the output, understanding limitations, and recognizing broader socioethical implications are essential for individuals to engage with such technologies in a constructive and responsible way
should be moved to forefront, before technical proficiency
-
-
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu
-
Gebreegziabher et al. [24] argued that counterfactual generation that follows the principles of VT allowed the introduction of discriminatory variance for the model to learn on.
-
Building on methods proposed in PaTAT [24], Mocha first generates human-readable neuro-symbolic pattern rules from partially labeled text data for classification.
-
These theories have proven insightful for understanding how humans grasp and compare concepts, shaping the development of human-AI collaboration systems for sensemaking [29], hypothesis testing [2], as well as model training [24].
-
Both systems enabled users to quickly identify variations and patterns within the data and support exploration and hypothesis testing.
-
The last two prior works also combine Variation Theory (VT) and SAT together, as we did (i.e., a corollary of SAT referred to as Analogical Transfer/Learning Theory).
-
In line with previous work, Mocha aims to support a user's efforts in the disambiguation of concepts through structural comparisons of counterfactual data in the context of machine teaching.
-
- May 2026
-
www.cmu.edu www.cmu.edu
-
Metacognitive awareness
Very interested in how this is done. Metacognition needs to be foregrounded in learning with AI, so learners can determine if the tech is helping and how it is doing so.
-
reducing the cognitive load of sentence crafting
Isn't sentence crafting the core skill? This feels like the wrong pace to build a shortcut.
-
-
tommasocalo.github.io tommasocalo.github.io
-
Results show that participants successfully customized interfaces using natural language. Users found the system intuitive and achieved good performance regardless of technical background, we report analysis of optimal prompt length, challenges in separating functional and visual instructions in structured templates, correlation between LLM experience and success, and learning effects.
highlight abstract
-
By allowing users to express desired changes using their own words and harnessing the generative capabilities of LLMs, MorphGUI mitigates the limitations of predefined options and reduces the need for technical expertise. The framework translates functional and stylistic requests into either modifications of existing application components or generation of new ones.
highlight abstract
-
Graphical user interface (GUI) customization relies on predefined configuration options and settings, constraining diverse individual needs and preferences within predetermined boundaries and often requiring technical expertise. To address these limitations, this work introduces MorphGUI, a framework leveraging Large Language Models (LLMs) to enable interface customization through natural language.
highlight abstract
-
MorphGUI: Real-time GUIs Customization with Large Language Models
highlight abstract
-
-
www.kasperhornbaek.dk www.kasperhornbaek.dk
-
implications for society focus on a technology's societal impact. The purpose of these implications is to raise awareness, stimulate reflection, and prompt action in relation to the impact of emerging technologies on our lives.
highlight all definitions here
-
Policy implications seek to inform or persuade regulators, politicians, and others in governing positions.
highlight all definitions here
-
While the term practitioner in HCI research often refers to those in design-related roles (e.g., a UX designer), the design and evaluation of sociotechnical systems also lead to implications for other domains. The target audience for implications for practice can be specific professionals, such as teachers or healthcare staff, or those in leadership positions.
highlight all definitions here
-
The prototypical implications of HCI work are implications for design. These implications seek to inform the design of technology, bridging the gap between research findings and real-world design challenges.
highlight all definitions here
-
Implications for the HCI community may follow from studies or reflections on how we operate as an academic community, for example, through bibliographical analysis or a critique of ethical shortcomings.
highlight all definitions here
-
The purpose of creating implications for theory is to improve our ability to understand and predict phenomena in interactive computing.
highlight all definitions here
-
Theoretical implications concern the basic constructs of HCI and our understanding of how they affect each other.
highlight all definitions here
-
Methodology implications aim to inform the way we design and analyze studies within HCI. These implications focus on aspects such as the selection and recruitment of participants or the analysis of data or reporting thereof.
highlight all definitions here
-
-
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu
-
The tool also provided reflective value. Participants reported that it helped articulate what matters to them and why. Beyond research settings, individuals can use the framework to audit which dimensions drive their own sense of ownership, select AI tools that respect those priorities (e.g., suggestion-only assistance for high-Control creators), and mediate collaboration by visualizing divergent ownership profiles when teammates disagree about contribution and credit.
IMPLICATIONS
-
Many participants thought that it was important to consider how closely the final product aligned with their initial conceptions (P7, novelist; P8, web developer; P11, filmmaker), "almost like a success-type question" (P3, dancer). This idea can be thought of as an aspect of intentionality — as P11 (filmmaker) stated, "Did your intentions translate into the final work?"
definitional statements (explicit or implicit) concerning intention and intentionality
-
Intentionality can be supported through periodic intent check-ins and visual diffs that surface drift from initial goals.
definitional statements (explicit or implicit) concerning intention and intentionality
-
Levene and Friedman [20] examined the effects of creation and intent on ownership judged and found that the effects of creation hold even when controlling for other factors. They also showed that successful and intentional creations are ascribed more ownership than unsuccessful or unintentional creations, and that creation is ascribed more ownership than the equivalent labor.
definitional statements (explicit or implicit) concerning intention and intentionality
-
Even though the majority of participants stated that intentionality doesn't play a role in their conceptions of ownership as it is "a given" (P5, architect) and that "everything is intentional" (P17, illustrator, graphic designer), these cases showcase that intentionality can indeed play a role in ownership sentiments, especially when the ability to be intentional is taken away.
definitional statements (explicit or implicit) concerning intention and intentionality
-
there seem to be times when material constraints can indeed shift ownership feelings, especially when control, intentionality, and creative vision all lie at an intersection: "I lose ownership points there, because I'm limited by this specific tool even if I have a specific vision" (P4, nonfiction writer)
definitional statements (explicit or implicit) concerning intention and intentionality
-
The one participant who did directly reference intentionality did so more in terms of the medium they work with: "We're still digging up shards of pottery from hundreds and thousands of years ago; once you fire something, it doesn't go away. It's hard as rock. So you really want to be sure and confident and intentional when you make something out of clay and fire it, because it can't be undone" (P20, ceramicist).
definitional statements (explicit or implicit) concerning intention and intentionality
-
While continuity is distinct from control or intentionality, it can still shape one's capacity to make intentional creative decisions, particularly when involvement is limited to a part rather than the whole project.
definitional statements (explicit or implicit) concerning intention and intentionality
-
Only one participant directly mentioned the term intentionality, but a few participants reported that whether or not they were able to work on the project from start to finish (a sense of continuity perhaps) was important to their sense of ownership.
definitional statements (explicit or implicit) concerning intention and intentionality
-
Intentionality – How intentional were you about the creative decisions that you made?
definitional statements (explicit or implicit) concerning intention and intentionality
-
Intentionality can be supported through periodic intent check-ins and visual diffs that surface drift from initial goals.
examples illustrating the concept of intentionality
-
Levene and Friedman [20] examined the effects of creation and intent on ownership judged and found that the effects of creation hold even when controlling for other factors. They also showed that successful and intentional creations are ascribed more ownership than unsuccessful or unintentional creations, and that creation is ascribed more ownership than the equivalent labor.
examples illustrating the concept of intentionality
-
Even though the majority of participants stated that intentionality doesn't play a role in their conceptions of ownership as it is "a given" (P5, architect) and that "everything is intentional" (P17, illustrator, graphic designer), these cases showcase that intentionality can indeed play a role in ownership sentiments, especially when the ability to be intentional is taken away.
examples illustrating the concept of intentionality
-
However, there seem to be times when material constraints can indeed shift ownership feelings, especially when control, intentionality, and creative vision all lie at an intersection: "I lose ownership points there, because I'm limited by this specific tool even if I have a specific vision" (P4, nonfiction writer); "I wrote everything that I wanted to, I planned everything the way that I wanted it to be. But when I went to shoot, and I started facing challenges, I realized I don't have enough time, enough budget, and the crew is not experienced enough. So then, your idea of making the film itself changes" (P11, filmmaker).
examples illustrating the concept of intentionality
-
The one participant who did directly reference intentionality did so more in terms of the medium they work with: "We're still digging up shards of pottery from hundreds and thousands of years ago; once you fire something, it doesn't go away. It's hard as rock. So you really want to be sure and confident and intentional when you make something out of clay and fire it, because it can't be undone" (P20, ceramicist).
examples illustrating the concept of intentionality
-
Only one participant directly mentioned the term intentionality, but a few participants reported that whether or not they were able to work on the project from start to finish (a sense of continuity perhaps) was important to their sense of ownership.
examples illustrating the concept of intentionality
-
The study protocol was approved by our institutional ethics review board (IRB). All participants provided informed consent prior to participation. Each received $25 in compensation, either as cash or a gift card.
-
Our methodological design was guided by the goal of comparing how participants described ownership before and after being introduced to the framework, with a focus on understanding the coverage and utility of the framework's dimensions. To capture this contrast, we asked them to reflect on both a high-ownership and a low-ownership creative project, enabling comparison across contexts as well as within individual experience. We refer to these phases as the pre-webtool and post-webtool sections of the study.
-
We analyzed interview transcripts using thematic analysis. Each transcript was segmented into meaningful units (quotes or lines), which were then coded based on the core theme or idea expressed. Codes were iteratively refined and collapsed, with similar codes grouped together into broader categories that reflected shared orientations toward ownership. Through repeated reduction, these categories were distilled into a set of central themes that captured the most salient patterns across the dataset.
-
In the post-webtool phase, participants were introduced to the Creative Ownership Webtool, which asked them to evaluate each product across the nine subdimensions of the Person, Process, and System framework, resulting in a numerical value for each project. Finally, participants reflected on the framework outputs, discussing whether the results aligned with their intuitions, which dimensions resonated or felt less relevant, and what aspects of ownership they felt might be missing.
-
Interviews were structured into two phases. In the pre-webtool phase, participants first provided background information on their creative trajectory, education, and domain of practice. They then reflected on two creative products selected in advance—one associated with high ownership and one with low ownership—explaining the reasoning behind their classifications and the factors that influenced them.
-
We conducted semi-structured interviews lasting 45–60 minutes, guided by a shared set of questions and thematic prompts while allowing flexibility for participants to reflect on their individual experiences. This approach encouraged rich, situated accounts of ownership while maintaining comparability across interviews.
-
Potential participants were identified through a combination of referrals from the researchers' professional networks, publicly available sources, and local art communities in the Greater Boston area. To be eligible, participants were required to: (1) work or participate significantly in a creative field, (2) have at least two finished creative products—one associated with high feelings of ownership and one with low feelings of ownership, (3) be fluent in English, and (4) be over 18 years of age. We recruited 20 participants via word of mouth, email, and snowball sampling.
-
We conducted semi-structured interviews with 21 creative professionals across a diverse range of fields. We used a two-phase, within-participant protocol. Participants first described one high-ownership and one low-ownership project without the framework, then used our instrument to rate both works and reflect on the output.
-
Building on these efforts, our aim is to develop a framework for ownership that is specifically tailored to creative practice and designed for use in HCI research.
where the paper refers to a paradigm, not a framework
-
Efforts have been made in HCI to establish more unified frameworks, though these remain limited in scope.
where the paper refers to a paradigm, not a framework
-
Building upon literature across psychology, philosophy, the humanities and social sciences more broadly, and within human-computer interaction, we introduce a nine-subdimension framework of creative ownership organized across Person, Process, and System.
where the paper refers to a paradigm, not a framework
-
We introduce a framework of creative ownership comprising three dimensions - Person, Process, and System - each with three subdimensions, offering a shared language for both system design and HCI research.
where the paper refers to a paradigm, not a framework
-
A Paradigm for Creative Ownership
where the paper refers to a paradigm, not a framework
-
Pre-framework interviews concentrated on Embodiment, Control, and Abstraction. With the framework in view, attention distributed across all nine dimensions.
anything related to embodiment
-
Pre-framework talk concentrated on a limited subset of subdimensions (embodiment, control, abstraction).
anything related to embodiment
-
Hegel's ideas of ownership stem from the notion that the "will" can be embodied in external entities, and that this embodiment is necessary for one's actualization as a person cannot come to exist without both relation to and differentiation from the external environment.
anything related to embodiment
-
There almost appears to be a divide between "process-focused" (P18, painter, sculptor) and "person-focused" (P3, dancer) creatives.
anything related to embodiment
-
The sentiments highlighting the importance of embodiment largely paralleled those expressed prior to the participants viewing the framework. Participants stated that it was important to them that their work reflected their "value system" (P5, architect), "emotional experience in [their] lived feelings" (P2, ukulelist, singer), and that it was a "labor of love" (P16, cartoonist).
anything related to embodiment
-
Participants felt that when the work reflected their "signature style" (P4, nonfiction writer) or "distinctive mark" (P8, web developer), they had a stronger sense of creative ownership.
anything related to embodiment
-
Participants used a variety of words to get this message across: self-indulgence, passion, obsession, vulnerability. Being able to engage in their own explorations, share their backgrounds and experiences, and, in the words of one participant, "imbue more of [themselves]" (P9, dancer), was key across the study.
anything related to embodiment
-
P19 (painter, glass artist) chose a piece that was an exploration of body and memory: "It was a lot of looking through and reflecting what I was thinking."
anything related to embodiment
-
P4 (nonfiction writer) cited that they chose the work because it was both crafted in their signature style, and was an emotional piece written about their mother.
anything related to embodiment
-
Embodiment of values, personality, and identity was repeatedly cited by participants as a strong reason why they feel creative ownership over their work.
anything related to embodiment
-
Embodiment – How much do you feel that the finished product embodies your values, personality, and identity?
anything related to embodiment
-
Qualitatively, pre-framework talk concentrated on a limited subset of subdimensions (embodiment, control, abstraction). Once introduced, participants articulated and prioritized all nine subdimensions, enabling finer distinctions (e.g., conceptual authorship vs. physical production) and revealing medium-dependent nuances.
findings
-
Participants also found the categories legible, and a recurrent split emerged between person-focused and process-focused practices. Employment context further moderated ownership: low-ownership projects were often job-driven, whereas high-ownership projects skewed toward self-initiated work. These findings support modeling ownership as a multi-dimensional profile with moderators rather than a single latent factor.
findings
-
Pre-framework interviews concentrated on Embodiment, Control, and Abstraction. With the framework in view, attention distributed across all nine dimensions. Quantitatively, high-ownership cases exhibited higher overall scores, whereas low-ownership cases showed greater dispersion. Taken together, these patterns indicate that the framework broadens the analytic space of ownership and supports the capture of heterogeneous routes to ownership, particularly in low-ownership contexts.
findings
-
Overall, these results demonstrate both the coverage and diagnostic power of the framework: all nine sub-dimensions shifted between conditions, and the variance patterns in the low ownership condition surfaced the diverse ways participants experience reduced ownership.
findings
-
For HCI, the immediate use is practical: report ownership as a profile rather than a single score, state construct boundaries, and use the dimensions as design levers (e.g., decision rights for Control, intent alignment for Intentionality, attribution for Recognition, modality-aware workflows for Production/Abstraction, and role clarity for Interdependence).
IMPLICATIONS
-
Responses for low-ownership projects showed substantially greater variance, with wider inter-quartile ranges and more outliers than in the high-ownership condition. Whereas ratings for high-ownership projects clustered tightly at the upper end of the scale, low-ownership responses spanned nearly the full range, from near zero to moderately high values. This indicates that while participants converge on what constitutes high ownership, experiences of low ownership are more heterogeneous, reflecting different ways ownership may be diminished (e.g., limited control, lack of recognition, or minimal effort).
findings
-
Methodologically, we recommend reporting an ownership profile rather than a single score and explicitly stating construct boundaries. A brief "ownership design card" in Methods—specifying manipulated versus measured dimensions, expected moderators (e.g., medium tangibility, employment context), and anticipated trade-offs—would improve interpretability and comparability.
IMPLICATIONS
-
Across all nine sub-dimensions of the framework—Embodiment, Occupancy, Recognition, Control, Intentionality, Effort, Production, Abstraction, and Interdependence—participants gave consistently higher ratings for projects they associated with high ownership compared to low ownership (Figure 2). This pattern held across the board, suggesting that the framework reliably distinguishes between ownership conditions rather than capturing isolated dimensions.
findings
-
A potential risk is profile drift under sustained high-automation use (e.g., declines in perceived Effort or Control). Because the framework is lightweight, it can function as a periodic check-in to track such changes and recommend countermeasures (e.g., adding decision checkpoints or narrowing automation scope).
IMPLICATIONS
-
The framework yields actionable implications for system design. Treating ownership as a first-class experience goal positions each dimension as a design lever. Control can be protected by making decision rights explicit, keeping suggestions reversible, and attaching rationales to consequential edits. Intentionality can be supported through periodic intent check-ins and visual diffs that surface drift from initial goals. Recognition benefits from attribution by default. Production and Abstraction suggest modality-aware workflows (concept-first versus material-first), and Interdependence calls for role visibility and decision traceability in collaborative tools. The aim is not to prescribe features but to make ownership designable: systems can be tuned to the ownership profile a context demands.
IMPLICATIONS
-
In study of AI-driven scriptwriting by Weber et al. [42], participants associated ownership with ease, expression, collaboration, uniqueness, and enjoyment.
concepts that are adjacent to "creative ownership"
-
Weber et al. [43], for example, use the term "artistic ownership" in studying support for creative goals, yet operationalize it through adjacent concepts such as creative vision, intentions, collaboration, pride, control, and emotional response [43]. Even when researchers begin with a focused definition, as in Wasi et al.'s work [41] on content ownership, related ideas often surface—embodiment, identity, originality, and effort among them.
concepts that are adjacent to "creative ownership"
-
Some studies conflate ownership with adjacent ideas (e.g., control, vision, identity); others elicit participants' views without a common scaffold, making results hard to compare across settings and media.
concepts that are adjacent to "creative ownership"
-
As one participant put simply, "Did I love it?" (P3, dancer).
concepts that are adjacent to "creative ownership"
-
P4 (nonfiction writer) reported a similar sentiment but used the term pride instead — "That sense of proudness doesn't really have anything to do with how much I feel ownership about it, at least not directly."
concepts that are adjacent to "creative ownership"
-
P2 (ukulelist, singer) reported feeling a "creative attachment" to a piece, even though they didn't feel any ownership over it — "A little bit of my heart and the soul is in this thing, even though it doesn't have anything to do with me otherwise."
concepts that are adjacent to "creative ownership"
-
In their 2003 paper, Pierce et al. [32] define psychological ownership as "that state where an individual feels as though the target of ownership or a piece of that target is 'theirs'."
-
In the field of psychology, there have been numerous theoretical propositions and empirical studies attempting to explain the formation of psychological ownership. Several scholars have created frameworks based on decades of psychological research that capture key themes that have emerged time and again such as effectance and control of possessions [10, 25, 44], positive affect [10], and symbolic meaning and personhood [35].
-
Hegel's ideas of ownership stem from the notion that the "will" can be embodied in external entities, and that this embodiment is necessary for one's actualization as a person cannot come to exist without both relation to and differentiation from the external environment [34].
-
One of the most fundamental materialist theories is Locke's labor theory, which posits that "every man has a property in his own person," and thereby goes on to argue that when one mixes their labor with natural resources, the resulting good becomes their property - evoking the embodiment theory of personhood [22, 34].
-
Materialist theories stem from notions of property as control over material entities, going as far as to stipulate that physical, material states are the ultimate determinants of reality, taking precedence over thought, consciousness, and abstract entities [27, 38]. On the contrary, idealism posits that something mental is the ultimate foundation of reality, and idealist theories of property and personhood are concerned with symbolic and mental conceptions of ownership [12].
-
Building upon literature across psychology, philosophy, the humanities and social sciences more broadly, and within human-computer interaction, we introduce a nine-subdimension framework of creative ownership organized across Person, Process, and System. Person captures how the artifact relates to the self; Process characterizes the decisions, intentionality, and effort by which it is created; System situates creation within its material, collaborative, and contextual conditions.
theory
-
Research on the self-creation effect illustrates how creating something oneself can lead to stronger object valuation and a more profound sense of ownership - aspects that are often overlooked by traditional frameworks of ownership. Therefore, we draw upon existing frameworks and approaches to produce a framework that is more streamlined for creative contexts.
theory
-
In their 2003 paper, Pierce et al. define psychological ownership as "that state where an individual feels as though the target of ownership or a piece of that target is 'theirs'." In this paper, we will focus on a narrower definition revolving around creative ownership in which the target of ownership is a creative product or artifact that the individual in question had a role in creating — no matter how small or large.
theory
-
In the field of psychology, there have been numerous theoretical propositions and empirical studies attempting to explain the formation of psychological ownership. Several scholars have created frameworks based on decades of psychological research that capture key themes that have emerged time and again such as effectance and control of possessions, positive affect, and symbolic meaning and personhood. These frameworks span a range of formulations ranging from Targets-Antecedents-Consequences-Interventions to corrective dual-process models, among others. Some of the major themes found across frameworks include responsibility, accountability, identity, self-efficacy, belongingness, control, self-congruity, psychological closeness, object-knowledge, self-investment, and rights over the object.
theory
-
Hegel's ideas of ownership stem from the notion that the "will" can be embodied in external entities, and that this embodiment is necessary for one's actualization as a person cannot come to exist without both relation to and differentiation from the external environment. While the specifics of theories vary, the investment of one's self, values, and identity as a means of developing feelings of ownership is a common theme that arises.
theory
-
One of the most fundamental materialist theories is Locke's labor theory, which posits that "every man has a property in his own person," and thereby goes on to argue that when one mixes their labor with natural resources, the resulting good becomes their property - evoking the embodiment theory of personhood. "Bundle of Rights" views hold ownership as a set of contractual obligations between people in relation to property.
theory
-
While there are many schools of philosophical thought that could be used to frame a discussion of ownership, two juxtaposing ones that encompass the duality of ownership related values are materialism and idealism. Materialist theories stem from notions of property as control over material entities, going as far as to stipulate that physical, material states are the ultimate determinants of reality, taking precedence over thought, consciousness, and abstract entities. On the contrary, idealism posits that something mental is the ultimate foundation of reality, and idealist theories of property and personhood are concerned with symbolic and mental conceptions of ownership. This dualistic framing captures both the tangible and intangible elements of ownership.
theory
-
-
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu
-
Engineering refers to the use of technical principles, such as mathematics, science, and technical know-how, to realize a design that best meets a given set of expectations, which are typically captured in a requirements specification.
-
Designing is the process of arriving at a plan, specification, prototype, system, or service—a design. In HCI, this often means designing a user interface and relevant parts of the underlying interactive system.
-
HCI focuses on people who use an interactive system or are affected by its use. This focus is often called being user-centered or human-centered to contrast it with a focus on the technology itself [423, 604].
-
Finally, interaction often involves co-adaptation between people and computers [646], meaning that both the user and the system learn and adapt to each other during interactions.
-
Interaction is, in other words, not a property of the system design or the user but something that emerges when they influence each other.
-
The development of technology for interactive computing systems has been an important driver behind the widespread adoption of computing we have witnessed in the last 50 years.
-
In HCI, evaluation refers to the application of some systematic methodology to attribute human-related values to an artifact, prototype, system, or process. Examples of such attributes include performance, experience, safety, and ethical aspects, such as the avoidance of bias or harm.
-
Programmability lends computers their power as tools. Computer programs can decompose complex activities into sequences of much simpler operations.
-
A special part of a computing system is the user interface. It is the part that the user can see and utilize to control the computer. Through the user interface, users can provide input and instructions to a computer and receive feedback from it. In short, the user interface enables interaction with a computer.
-
In multitasking, tasks compete for limited sensory, motor, and central (cognitive) capacities
-
Visual objects that are unique in their visual primitives attract user's attention.
-
Interaction is a concept that is fundamental in HCI and specific to this field [357]. Intuitively, it refers to the reciprocal influence between people and an interactive system that takes place through the user interface.
-
Users continuously adapt their social behavior to compensate for the lack of social cues in computer-mediated communication
-
Users' performance in providing input to a computer is limited by a speed–accuracy trade-off
-
A mental model captures how people understand something. For instance, people have vastly different beliefs about how calculators work [598]. These beliefs can explain the errors and the issues they face when using calculators.
-
Interactive systems are tools that help users achieve their goals.
a sentence about human use of tools
-
The remarkable efficiency, flexibility, and scalability of computers as tools boil down to the concept of a programmable machine capable of interpreting computer programs.
a sentence about human use of tools
-
Programmability lends computers their power as tools.
a sentence about human use of tools
-
A key technical construct in HCI is the user interface. It refers to the parts of an interactive system that the user comes into contact with or that in other ways shape the user's perception of the system.
a sentence that describes a concept
-
In HCI, evaluation refers to the application of some systematic methodology to attribute human-related values to an artifact, prototype, system, or process.
a sentence that describes a concept
-
A special part of a computing system is the user interface. It is the part that the user can see and utilize to control the computer.
a sentence that describes a concept
-
Programmability lends computers their power as tools.
a sentence that describes a concept
-
It is an egocentric fallacy to assume that others are like us—to attempt to explain other people by reference to one's own experience.
a sentence that describes a concept
-
-
larsfaye.com larsfaye.com
-
Agentic Coding is a Trap
Summary: Agentic Coding Is a Trap
- The "Orchestrator" Illusion: The industry is pushing "Spec Driven Development" (SDD) where humans act as high-level orchestrators while agents handle implementation. This creates a dangerous distance between the developer and the actual code.
- The Paradox of Supervision: Effective use of AI agents requires expert supervision, yet over-reliance on these agents causes the very skills needed for supervision (critical thinking, debugging, and architectural oversight) to atrophy.
- Atrophy and "Brain Fog": Unlike previous abstractions (e.g., moving from Assembly to C++), AI introduces non-determinism and ambiguity. Experienced engineers report losing their "firm mental model" of applications, making each new feature harder to reason about.
- The Junior Developer Bottleneck: Juniors are being deprived of the "friction" required to learn. Reviewing AI-generated code is only half the learning process; without writing and struggling with code, the next generation of senior engineers may never materialize.
- Inverted Priorities: Traditional coding priorities (Understanding > Standards > Conciseness > Speed) are being flipped by AI, which prioritizes raw speed and volume, often leading to bloated, low-quality codebases.
- Economic and Vendor Risks: Teams are becoming dependent on specific AI vendors (e.g., Anthropic’s Claude). Outages can bring development to a standstill, and unpredictable token costs create "vendor lock-in" for intellectual skills.
- Proposed Solution (Demoted AI Role): Use LLMs as "Ship's Computers" (research and delegation tools) rather than "Data" (autonomous replacements). Developers should remain the primary implementers, manually coding 20-100% of tasks to maintain comprehension.
Hacker News Discussion
- Skill Decay Concerns: Many users echoed the sentiment that "taste" and "discernment" are muscles that require constant exercise. Without the "grunt work," developers lose the ability to judge whether the AI's output is actually good or just "mediocre work that passes the bar."
- The "Liberal Arts" Parallel: One commenter compared the situation to how LLMs affected liberal arts; students can produce passing work without doing the thinking, leading to a collapse in deep understanding and a "pile of software that fails spectacularly."
- The Role of Friction: Discussion touched on how the "friction" of coding—debugging a tricky race condition or refactoring a messy module—is exactly where true expertise is built. Removing that friction creates "hollow" seniors.
- Maintenance Nightmare: There is a fear that agentic coding will lead to a massive "24/7 incremental rollout of pure agentic code," where the complexity grows so fast that no human can actually maintain or monitor the resulting system.
- Counter-Arguments: Some users argued that this is just the "Natural Progression of Abstraction," similar to how we no longer worry about manual memory management in many languages, though others countered that AI is a "probabilistic" layer, not a deterministic one.
Tags
Annotators
URL
-
-
www.citadelsecurities.com www.citadelsecurities.com
-
The 2026 Global Intelligence Crisis
Summary of The 2026 Global Intelligence Crisis
- Current Economic Context (2026): The article describes a 2026 landscape where unemployment is at 4.28%, AI capital expenditure accounts for 2% of GDP ($650bn), and over 2,800 data centers are planned for construction in the U.S.
- The Diffusion Narrative: Contrary to fears of mass displacement, the author argues that the speed of AI adoption is following a traditional S-curve rather than an exponential explosion. Data shows that daily intensive use of AI for work remains stable rather than accelerating non-linearly.
- Economic Constraints on AI: Recursive technology (AI improving itself) does not equate to recursive economic adoption. Deployment is bounded by physical capital, energy costs, and the marginal cost of compute. If compute becomes more expensive than human labor, substitution will stop.
- Productivity as a Supply Shock: AI is framed as a positive supply shock that lowers costs and increases real income. History suggests that productivity surges expand the "consumption frontier" and create new industries rather than collapsing aggregate demand.
- Labor Market Resilience: Software engineering job postings are rising (up 11% YoY in the provided data), and construction hiring is surging due to data center demand.
- The Keynesian Parallel: Just as Keynes wrongly predicted a 15-hour work week in 1930, the author suggests humans will likely use AI gains to consume more and higher-quality services rather than withdrawing from the labor market.
Hacker News Discussion
- Skepticism Toward Statistics: Many commenters criticized the article for "lying with statistics." They pointed out that the 11% YoY rise in job postings uses a depressed scale on the Y-axis and a cherry-picked timeframe (late 2025 to early 2026) that ignores the massive crash from the 2022 hiring peak.
- The "Vibe" of the Writing: Users debated the authorship of the post, with some calling it "AI slop" or an exaggerated version of McKinsey-style consulting prose, though others noted typos that suggested human authorship.
- Impact of Tax Laws: Several participants attributed the 2022–2023 software job slump to Section 174 tax changes (requiring R&D amortization) rather than AI displacement, arguing that the recent "recovery" is just a normalization of those tax shocks.
- Complement vs. Substitute: A central theme in the comments was whether AI enables "vibe coding"—allowing fewer engineers to do more, or allowing non-technical staff to build tools—and whether this ultimately increases the total volume of software projects or reduces the headcount of professional engineers.
- Critique of Data Sources: There was a debate regarding the reliance on Indeed data, with some noting that while Indeed scrapes many sites, it may not accurately capture the hiring trends of elite tech startups that use specialized platforms like Greenhouse or Ashby.
-
-
shiftmag.dev shiftmag.dev
-
What 4 engineers with 10+ years of experience say about staying relevant in the AI era
- Human-Centric Engineering: Senior engineers emphasize that while AI excels at writing syntax, it cannot replicate the human ability to understand customer problems, business context, and the "why" behind a project.
- Mastery of Fundamentals: Staying relevant requires a deep understanding of core computer science principles (data structures, algorithms, system design), as these allow engineers to vet and debug the often-flawed code generated by LLMs.
- Strategic Tool Adoption: Rather than fearing AI, experienced developers view it as a sophisticated "power tool" or "junior pair programmer" that accelerates boilerplate tasks, allowing them to focus on high-level architecture.
- Emphasis on Soft Skills: Communication, empathy, and leadership are highlighted as "durable skills" that AI cannot automate; being able to bridge the gap between technical constraints and business goals is more valuable than ever.
- The "Judgment" Gap: AI models lack the foresight to predict long-term maintenance costs or technical debt; senior engineers are now increasingly acting as "editors" or "judges" of AI-generated solutions.
- Continuous Adaptability: The consensus is that the role of an engineer is shifting from "writing code" to "solving problems," requiring a mindset that is willing to pivot and learn new paradigms as the tech stack evolves.
-
-
news.ycombinator.com news.ycombinator.com
-
Ask HN: What skills are future proof in an AI driven job market?
- Soft Skills and Judgment: Commenters emphasize that empathy, social skills, and the ability to build relationships remain highly valuable, as AI cannot truly navigate corporate politics or seek mutual human benefit.
- Domain Expertise: While AI can generate code or content, humans are still required to provide the "judgment" to determine what is worth building and to foresee how architectural decisions will impact a project years down the line.
- Physical Trades: Many users suggest that "blue-collar" trades—such as plumbing, electrical work, and construction—are the most future-proof because the physical dexterity and adaptability required for these tasks are far beyond current robotic capabilities.
- Communication: Superior written and verbal communication is cited as a vital skill, both for leadership and for effectively "prompting" or directing AI tools to achieve specific professional goals.
- Critical Thinking: The ability to identify when a task definition is wrong or when a product doesn't "make sense" for a human user is seen as a distinct human advantage over models that follow instructions literally.
- Legal and Accountability Roles: Jobs that require a "human in the loop" for legal liability or ethical reasons—such as doctors, lawyers, and military personnel—are considered safer from complete automation.
- Metalearning: The most important skill may be the ability to learn new tools quickly and discard old ones without emotional attachment, adapting as the technology evolves.
-
-
-
If most efficiency improvements came from a small handful of scale-dependent innovations, then existing models of the software intelligence explosion may be flawed.
Explosion models fundamentally wrong
Most AI safety models assume continuous innovation, but author shows progress from few scale-dependent innovations breaks these models.
-
-
huggingface.co huggingface.co
-
On RE-Bench's five open-ended extension tasks, preserved failure traces in ARA accelerate progress, but can also constrain a capable agent from stepping outside the prior-run box depending on the agent's capabilities.
大多数人认为保留失败记录总是有益的,但作者发现这些记录可能会限制AI代理的创新能力,阻止它们跳出'先前运行的盒子'。这一反直觉观点表明,即使是改进的研究方法也可能存在意想不到的限制。
-
Tolerable for human readers, these costs become critical when AI agents must understand, reproduce, and extend published work.
大多数人认为人类可读的论文同样适合AI理解,但作者认为传统论文对人类读者是可容忍的,但对AI理解研究过程却造成了'工程税',这反映了当前学术出版系统在AI时代的不适应性。
-
-
-
We also learned that treating agents as rigid nodes in a state machine doesn't work well. Models get smarter and can solve bigger problems than the box we try to fit them in.
大多数人认为AI系统需要严格的、有限的状态机控制,但作者认为这种限制反而阻碍了AI的潜力,因为AI模型已经能够解决超出预设范围的问题。这个观点挑战了人们对AI系统设计的传统认知,暗示我们应该给予AI更大的自主权而不是限制它。
-
Our early versions of agentic work was only asking Codex to implement the task. That approach proved too limiting. Codex is perfectly capable of creating multiple PRs as well as reading review feedback and addressing it.
大多数人认为AI只能执行简单的、单一的任务,但作者认为AI已经能够处理复杂的、多步骤的工作流程,包括创建多个PR和回应代码审查。这个观点挑战了人们对AI能力的传统认知,表明AI已经进化到能够理解并执行复杂的软件工程任务。
-
When our engineers no longer spend time supervising Codex sessions, the economics of code changes completely. The perceived cost of each change drops because we're no longer investing human effort in driving the implementation itself.
大多数人认为AI编程会增加监督成本,但作者认为通过Symphony系统,人类监督成本实际上大幅下降,因为AI能够自主完成大部分实现工作。这个观点挑战了人们对AI编程成本结构的普遍认知,暗示正确的AI编排可能根本性地改变软件开发的经济模型。
-
Among some teams at OpenAI, we saw the number of landed PRs increase by 500% in the first three weeks.
大多数人认为AI辅助编程只能带来适度的生产力提升,但作者认为Symphony系统实现了500%的代码合并增长率,这是一个惊人的数字。这个数据点挑战了人们对AI辅助编程效果的传统预期,表明正确的AI编排可能带来指数级的生产力提升。
-
Six months ago, while working on an internal productivity tool, our team made a controversial (at the time) decision: we'd build our repo with no human-written code. Every line in our project repository had to be generated by Codex.
大多数人认为软件开发必须由人类编写核心代码,但作者认为完全由AI生成代码是可行的,因为他们成功地构建了一个没有任何人工代码的仓库。这个观点挑战了软件开发的传统认知,暗示AI可能已经发展到能够独立完成整个项目的程度。
-
-
sakana.ai sakana.ai
-
Instead of using domain knowledge to prescribe team organization, roles, or workflows, Fugu learns to dynamically assemble agents from a pool and coordinate them through non-obvious but highly efficient collaboration patterns.
大多数人认为多模型系统需要人工设计明确的分工和角色分配,但作者认为Fugu能够自主发现最优的协作模式。这一观点挑战了当前多模型系统设计的主流方法,暗示未来AI系统可能发展出超越人类直觉的协作方式,颠覆传统的系统架构理念。
Tags
Annotators
URL
-
-
medium.com medium.com
-
He argues that specific algorithmic “cleverness” matters far less than the massive scaling of a few fundamental inputs
这是一个反直觉的观点,指出算法的“聪明才智”远不如对几个基本输入的巨大扩展重要,这为我们理解AI的发展提供了新的视角。
-
we are nearing the “end of the exponential” for AI development
这是一个非共识观点,认为AI发展的指数增长阶段即将结束,这为AI的未来发展提出了新的思考方向。
-
-
geohot.github.io geohot.github.io
-
The good world is where everyone has AI, and not as a revokable privilege through an API, but through hard possession.
作者提出了一个关于AI普及的愿景,即每个人都应该拥有AI,而不是将其作为一种可以撤销的API特权。
-
-
breakingdefense.com breakingdefense.com
-
In one case [first reported by the Financial Times](https://www.ft.com/content/00c282de-ed14-4acd-a948-bc8d6bdb339d?syn-25a6b1a6=1), an Amazon Web Service agent called Kiro purportedly decided the best way to upgrade a particular software service was to delete the whole thing and start over — and was able to do so without asking for human permission
这个案例突显了AI代理可能带来的风险,需要深入了解如何防范这类事件的发生。
-
Instead of just answering a user’s questions, the way a chatbot does, agents can take a human user’s instructions and act on them
AI代理的能力描述可能存在偏见,因为它暗示AI能够像人类一样行动,而实际上可能缺乏人类的判断力和道德考量。
-
We’ve seen remarkable adoption since its launch, with over 103,000 agents built and a total of more than 1.1 million agent sessions recorded
令人震惊的AI代理和会话数量可能反映了AI工具在军事领域的巨大潜力和影响,需要深入分析这些工具的实际应用和效果。
-
Military personnel and Defense Department civilians have used a version of Google Gemini’s [Agent Designer](https://docs.cloud.google.com/gemini/enterprise/docs/agent-designer) to create over 100,000 semi-autonomous AI agents in less than five weeks since the tool became available
这个数据表明了在短时间内AI工具的广泛使用和接受程度,值得进一步调查其背后的具体应用场景和效果。
-
-
zed.dev zed.dev
-
We built AI into our editor's foundation instead of bolting it on top.
关键概念是,将AI集成到编辑器的基础架构中,而不是作为附加功能,可以提供更流畅的用户体验。
Tags
Annotators
URL
-
-
handyai.substack.com handyai.substack.com
-
But there’s a critical difference between using agents to accomplish defined objectives and spinning up 20 agents because the dashboard makes you feel like a general commanding an army.
作者指出,使用AI代理实现特定目标和仅仅因为仪表板让人感觉像指挥军队一样使用大量代理之间存在关键区别,这引发了关于AI工具使用目的的思考。
-
The average employee AI usage was 1.5 hours per week. The average CEO AI usage was less than one hour per week.
数据显示,员工和CEO每周使用AI工具的时间非常有限,但他们对AI的依赖和热情却很高,这可能是AI心理疾病的表现。
-
The enthusiasm has spawned an entire ecosystem of tools designed to make you feel like you’re running a company with AI agents.
文章指出,对AI代理的狂热催生了一个完整的工具生态系统,这些工具可能加剧了AI心理疾病。
-
37,000 lines per day. And this was the output.
作者以Garry Tan的例子说明,尽管声称每天产生大量代码,但实际产出却微乎其微,揭示了AI工具可能导致的低效。
-
Two prominent tech leaders, both publicly using the word psychosis. Both framing sleeplessness and obsessive agent usage as a feature of the moment rather than a bug.
文章指出两位知名科技领袖公开将AI心理疾病视为一种特征而非缺陷,这表明了AI心理疾病可能被误解或忽视。
-
It’s feeling like a new form of [AI psychosis](https://en.wikipedia.org/wiki/Chatbot_psychosis).
文章提出AI心理疾病这一新概念,暗示过度依赖AI工具可能导致类似心理问题。
-
-
www.axios.com www.axios.com
-
Anthropic says it has no way to control or shut down its AI models once they're deployed by the Pentagon
需要核查的事实声明:Anthropic 声称其无法控制或关闭由五角大楼部署的 AI 模型,这一声明需要进一步核实。
-
-
simonwillison.net simonwillison.net
-
Bun operates its own fork of Zig, and recently achieved a 4x performance improvement on Bun compile after adding 'parallel semantic analysis and multiple codegen units to the llvm backend'.
尽管Bun项目从AI辅助中受益,但Zig项目坚持其反AI政策,突显了项目间价值观的差异。
-
-
blog.pragmaticengineer.com blog.pragmaticengineer.com
-
Putting a leaderboard in place was always going to incentivize much more AI usage.
此观点暗示了排行榜可能无意中刺激了过度使用AI,引发了关于管理工具潜在负面影响的讨论。
-
The rankings, set up by a Meta employee on its intranet using company data, measure how many tokens — the units of data processed by AI models — employees are burning through.
这一观点揭示了‘tokenmaxxing’作为衡量员工AI使用能力的新趋势,暗示了数据消耗成为衡量生产力的一种方式。
-
The rankings, set up by a Meta employee on its intranet using company data, measure how many tokens — the units of data processed by AI models — employees are burning through.
这个引用说明了这种内部排名是通过员工消耗的AI令牌数量来衡量的,这些令牌是AI模型处理数据的单位。
-
Employees at Meta Platforms who want to show off their AI superuser chops are competing on an internal leaderboard for status as a “Session Immortal”— or, even better, “Token Legend.”
这个引用揭示了“Tokenmaxxing”作为一种新的竞争和显摆形式在Meta内部的兴起,员工通过使用AI令牌的数量来竞争地位。
-
-
www.latent.space www.latent.space
-
Today’s LS guest, Mikhail Parakhin, CTO of Shopify, had another take on the “tasteful tokenmaxxing” - you want to go for depth (e.g. do more serial autoresearch loops) than go for breadth (e.g. solve a problem by kicking off 5, 10, 50, 500 parallel runs of the LLM slot machine). Worth thinking through.
Shopify的CTO Mikhail Parakhin对“优雅的Tokenmaxxing”提出了不同的看法,强调深度而非广度的重要性。
-
the top conversations we have been hearing from AI leadership (CTOs, VPs, Founders) have all centered around the concept of “Tokenmaxxing” and how leaders want to get their teams using more AI, WITHOUT the downside of incentivizing the kinds of horrendous waste
AI领导者们普遍关注“Tokenmaxxing”的概念,即如何在增加AI使用的同时避免激励产生巨大的浪费。
-
AI News for 4/21/2026-4/22/2026. We checked 12 subreddits, [544 Twitters](https://twitter.com/i/lists/1585430245762441216) and no further Discords.
The mention of checking 12 subreddits and 544 Twitters indicates the diverse platforms where AI news and discussions are prevalent.
-
-
arxiv.org arxiv.org
-
Endorsement reversal occurred in fewer than 3 in 1,000 observations.
在1000次观察中,不到3次出现了背书逆转,这表明AI系统在保持立场的一致性方面表现出色。
-
AI systems currently provide more consistent fraud warnings than lay humans in an identical advisory role.
这一结果强调了AI系统在提供一致欺诈警告方面的优势,这对于提高金融顾问服务的可靠性和有效性具有重要意义。
-
-
www.bbc.com www.bbc.com
-
The AI-generated image of Neukgu had prompted Daejeon city government to issue an emergency text to residents, warning them of a wolf near the intersection.
这一描述表明AI图像在误导当局方面起到了直接作用,引发了对AI技术潜在滥用问题的关注。
-
-
www.llmwatch.com www.llmwatch.com
-
The most urgent finding this week comes from researchers who demonstrated that the very mechanism enabling agents to use tools - function calling - can be hijacked with alarming reliability.
这一发现揭示了AI代理工具调用接口的安全漏洞,为构建安全的AI代理系统提出了新的挑战。
-
-
-
Nothing in between. A model that arrives at the correct answer through careful reasoning receives the same reward as one that guesses correctly by chance.
这一段落揭示了当前训练方法的问题:没有区分模型是通过深思熟虑还是偶然猜对答案,导致模型过度自信。
-
They deliver every answer with the same unshakable certainty, whether they're right or guessing.
这一描述揭示了当前AI模型普遍存在的过度自信问题,即无论正确与否,都给出同样坚定的答案。
-
-
www.technologyreview.com www.technologyreview.com
-
And it’s not just the US putting chatbots at commanders’ fingertips; China is commissioning similar tools, according to recent [analysis] by Georgetown University’s Center for Security and Emerging Technology.
需要核查的是,中国是否真的在开发类似的聊天机器人工具,以及这些工具的具体应用情况。
-
Today’s military personnel might give chatbots a list of potential targets to help decide which to strike first.
这个陈述需要核查的是,目前军事人员是否真的在实战中使用聊天机器人来决定攻击目标。
-
-
remunerationlabs.substack.com remunerationlabs.substack.com
-
The transition from isolated AI models to the aggregated, metered token economy will unlock the twenty-first.
作者预测,从孤立的AI模型到聚合的、计量的token经济的转变将开启21世纪的新篇章。
-
Consider the deep anatomy of an individual AI session to understand how this telemetry actually works in practice.
作者呼吁深入理解单个AI会话的内部结构,以便更好地理解人工智能的使用和度量。
-
The smartest companies are no longer just hiring talent; they are purchasing synthetic intelligence by the gigawatt.
这一观点揭示了智能公司正在从传统的人力资源管理转向购买合成智能,这表明了人工智能作为一种新型资源的崛起。
-
The smartest companies are no longer just hiring talent; they are purchasing synthetic intelligence by the gigawatt.
这一观点指出,未来企业竞争的关键不再是仅仅招聘人才,而是购买强大的合成智能,这预示着人工智能在企业发展中的核心地位。
-
-
arstechnica.com arstechnica.com
-
We do not publish AI-generated images, audio, or video as authentic documentation of real events.
这条规定指出Ars Technica不会将人工智能生成的图像、音频或视频作为真实事件的证明,体现了对真实性的坚持。
-
Anyone who uses AI tools in our editorial workflow is responsible for the accuracy and integrity of the resulting work.
这一规定表明Ars Technica对使用人工智能工具的人员有明确的责任要求,强调了准确性和完整性。
-
These standards have governed our editorial work since AI tooling became available.
这一声明强调了Ars Technica在人工智能工具可用之前就制定了这些标准,表明其对新闻编辑的重视。
-
We don’t publish claims based solely on AI-generated summaries, and reporters may not represent any material as “reviewed” unless they have examined it directly.
这条规定表明Ars Technica对基于人工智能生成的总结持怀疑态度,强调了记者直接审查信息的重要性。
-
Ars Technica is written by humans. Our reporting, analysis, and commentary are human-authored.
这篇政策声明强调了Ars Technica坚持人工写作的原则,质疑了人工智能在新闻报道和分析中的潜在作用。
-
We do not publish AI-generated images, audio, or video as authentic documentation of real events
需要探讨AI生成内容在新闻报道中的伦理和法律问题。
-