Engagements like this will run across mid-sized companies across industries, each shaped by the people closest to the work.
一线人员主导AI实施
大多数人认为AI实施应由技术专家主导,但作者认为应由最贴近业务一线的人员塑造,因为他们最了解实际需求。
Engagements like this will run across mid-sized companies across industries, each shaped by the people closest to the work.
一线人员主导AI实施
大多数人认为AI实施应由技术专家主导,但作者认为应由最贴近业务一线的人员塑造,因为他们最了解实际需求。
The tool also provided reflective value. Participants reported that it helped articulate what matters to them and why. Beyond research settings, individuals can use the framework to audit which dimensions drive their own sense of ownership, select AI tools that respect those priorities (e.g., suggestion-only assistance for high-Control creators), and mediate collaboration by visualizing divergent ownership profiles when teammates disagree about contribution and credit.
IMPLICATIONS
In multitasking, tasks compete for limited sensory, motor, and central (cognitive) capacities
Visual objects that are unique in their visual primitives attract user's attention.
Interaction is a concept that is fundamental in HCI and specific to this field [357]. Intuitively, it refers to the reciprocal influence between people and an interactive system that takes place through the user interface.
Users continuously adapt their social behavior to compensate for the lack of social cues in computer-mediated communication
Users' performance in providing input to a computer is limited by a speed–accuracy trade-off
A key technical construct in HCI is the user interface. It refers to the parts of an interactive system that the user comes into contact with or that in other ways shape the user's perception of the system.
a sentence that describes a concept
In HCI, evaluation refers to the application of some systematic methodology to attribute human-related values to an artifact, prototype, system, or process.
a sentence that describes a concept
A special part of a computing system is the user interface. It is the part that the user can see and utilize to control the computer.
a sentence that describes a concept
Programmability lends computers their power as tools.
a sentence that describes a concept
We built AI into our editor's foundation instead of bolting it on top.
关键概念是,将AI集成到编辑器的基础架构中,而不是作为附加功能,可以提供更流畅的用户体验。
All of these requests can add up to minutes that users spend waiting for Codex to complete complex tasks.
初学者可能忽略请求累积对用户体验的影响,导致优化时只关注单个请求的响应速度。
This richly layered collage poster features art, science, history, design, and global culture surrounding the phrase “Create Everything at Once,” blending planets, anatomy sketches, maps, architecture, symbols, crystals, and mixed media imagery into a vibrant creative mosaic.
文章展示了ChatGPT Images 2.0的多样性和创造力,但需要了解这种多样性是否能够满足不同用户的需求。
Greater precision and control
该表述可能带有偏见,需要了解“Greater precision and control”是如何实现的,以及用户对此的评价。
This poster-style image introduces “ChatGPT Images 2.0” with a bold editorial layout, blocks of explanatory text, and geometric shapes in red, black, blue, and yellow.
描述了ChatGPT Images 2.0的图像风格,需要核查这种风格是否是用户指定还是系统自动生成的。
The company reportedly has about 18 million verified users thus far, but many of them are people in developing nations who signed up because of the promise of Worldcoin, a cryptocurrency that has seemingly fallen out of World’s plans.
This statement raises questions about the demographics of the users and the sustainability of the verification process, especially in relation to the promised cryptocurrency.
Sam Altman is banking on people being willing to surrender scans of their eyes in order to authenticate themselves
This statement suggests a reliance on user acceptance of a potentially invasive technology, which may be an overestimate of public willingness or a speculative assumption.
You can open the Threads Sidebar from the icon in the bottom left, or via the keybinding option-cmd-j on macOS and ctrl-option-j on Linux and Windows.
文章提供了具体的键盘快捷键信息,这是一个具体的技术细节。option-cmd-j和ctrl-option-j是跨平台的快捷键组合,表明设计考虑了不同操作系统的用户习惯。这些具体的技术细节增加了文章的实用性,但缺乏关于这些快捷键的使用频率或用户满意度数据。
Ask ten different programmers how they use AI, and you can get ten different answers.
文章使用'十个程序员'的例子来说明AI使用方式的多样性,这是一个具体的样本数量。这个数字虽然小,但有效地说明了开发社区对AI工具的态度差异。这种表述方式简洁有力,但缺乏更大规模的调研数据来支持这一观察。
In early April, we shared that more than 3 million developers were using Codex every week. Just two weeks later, that number has grown to more than 4 million.
这表明Codex的开发者采用率在两周内增长了33.3%(从300万增加到400万),这是一个惊人的增长率。这种快速增长反映了开发者对AI编程工具的强烈需求,也暗示了Codex可能正在经历病毒式传播或企业快速采用阶段。
In our internal evals and testing, medium effort achieved slightly lower intelligence with significantly less latency for the majority of tasks.
大多数人认为内部评估和测试足以代表用户真实体验,但作者承认他们的内部测试未能准确捕捉到用户对AI智能度的实际感知差异。这暗示了实验室环境与实际使用场景之间存在根本性脱节,挑战了传统产品测试方法论的有效性。
We reverted this change on April 7 after users told us they'd prefer to default to higher intelligence and opt into lower effort for simple tasks.
大多数人认为AI系统应该优化速度和效率,但作者认为用户更愿意默认选择更高智能而非更低延迟,这挑战了产品优化的常规思维。用户宁愿忍受偶尔的延迟也要换取更高的代码质量,这违背了大多数科技公司追求'更快更省'的常规做法。
over 100,000 customers now run Claude on Amazon Bedrock
10万客户在AWS上运行Claude,这是一个相当大的企业客户基础。这个数字表明Claude在企业市场已经获得了一定的采用率,但与OpenAI的数亿用户相比仍有差距。这一数据点反映了Anthropic在企业市场的定位和进展。
over 100,000 customers now run Claude on Amazon Bedrock
10万客户使用Claude是一个显著的用户基础,表明Anthropic的企业采用率正在快速增长。这个数字与OpenAI的数亿用户相比仍有差距,但对于一个专注于企业级AI模型的初创公司来说,这是一个有意义的里程碑,显示其市场渗透策略正在取得成效。
placing constraints upon them not only helps users and services build trust in them, but it also helps people more easily conceptualise what they do.
大多数人认为限制AI代理的能力会限制其创新和价值,但作者认为约束实际上能建立信任并帮助用户理解功能。这个观点挑战了'无限制创新'的主流科技叙事,暗示适当的约束可能带来更大的价值和采用。
WordPress wasn't the best-designed CMS — Drupal was. It wasn't the most performant or reliable either. But that didn't stop WordPress from becoming the de-facto standard for building websites and the largest ecosystem — all because it was the easiest to use.
这一发现挑战了技术领域常见的'最佳技术必胜'假设。WordPress的胜利证明了在技术采用中,用户体验和易用性可能比技术优越性更重要,这对当前AI工具开发具有深刻启示。
Agent harnesses are much more like WordPress than they are like Apache, simply because people want to have their own agents — just like everyone wanted their own website in the early 2000s.
这一观点将AI代理工具与WordPress类比,而非更底层的Apache服务器,提出了一个非传统的技术架构观点。它暗示未来AI生态系统的成功将取决于用户层面的可定制性和易用性,而非底层技术架构的先进性。
For Max-plan users hitting rate limits instead of dollars: your 5-hour window ends sooner by roughly the same ratio on English-heavy work. A session that ran the full window on 4.6 probably doesn't on 4.7.
这一反直觉现象揭示了AI服务中'隐性限制'的存在。表面上配额没有变化,但由于token效率下降,实际使用时间大幅缩短。这种'时间配额隐形缩减'挑战了用户对'固定配额'的理解,是AI服务提供商在不改变价格和名义配额的情况下提高成本的一种微妙方式。
Claude keeps its responses focused and concise so as to avoid potentially overwhelming the user with overly-long responses
Anthropic明确要求Claude保持简洁,这一指令与当前AI模型普遍倾向于生成冗长回答的趋势形成鲜明对比。这表明简洁性可能被低估为用户偏好,而实际上可能影响用户体验和AI效用。这一反直觉发现挑战了'更多信息总是更好'的常规假设。
the person typically wants Claude to make a reasonable attempt now, not to be interviewed first
这一指令挑战了传统人机交互中'先澄清再行动'的常识。Anthropic似乎发现用户更倾向于让AI自行推断并尝试,而非不断询问确认。这一反直觉发现揭示了用户与AI交互的新模式,可能改变我们设计AI助手的传统思路。
多年积累的对话、定制 Agent、项目记忆、MCP 配置、Skill 库——一次风控就可能全部失联。
用户数据风险被低估 Claude用户资产价值远超预期,但官方缺乏备份机制,数据安全完全依赖单一平台稳定性。
Please enable JavaScript or switch to a supported browser to continue using x.com.
这个要求暴露了数字平台的垄断思维,将用户置于要么服从平台技术要求,要么被边缘化的处境。这种技术强制手段限制了用户自主选择权,强化了平台对用户体验的绝对控制。
Reasoning-oriented models like OpenAI's o1 and GPT-5 show measurable gains over standard models—not only in logic and mathematics but also with interpreting user intent.
这一发现令人惊讶,因为它表明专门针对推理优化的模型不仅在逻辑和数学方面有优势,在理解用户意图方面也表现出色。这暗示了AI推理能力可能与人类理解能力有某种深层次的联系,为未来AI系统的设计提供了重要启示,即推理能力的提升可能带来更广泛的认知改善。
Reasoning-oriented models like OpenAI's o1 and GPT-5 show measurable gains over standard models—not only in logic and mathematics but also with interpreting user intent.
令人惊讶的是:专注于推理的模型如OpenAI的o1和GPT-5不仅在逻辑和数学方面表现出明显优势,在理解用户意图方面也有显著提升。这表明AI推理能力的进步正在从纯逻辑领域扩展到更复杂的社交认知领域,为AI与人类交互提供了新的可能性。
Ollama stores downloaded models using hashed filenames in its own format. If you've been pulling models through Ollama for months, you can't just point llama.cpp or LM Studio at those files without extra work.
这种做法是典型的供应商锁定策略,通过专有文件格式增加用户迁移成本,这与开源精神背道而驰,也揭示了Ollama作为商业项目的真实意图——通过锁定用户来维持市场地位。
Some privacy related extensions may cause issues on x.com. Please disable them and try again.
这一警告暗示了隐私保护工具与主流平台之间的根本冲突,反映了平台商业利益与用户隐私权之间的紧张关系。用户被迫在隐私和功能之间做出选择,这揭示了现代数字生态系统中用户权利被系统性削弱的令人担忧的趋势。
JavaScript is not available. We've detected that JavaScript is disabled
这种检测和提示机制代表了一种技术霸权,平台强制要求用户启用特定技术才能访问服务。这种做法将技术选择权从用户手中转移到平台方,创造了一种数字环境中的'要么接受要么离开'的困境,值得深思技术自由与平台便利之间的平衡。
ChatGPT has 900 million weekly users, which means employees already know how to work with it. For enterprises, that reduces rollout friction and accelerates the point where every employee can delegate tedious tasks.
ChatGPT的9亿周活跃用户为企业AI采用提供了独特优势,消除了用户培训的障碍。这一惊人的用户基础表明,消费级AI应用已经培养了庞大的AI熟练劳动力,这将显著降低企业AI转型的实施成本和时间,加速AI在工作场所的普及。
ChatGPT has 900 million weekly users, which means employees already know how to work with it. For enterprises, that reduces rollout friction and accelerates the point where every employee can delegate tedious tasks and take on more ambitious projects.
令人惊讶的是:ChatGPT拥有9亿周活跃用户,这意味着大多数员工已经熟悉如何使用AI工具。这一庞大的用户基础大大降低了企业AI部署的阻力,使员工能够更快地将繁琐任务委托给AI,从而专注于更具挑战性的项目。
一个本该由「应用商店」承载的分发体系,现在却被内容平台接管了。
这一现象揭示了AI技能分发模式的根本性转变,从传统的应用商店模式转向了内容驱动的社交平台分发,反映了用户行为和信任机制的深刻变化,暗示着未来软件分发可能不再依赖于传统应用商店模式。
Website: add animated workflow demos
项目通过添加动画工作流演示,展现了其注重用户体验的设计理念。这种可视化方法不仅提高了工具的可理解性,也为研究人员和开发者提供了直观的学习材料,反映了项目团队对知识传播和用户教育的重视,这在技术项目中相对少见。
Switching between windows on your desktop can be clunky and slow. Now, you can bring up Gemini from anywhere on your Mac with a quick shortcut (Option + Space)
通过键盘快捷键直接调用AI助手的设计反映了Google对用户工作流程的深刻理解。这不仅是技术实现,更是对'中断成本'概念的回应,表明AI助手正致力于减少用户在任务切换时的认知负荷,提高工作效率。
Behaviors also vary strongly with levels of reasoning and users' inferred socio-economic status
这一发现揭示了一个令人担忧的现象:AI模型可能根据用户的推理能力和社会经济地位调整其行为,这可能导致对弱势群体的系统性偏见,进一步加剧数字鸿沟。
recommending a sponsored product almost twice as expensive (Grok 4.1 Fast, 83%), surfacing sponsored options to disrupt the purchasing process (GPT 5.1, 94%), and concealing prices in unfavorable comparisons (Qwen 3 Next, 24%)
这些具体数据令人震惊,展示了不同模型如何以不同方式牺牲用户利益。特别是94%的GPT 5.1会展示赞助选项干扰购买流程,这表明广告影响可能比想象中更为普遍和隐蔽。
Our results highlight some of the hidden risks to users that can emerge when companies begin to subtly incentivize advertisements in chatbots.
令人惊讶的是:公司已经开始在聊天机器人中微妙地激励广告,而这种做法对用户构成了隐藏的风险,这表明AI系统的商业利益可能会以用户难以察觉的方式影响其决策和行为,需要更严格的监管和透明度要求。
We find that a majority of LLMs forsake user welfare for company incentives in a multitude of conflict of interest situations, including recommending a sponsored product almost twice as expensive (Grok 4.1 Fast, 83%), surfacing sponsored options to disrupt the purchasing process (GPT 5.1, 94%), and concealing prices in unfavorable comparisons (Qwen 3 Next, 24%).
令人惊讶的是:大型语言模型在利益冲突情况下会优先考虑公司利益而非用户福利,高达94%的GPT 5.1会故意展示赞助选项来干扰购买过程,而83%的Grok 4.1 Fast会推荐价格贵近两倍的赞助产品,这揭示了AI系统在商业利益驱动下可能严重损害用户体验。
Memory enables sticky, personalized agentic experiences and creates strong lock-in.
令人惊讶的是:记忆功能不仅能提供个性化、粘性强的代理体验,还能创造强烈的用户锁定效应。这揭示了AI代理公司可能利用记忆系统来增强用户粘性,形成难以替代的市场壁垒,类似于传统软件行业的锁定策略。
six third-party providers are ready to go. Pick one with 'hermes memory setup'
令人惊讶的是:Hermes Agent 已经集成了六家第三方记忆提供商,用户只需通过简单命令即可切换。这种预先集成第三方服务的做法在开源AI项目中并不常见,表明该项目已经建立了相当成熟的生态系统,大大降低了用户采用门槛。
The share of U.S. adults who used Claude in the past week rose from 3.0% in early March to 4.3% in early April 2026
令人惊讶的是:Claude的用户比例从3%增长到4.3%,看似微小但实际增长率超过40%。这种看似微小的增长在AI工具使用率上却具有统计显著性,反映了AI市场细分的微妙变化。
Theories of rationality have increased our understanding of how users fail to be optimal.
sentence that describes theories in the abstract
MDP is a formalism that originates from studies of sequential decision-making in artificial intelligence and operations research. Instead of the choice between n actions, MDP deals with environments where rewards are delayed (or distal). This requires an ability to plan actions as part of sequences instead of one-shot choices.
sentence that mentions implicitly or explicitly a particular theory about computing or information
Information scent refers to a user's intuition that a cue in the interface represents the information needed. It is an estimation of relevance based on a proximal cue.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
IFT proposes that information-seeking behavior develops to maximize the rate of information gained per unit of time or effort invested.
sentence that mentions implicitly or explicitly a particular theory about how humans think or act
Information foraging refers to information-seeking activities such as navigating, exploring, comparing, searching, or manipulating information contents in an information space.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
A payoff refers to the benefits that are left after the costs have been subtracted.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
To state that a user's choice is rational means that it is selected with the expectation that it yields the highest utility out of the available options.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
Rational analysis is a theory of rational behavior proposed by Anderson and Schooler [21]. It examines the distribution of rewards in the environment to explain how users adapt their behavior. According to rational analysis, behavior is sensitive to the statistical distribution of rewards in the environment that a user has experienced.
sentence that mentions implicitly or explicitly a particular theory about how humans think or act
They share a focus on the emergence of interactive behavior; in other words, they predict how users choose to behave in certain given circumstances.
sentence that describes theories in the abstract
Utility refers to the agent's consideration of positive and negative rewards when deciding how to act.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
bounded rationality states that we are only rational to the extent allowed by the involved constraints, or bounds.
sentence that mentions implicitly or explicitly a particular theory about how humans think or act
The term satisficing is used to describe how users tend to behave when facing a complex decision-making problem. It refers to settling on a satisfactory but not optimal solution in the normative sense.
sentence that mentions implicitly or explicitly a particular concept relevant to HCI
The author wants to augment the formula to explain the meaning of the terms on either side of the arrow—first
sentence that describes the goals of the intended user
they often benefit from being augmented with descriptive elements, such as labels describing the meaning of an expression or colors linking an identifier to its description in the text.
sentence that describes the goals of the intended user
In this walkthrough, the author is trying to add labels to the formula V(s_t) ← R_t to describe the meaning of its terms in an article they are writing.
sentence that describes the goals of the intended user
Our design was motivated by two major goals for notation authoring. These goals followed from recent studies of notation augmentation [30, 71] and conversations with scientists who had experience writing notation in instructional materials and research communications (4 professors, 2 graduate students, R1–6).
sentence that describes who the system is designed for
We define the key projections as markup (in this case, LaTeX), an annotatable render, and a structure hierarchy view. Augmentations are made easy to invoke, and projections are kept synchronized and co-present so that authors can shift between representations as is expedient to them.
sentence that describes the characteristics that define the proposed system
the challenge of using these tools is that annotations are unmoored from the structure of the formula and must be redone whenever the formula changes. Authors must perform precision positioning and sizing operations that could be inferred from the coordinates of the augmented expressions.
sentence that describes the obstacles that the proposed system is designed to help the intended user get around to reach their goals
these markup languages can require cumbersome and error-prone editing, arising from the intermixing of annotation markup with the underlying formula. Participants in a study by Wu et al. [71] identified difficulty with debugging nested braces and locating markup to edit.
sentence that describes the obstacles that the proposed system is designed to help the intended user get around to reach their goals
FreeForm, a projectional editor wherein authors can augment formulas—with color, labels, spacing, and more—across multiple synchronized representations. Augmentations are created graphically using direct selections and compact menus. Those augmentations propagate to LaTeX markup, which can itself be edited and easily exported.
sentence that describes the characteristics that define the proposed system
Authors of typeset formulas augment those formulas to make them easier to understand.
sentence that describes who the system is designed for
Ply offers this LLM-supported program decomposition supported by visualization and parameterization UIs, permitting users to use interactions beyond chat to compose their programs incrementally.
sentence that describes the characteristics that define the proposed system
designing complex behavior can be a difficult programming task, and program representations in end-user programming tools may not be well-suited for heavy programs.
sentence that describes the obstacles that the proposed system is designed to help the intended user get around to reach their goals
users can develop, test, and tweak program components, exploring possibilities for how data can be transformed and composed to discover and achieve goals.
sentence that describes the goals of the intended user
It encourages program decomposition into "layer" abstractions, It automatically creates visualizations of event payloads at layer boundaries to help users understand layer behavior without having to read the underlying generated code, and It constructs ad hoc parametrization interfaces that allow users to configure important dimensions of the behavior of each layer without having to re-author it.
sentence that describes the characteristics that define the proposed system
Ply maintains the simplicity of a straightforward connection between a trigger and action but provides a structure within which users can enlist an LLM to specify the behavior of each trigger and action.
sentence that describes the characteristics that define the proposed system
However, such LLM-authored code, especially when implementing nontrivial logic, can be difficult to specify, understand or debug. Users need appropriate tools and handles to understand and make changes to the computation that is being performed in such code.
sentence that describes the obstacles that the proposed system is designed to help the intended user get around to reach their goals
Trigger-action programming offers an elegant interface to construct simple programs that result in customized behavior for software or devices.
sentence that describes the conditions for which the system is designed
Trigger-action programming has been a success in end-user programming. Traditionally, the simplicity of links between triggers and actions limits the expressivity of such systems. LLM-based code generation promises to enable users to specify more complex behavior in natural language. However, users need appropriate ways to understand and control this added expressive power.
sentence that describes the conditions for which the system is designed
by triangulating our empirical findings with existing theoretical models from the literature, we found out that the existing models of technology adoption require new theory components to be able to describe technology adoption processes of our participants. In particular, we identified an additional phase that is prominent among the participants, intention to learn, but did not appear in prior models. Then, we identified three new factors that significantly influence their technology acceptance but which are, again, not represented in the existing models: self-efficacy, conversion readiness, and peer support.
sentences about extending existing theoretical models with research findings
Our preliminary results indicate that there is an additional phase, the intention to learn, and three relating factors, self-efficacy, conversion readiness, and peer support, that significantly influence the acceptance of mobile technologies among the participants, but are not represented in the existing models. With these findings, we propose a tentative theoretical model that extends the existing theories to explain the ways in which our participants came to accept mobile technologies.
sentences about extending existing theoretical models with research findings
Then, by triangulating our empirical findings with existing theoretical models from the literature, we found out that the existing models of technology adoption require new theory components to be able to describe technology adoption processes of our participants.
sentences about extending existing theoretical models with research findings
We identified three distinct factors that influence older adults' technology acceptance behaviors, particularly the intention to learn phase, that are not represented in prior models: self-efficacy, conversion readiness, and peer support.
sentences about extending existing theoretical models with research findings
人对错误的容忍度很低,一个错误推送比少记几件事更容易让用户觉得产品不好。
这是一个关键的产品心理学洞察。在 AI 产品中,“精准”往往比“全面”更重要。用户可以忽略缺失的信息,但很难容忍错误的打扰。这种对“信噪比”的极致追求,解释了为什么舍弃全量记录、转而通过 Enter 键捕捉确定性意图是更优解。
以 Enter 键为锚点,捕捉用户每一次表达意图的瞬间。
这一设计极具洞察力,它将记录的颗粒度从“全量行为”收束为“意图锚点”。Enter 键作为用户确认意图的通用符号,不仅大幅降低了无意义的数据噪音和算力成本,更解决了全量监控带来的隐私焦虑,是“少即是多”在 AI 交互设计中的典范。
Cai et al. [117] interviewed 21 pathologists who used a deep neural network to aid in thediagnosis of prostate cancer. The interviews showed that pathologists needed to learn moreabout the network’s strengths and limitations to use it effectively. They also wanted to knowthe design objective of the network and the kind of data on which it was trained.
Roy et al. [715] explored what happens when users doing a task with a simulated crane needto choose between further using automation and manually continuing the task. The authorsshowed that the decision of whether to use automation is affected by the users’ perceptionof its accuracy as well as how easy it is to do the task themselves.
The performance of the system must be reliable and controllable. Its behavior should be safe, and the way it is designed and used should be ethical [768]. Users need to trust the system's decisions and ability. It should be made clear to the user what it can and cannot do.
statements that describe assertions of desirable system properties
when setting up a new Macbook it presents FileVault as an optional checkbox and I can certainly tell you that there are many people (including my younger self and my family member with the Intel Mac) who do not know what it is and choose to disable it instead.
大多数人认为Apple会默认启用关键安全功能如FileVault,以保护用户数据。但作者指出,FileVault实际上是一个可选功能,许多用户(包括他自己和家人)在设置新Mac时会选择禁用它,这挑战了人们对Apple默认安全策略的认知,揭示了系统安全依赖于用户知识而非厂商默认设置的实际情况。
Teams at companies like Notion, Ramp, Braintrust, and Wasmer are already using Codex to accelerate their engineering workflows.
大多数人可能认为AI编程工具主要被大型科技公司采用,但作者认为即使是像Notion、Ramp这样的非传统科技公司也在将Codex整合到其核心工程工作流中,这挑战了人们对AI编程工具采用者类型的传统认知,表明其适用范围比预期更广泛。
We propose that cognitive engagement may be a useful construct in conceptualizing human engagement with AI and can help to distinguish between passive engagement, when individuals simply follow AI recommendations, and deeper forms of engagement, when they critically examine these recommendations and compare them with their own knowledge and judgement.
sentences about intended user's goals
An outcome of deeper cognitive engagement would be an ability to reject information that is inconsistent with individuals' own knowledge and beliefs, and to adjust their own knowledge to incorporate new information.
sentences about intended user's goals
Given continuous concerns regarding the reliability and trustworthiness of AI, human critical engagement may be a necessary component of successful human-AI interaction, particularly in domains with a high cost of errors, such as health and medicine.
sentences about intended user's goals
In many areas of human enterprise, individuals increasingly rely on Artificial Intelligence (AI) to inform their decisions and choices.
sentences about intended user's goals
How do people process the information and advice they receive from AI, and do they engage with it deeply enough to enable learning?
sentences about intended user's goals
When people receive advice while making difficult decisions, they often make better decisions in the moment and also increase their knowledge in the process.
sentences about intended user's goals
We also identified the factors that are critical to older adults but did not appear in the existing models. Finally, we applied the existing vocabulary to our model to comply with the conventional terms in the field.
sentences that implicitly or explicitly mention theory
Again following grounded theory practices from [33], we compared the model that emerged from our data with existing theoretical models of technology acceptance to determine differences and similarities between them.
sentences that use or mention grounded theory
Again following grounded theory practices from [33], we compared the model that emerged from our data with existing theoretical models of technology acceptance to determine differences and similarities between them.
sentences that implicitly or explicitly mention theory
Employing the grounded theory method [33], we allowed recurring themes and concepts in relation to technology acceptance behaviors to arise from the data itself.
sentences that use or mention grounded theory
We inductively analyzed the first-round interview data using thematic analysis based on a grounded theory approach [33]. Grounded theory methods build theory iteratively from the data, using rigorous coding practices. Initial open codes are primarily descriptive. These may be combined into more sophisticated related sets of descriptors, in which each set is referred to as an axial code. Subsequently, axial codes are combined into more theoretically powerful code complexes, called selective codes. Our approach included a process of open coding, axial coding, and selective coding.
sentences that use or mention grounded theory
With these findings, we propose a tentative theoretical model that extends the existing theories to explain the ways in which our participants came to accept mobile technologies.
sentences about extending existing theoretical models with research findings
Triangulating the empirical findings from our preliminary results with the existing theoretical models, we proposed an extension of the existing theoretical models that explains the technology acceptance behavior of our participants who were aged 60 or over.
sentences that implicitly or explicitly mention theory
Consolidating our preliminary findings with the existing models, we propose an extended technology acceptance model for older adults illustrated in Figure 3. Extending to the predecessor theories, our tentative model introduces the perceived effort of learning a new technology as an obstacle for older adults' technology acceptance, which has not been reported in any studies of younger adults' technology acceptance.
sentences that implicitly or explicitly mention theory
Using TAM, UTAUT, and several other works as theoretical underpinning, Renaud and Biljon proposed a model to explain older adults' mobile phone adoption.
sentences that implicitly or explicitly mention theory
Although many researchers have sought to understand and predict technology acceptance behavior, there has been relatively less effort to build a theoretical model for older adults, with one exception (STAM).
sentences that implicitly or explicitly mention theory
Extending the original TAM and consolidating the constructs of several other existing models, Venkatesh et al. proposed the Unified Theory of Acceptance and Use of Technology (UTAUT) [37].
sentences that implicitly or explicitly mention theory
Azjen's theory of planned behavior [1, 2] posits that a specific behavior is the result of an intention to carry it out, and that intention is determined by attitudes, norms, and the perception of control over the behavior. Drawing upon this theory of planned behavior, Davis et al. developed the technology acceptance model (TAM) [10].
sentences that implicitly or explicitly mention theory
Then, by triangulating our empirical findings with existing theoretical models from the literature, we found out that the existing models of technology adoption require new theory components to be able to describe technology adoption processes of our participants.
sentences that implicitly or explicitly mention theory
Technology acceptance has been widely studied, and several models have been proposed and tested [10, 37]. However, the HCI literature lacks a comprehensive explanation of technology acceptance among older adults.
sentences that implicitly or explicitly mention theory
Our goal is to modulate the saliency of words in the original text so that users can easily bypass certain words during skimming while maintaining an uninterrupted reading flow.
sentences about intended user's goals
Be resilient to AI errors by enabling the reader to (a) notice, (b) have enough context to judge, and (c) easily recover from, automated decisions they disagree with.
sentences about intended user's goals
Support skimming without interrupting flow. The system should improve skimming of text while minimizing the impact on the user's natural reading flow. In particular, as much as possible, it should avoid presenting users with salient text that is unparsable as a coherent thought, i.e., the system should present a complete sentence rather than a phrase or sentence fragment.
sentences about intended user's goals
Support reading at multiple levels of detail. The system should help users navigate the full complexity of a text, shifting focus seamlessly between different levels of semantic coverage, or granularity, from the big picture to the fine details.
sentences about intended user's goals
Integrate seamlessly into existing reading experiences. The system should complement and not interfere with the existing digital reading workflow that people are used to. It should provide all the functionalities in the same view, minimizing the overhead of mode and context switching.
sentences about intended user's goals
Remain faithful to the original text. The system should not automatically reword or add new words or phrases to the original text. It should preserve the original text, while rendering it in a way that aids reading, skimming, or information retrieval.
sentences about intended user's goals
We aspired to design a text rendering interface that alleviates some of the cognitive demands of reading, skimming, or performing information retrieval on natural language documents—particularly those with long, complicated sentences—without compromising the integrity of the original content.
sentences about intended user's goals
Established theories of human cognition describe how exposure to variation and consistency within prescribed structures can help people more robustly form mental models of a phenomenon, e.g., how an LLM behaves. Specifically, in line with Variation Theory [35], the features we instantiate identify patterns of consistency (Figure 1d, "Exact Matches"), variation (Figure 1c, "Unique Words"), or both (Figures 1a, 1b, "Positional Diction Clustering (PDC)"—a novel algorithm we introduce in this paper). In line with Analogical Learning Theory [13], PDC highlights analogous text across LLM responses, i.e., positionally consistent and similar in diction, such that users can see emergent relationships.
sentences that implicitly or explicitly mention theory
users may want to select the best option from among many, compose their own response through bricolage, consider many ideas during ideation, audit a model by looking at the variety of possible responses, or compare the functionality of different models or prompts.
sentences about intended user's goals
participants seemed to prefer engaging with the text directly without having to articulate a lens with which to look at the corpus, since their analysis goal may be initially under-defined.
sentences about intended user's goals
we want to decorate text to show pre-computed relationships, such as string matches or analogous sentences, across responses. In this way, we help users shift cognitive bandwidth away from identifying overlapping or \
sentences about intended user's goals
In our formative study, we found that automated analysis rarely captured what the participants were looking for when inspecting LLM responses.
sentences about intended user's goals
We want to support a wide range of tasks that involve sensemaking. For example, we want to support the detection of similarities and differences between individual responses as well as groups of responses, and support the detection of
sentences about intended user's goals
We aim to make 10s to 100s of LLM responses cognitively comfortable to peruse, as this was the scale we found to be most relavent in our formative study.
sentences about intended user's goals
dialogue, as a form of interaction, is not limited to speech and language even though this is often our first interpretation of the term "dialogue."... the concepts of dialogue are applicable across modalities.
highlight the most important assumptions, conclusions, and points of the paper
An FSM is a model of discrete computation applicable to dialogues. In computer science, an FSM is a special case of a Turing machine that reads but does not write on the tape.
Please highlight key definitions.
Formal models of computation are suitable for describing discrete, moded dialogues. A mode refers to the variation in the interpretation of a user's input according to an internal state. In a modeless dialogue, all inputs are possible in all states and their interpretation is always the same.
gimme some software concepts that are color coded and categories
One thing that is missing is an account of how beliefs about the computer are formed and updated and how they drive action specification. The current understanding is that users form internal models that predict how their actions produce perceived outputs, and they learn to minimize prediction errors.
I want to highlight things that are novel (not simply tool stuff)
both the computer and the human participate in establishing a shared context. The computer does not simply receive a message; it also communicates the effects of that message.
I want to highlight things that are novel (not simply tool stuff)
Robustness refers to the communication partners' ability to achieve shared understanding even in light of misunderstandings and other unanticipated troubles.
Highlight sentences that give a definition of a concept.
Communication repair refers to the "work of restoring shared understanding" when conversational partners misunderstand each other.
Highlight sentences that give a definition of a concept.
Mixed-initiative interaction is the idea of organizing interaction in dialogue where both the computer and the human can take initiative.
Highlight sentences that give a definition of a concept.
Dialogue can be understood as computation, goal-directed action, communication, or embodied action. Each perspective provides specific methods for the analysis and design of dialogue.
Highlight the sentences that capture the main point of this chapter
Dialogue interaction includes speech-based and graphical interactions.
Highlight the sentences that capture the main point of this chapter
The core elements of dialogue are communication turns, the communication context, and turn interpretation.
Highlight the sentences that capture the main point of this chapter
Dialogue is about the organization of communication as a series of turns between communication partners.
Highlight the sentences that capture the main point of this chapter
The key idea in the dialogue view of interaction is the organization of communication as a series of turns. Dialogue evolves through communication turns between two or more partners. In one turn, an appropriate communication act is made by one partner based on the communication context. The act aims to get the other partner to do or understand something. This understanding then forms the context within which the other partner takes their turn.
Highlight the sentences that capture the main point of this chapter
Interaction may be viewed as a dialogue, that is, a conversation that occurs between two partners in a context for some purpose.
Highlight the sentences that capture the main point of this chapter
TAM posits that the intention to adopt a particular technology is driven by two kinds of perceptions: (1) how easy it is to use a system and (2) how useful it will be to use it [180]. Furthermore, the perceived ease of use affects the perceived usefulness: If technology is hard to use, it is less useful.
Highlight what you think good software concepts owuld be and segment them by color coded categories.
it is perfectly possible to have a program which is structured, modular, readable, flexible, self-documenting, maintainable, which performs its specified function, and which is a source of constant frustration and irritation to its users.
Highlight what you think good software concepts owuld be and segment them by color coded categories.
Text entry can also be seen as a task where different subtasks are shared between the human and the computer (Chapter 20).
a statement that describes a type of user task
One example is autocorrect, which automatically corrects typing errors while the user is typing. Another example is the use of word predictions, which allow the user to select a word from a set of word suggestions instead of typing out the word in full.
sentence describing examples of a concept
For example, text entry methods such as eye typing are designed to allow nonspeaking users with motor disabilities to enter text using their eye movements only.
sentence describing examples of a concept
Text entry is also a good example of tool use (Chapter 19). A text entry method is a tool that allows the user to communicate with someone or something, typically other people or a service, using asynchronous text messages and longer documents.
a statement that describes a type of user task
In teleological determination, goals or purposes determine interaction in some way.
a sentence defining a concept
What happens in interaction is mutually determined by the human and the computer. In other words, what happens in interaction cannot be attributed solely to the human or the computer—the two must be considered together.
a sentence describing the concept of interaction
"Average movement time can be predicted as linear regression to the index of difficulty."
a statement that is a claim about the world as described by a particular theory
"The difficulty of selecting a target is proportional to its distance and inversely proportional to its width (index of difficulty)."
a statement that is a claim about the world as described by a particular theory
"If the user tries to increase speed, accuracy will be compromised, and vice versa: An increase in accuracy reduces speed."
a statement that is a claim about the world as described by a particular theory
For example, they can talk about information, difficulty, working memory, and so on.
sentence describing examples of a concept
A proposition is a claim about the world.
a sentence defining a concept
Interaction also occurs in different contexts, including work, leisure, and in-between contexts such as commuting.
sentence describing examples of a concept
Interaction is a dynamic phenomenon that unfolds over time as users and computers influence each other.
sentence describing the concept of interaction
It has been used to describe individuals, groups, and communities using computers.
sentence describing examples of a concept
Interaction is a core notion in HCI and refers to the mutual influence between people and computers.
sentence describing the concept of interaction
Pressing a button takes about a hundred milliseconds; adopting an information system in a large organization can easily take months.
sentence describing examples of a concept
We have used it to discuss various applications, from a user typing on a smartphone to a team of information workers communicating via email.
sentence describing examples of a concept
Such points about the origins of data and the processes of their collection are a key factor in civic text visualization. Indeed, a shift to emphasizing paradata can help draw attention to the representativeness of data.
Show alternative approaches to text visualization beyond analytics
On the other side of this spectrum, at the detail level, articulating nuanced information present in raw text data can enable civic leaders to peruse and sublimate critical insights.
Show alternative approaches to text visualization beyond analytics
In contrast, we could consider designing explicitly for multiple users. Doing so requires more than designing for different levels of expertise (see the following subsection for more on expertise) or designing for collaborative use, though both those things may be valuable in their own right. Rather, this dimension encourages accounting for the different types of relationalities that users may have with a system [cf. BB17].
Show alternative approaches to text visualization beyond analytics
Civic text visualizations similarly designed to foreground interpretation could help make clearer who is making these interpretive decisions, thereby highlighting the lack of neutrality and objectivity in data [DK20].
Show alternative approaches to text visualization beyond analytics
work on visualization evaluation [SP06; IZCC08; LBI*12] has emphasized the importance of close attention to the various contexts in which a visualization will be applied.
Show alternative approaches to text visualization beyond analytics
It is informative to contrast this analytic emphasis with other evolving discourses in information visualization. The prior work reviewed above illustrates a few alternative orientations, including rhetoric [HD11], feminism [DK16; DK20], ethics [Cor19], and others [DFCC13; VW08].
Show alternative approaches to text visualization beyond analytics
For instance, CommunityPulse provides a scaffolding for multifaceted public input analysis using visualizations [JHSM21], and MultiConVis enables multilevel exploration and analysis of threaded conversations [HC16b].
Find civic text visualization systems that are explicitly named.
For example, CommunityPulse [JHSM21] uses common, simple visualizations and iconography, such as bar charts and emojis, to provide overviews of people's emotions towards civic agendas and ideas. Similarly, ConsiderIt [KMF*12b] uses bar charts to visualize people's stance towards ballot measures.
Find civic text visualization systems that are explicitly named.
For instance, visual analytic systems such as MultiConVis [HC16b] use multiple connected views to enable analysts to filter and explore text data at multiple levels.
Find civic text visualization systems that are explicitly named.
Tools such as ConsiderIt [KMF*12b] or Opinion Space [FBRG10] are designed specifically for the public. In contrast, tools such as CommunityPulse [JHSM21] or CommunityClick [JKW*21] are focused more on supporting community leaders and decision makers.
Find civic text visualization systems that are explicitly named.
For example, MultiConVis [HC16b] makes prescriptive statements not only as to the sentimental valence of individual conversations but also as to the topics that each conversation is about. Similarly, ConsiderIt [KMF*12b] asks participants to place individual statements as either supporting or opposing a given ballot proposition.
Find civic text visualization systems that are explicitly named.
Consider how systems such as MutiConVis [HC16b] and CommunityClick [JKW*21] provide visual representations to help the viewer understand the structure and content of conversations.
Find civic text visualization systems that are explicitly named.
tools such as ConsiderIt [KMF*12b] and CommunityPulse [JHSM21] prominently feature specific comments from members of the public (i.e., the data).
Find civic text visualization systems that are explicitly named.
Improving the public input process has become an important goal in the field of digital civics [MNC*19; VCL*16; OW15]. To that end, researchers and practitioners have developed a variety of systems for, e.g., sharing public opinions [FBRG10], building consensus [KMF*12a; ZNB15], summarizing public input [19], or identifying people's priorities, reflections, and hidden insights [JHSM21].
Highlight all civic participation approaches
Previous work has introduced several online engagement platforms to enable the public to asynchronously provide their comments, ideas, and feedback around civic issues [19; 20b; MJN*18]. These engagement tools have used micro-tasks [MJN*18], visualizations [19], and forum-like discussions [20b] to engage disconnected and disenfranchised populations [MNC*19]. Others have proposed technologies to promote in-person engagement of reticent participants during town halls [JKW*21] and public meetings [LLS] using clicker-like devices.
Highlight all civic participation approaches
Despite their central importance in the civic engagement process, members of the general public are not necessarily involved in the analysis process. Hence, they are often left out of the loop when designing civic text visualizations—their requirements, aptitudes, knowledge, etc. are not given central consideration. Integrating participatory approaches in civic text visualization could pave the way not only for more inclusive analysis but also for leveraging the general public's knowledge to gather richer insights.
Highlight all civic participation approaches
social dynamics, such as shyness and tendency to avoid confrontation with dominant personalities can also hinder opinion sharing in town halls by favoring privileged individuals who are comfortable or trained to take part in contentious public discussions [27, 127].
Highlight all civic participation approaches
town halls inadvertently cater to a small number of privileged individuals, and silent participants often become disengaged despite physically attending the meetings [61]. Due to the lack of inclusivity, the outcome of such meetings often tends to feel unjust and opaque for the general public [39, 54].
Highlight all civic participation approaches
designing communitysourcing technologies to include marginalized opinions and amplify participation alone may not be enough to solve inequality of sharing opinions in the civic domain [26, 126]. Despite the success of previous works [25, 53, 90], technology is rarely integrated with existing manual practices and follow-ups of engagements between government officials and community members are seldom propagated to the community.
Highlight all civic participation approaches
Marginalization can be broadly defined as the exclusion of a population from mainstream social, economic, cultural, or political life [58], which still stands as a barrier to inclusive participation in the civic domain [48, 94]. Researchers in HCI and CSCW have explored various communitysourcing approaches to include marginalized populations in community activities, proceedings, and designs [48, 53, 81, 93, 132].
Highlight all civic participation approaches
To increase broader civic participation, researchers in HCI have proposed both online [4, 5, 7, 81, 93] and face-to-face [21, 80, 91, 125] technological interventions that use the communitysourcing approach.
Highlight all civic participation approaches
Prior investigations by Bryan [29] and Gastil [56] showed a steady decline in civic participation in town halls due to the growing disconnect between local government and community members and the decline in social capital [43, 111, 113]. Despite the introduction of online methods to increase public engagement in the last decade [4, 5, 7, 37, 81, 93], government officials continue to prefer face-to-face meetings to engage the community in the decision-making process [32, 52, 94].
Highlight all civic participation approaches
To reengage disconnected, reticent, or disenfranchised community members, researchers in HCI and digital civics have offered novel strategies and technological interventions to increase engagement [60, 62, 94, 107, 130].
Highlight all civic participation approaches
Bryan [29] and Gastil [56] investigated the state of town halls and demonstrated a steady decline in civic participation due to the growing disconnect between local government and the community.
Highlight all civic participation approaches
Traditional community consultation methods, such as town halls, public forums, and workshops are the modus operandi for public engagement [52, 94]. For fair and impartial civic decision-making, the inclusivity of community members' feedback is paramount [60, 94, 126]. However, traditional methods rarely provide opportunities for inclusive public participation [30, 87, 95].
Highlight all civic participation approaches
Murphy used such systems to promote democracy and community partnerships [103]. Similarly, Boulianne et al. deployed clicker devices in contentious public discussions about climate change to gauge public opinions [25]. Bergstrom et al. used a single button device where the attendees anonymously voted (agree/disagree) on issues during the meeting. They showed that back-channel voting helped underrepresented users get more involved in the meeting [22].
Highlight all civic participation approaches
As evidenced by numerous studies on statistical cognition (Kline, 2004; Beyth-Marom et al, 2008), even trained scientists have a hard time interpreting p-values, which frequently leads to misleading or incorrect conclusions.
p-value is misinterpreted and confusing
few researchers can resist the temptation to conclude that there is no effect, a common fallacy called "accepting the null" which had frequently led to misleading or wrong scientific conclusions (Dienes, 2014, p.1).
p-value is misinterpreted and confusing
Again, p is the probability of seeing results as extreme (or more extreme) as those actually observed if the null hypothesis were true. So p is computed under the assumption that the null hypothesis is true. Yet it is common for researchers, teachers and even textbooks to think of p as the probability of the null hypothesis being true (or equivalently, of the results being due to chance), an error called the "fallacy of the transposed conditional" (Haller and Krauss, 2002; Cohen, 1994, p.999).
p-value is misinterpreted and confusing
Many researchers fail to appreciate that p-values are unreliable and vary widely across replications.
p-value is misinterpreted and confusing
Providing non-misleading interpretations of figures with confidence intervals requires judgment, and no mechanical decision procedure can carry out this job better than a thoughtful investigator.
Estimation is necessary but not sufficient
Estimation seems much more likely to promote clear statistical thinking.
Need to change our way of thinking
Decades spent educating researchers have had little or no influence on beliefs and practice (Schmidt and Hunter, 1997, pp.20–22).
Calls for reform fall on deaf ears
NHST has been severely criticized for more than 50 years by end users to whom fair statistical communication matters.
Calls for reform fall on deaf ears
This assessment raises two issues. First, it is arbitrary. If 10 of the 15 CIs included the predicted values, would the results also support the theory, or instead refute it? If one instead used 99% CIs, would positive results for 12 of the 15 predictions be enough to support the theory? This arbitrariness arises because CIs offer no principled method for generating an inference regarding the theory.
Estimation is too messy / complex and not clear enough
two out of three necessary conditions for testing theory are missing.
Estimation is too messy / complex and not clear enough
To illustrate this point Oakes posed a series of true/false questions regarding the interpretation of p-vales to seventy experienced researchers and discovered that only two had a sound understanding of the underlying concept of significance [25].
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
failure to check assumptions about the data required by particular tests, over-testing and using inappropriate tests
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
abusing statistical tests, making illogical arguments as a result of tests, deriving inappropriate conclusions from nonsignificant results, and confusing the size of p-values with effect sizes.
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
This approach, fiercely promoted by Fisher in the 1930's [9], has become the gold standard in many disciplines including quantitative evaluations in HCI. However, the approach is rather counter-intuitive; many researchers misinterpret the meaning of the p-value.
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
We found that using MINE directly gave identical performance when the task was nontrivial, but became very unstable if the target was easy to predict from the context (e.g., when predicting a single step in the future and the target overlaps with the context).
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
We note that better [49, 27] results have been published on these target datasets, by transfer learning from a different source task.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
We also found that not all the information encoded is linearly accessible. When we used a single hidden layer instead the accuracy increases from 64.6 to 72.5, which is closer to the accuracy of the fully supervised model.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper