it is crucial to prioritize and direct human efforts toward more "suspicious" outputs from LLMs
Please highlight any phrases that describe recommendations made in the paper
we advocate a collaborative approach where humans and LLMs work together to produce reliable and high-quality labels
Please highlight any phrases that describe recommendations made in the paper
LLM annotators and human annotators should not be treated the same, and annotation tools should carefully design their data models and workflows to accommodate both types of annotators
Please highlight any phrases that describe recommendations made in the paper
it is advisable to either mask any confidential information or only use in-house LLMs
Please highlight any phrases that describe recommendations made in the paper
it is recommended that the format of a prompt be similar to the one used in training, as some LLMs use a different prompt format than others
Please highlight any phrases that describe recommendations made in the paper
the selection of label options may work better if it is similar to common options for given tasks, such as [positive, neutral, negative] rather than [super positive, positive, ..., negative] for sentiment classification
Please highlight any phrases that describe recommendations made in the paper
designing an annotation task and a prompt similar to more widely used and standardized NLP tasks is beneficial
Please highlight any phrases that describe recommendations made in the paper
errors encountered during API calls are handled in one of two ways: within our system or by delegating to users. We handle known LLM API errors that can be solved by user-side intervention. This covers cases such as a Timeout or RateLimitError in OpenAI models
Please highlight any phrases that describe the libraries and tools used to implement the idea
errors such as APIConnectionError in OpenAI models occur because of an issue with the LLM API server itself and require intervention from OpenAI.
Please highlight any phrases that describe the libraries and tools used to implement the idea
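The two handling paths described in the excerpts above can be sketched roughly as follows. This is a minimal illustration, not the MEGAnno+ implementation: the exception classes are local stand-ins that only borrow the names mentioned in the text (they are not imported from the OpenAI client library), and `call_with_retry`, `max_retries`, `base_delay`, and the exponential backoff policy are all assumptions made for the example.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


# Hypothetical stand-ins named after the error types mentioned in the text.
# They are NOT the classes from the OpenAI client library.
class Timeout(Exception):
    pass


class RateLimitError(Exception):
    pass


class APIConnectionError(Exception):
    pass


# Assumption: these are the errors handled inside the system (here, by retrying).
RETRYABLE = (Timeout, RateLimitError)


def call_with_retry(
    fn: Callable[[], T],
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on RETRYABLE errors with exponential backoff.

    Any other exception (e.g. APIConnectionError) is not caught here,
    so it propagates to the caller, i.e. it is delegated to the user.
    """
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RETRYABLE:
            if attempt == max_retries:
                raise  # retries exhausted: surface the error to the user
            sleep(base_delay * 2 ** attempt)
    raise RuntimeError("max_retries must be non-negative")
```

Passing `sleep` as a parameter is only a convenience so the backoff can be exercised without actually waiting.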
While MEGAnno+ is designed to support any open-source LLM or commercial LLM APIs, in this work, we only demonstrate OpenAI Completion models for clarity and brevity.
Please highlight any phrases that describe the libraries and tools used to implement the idea
Data Model MEGAnno+ extends MEGAnno's data model where data Record, Label, Annotation, Metadata (e.g., text embedding or confidence score) are persisted in the service database along with the task Schema.
Please highlight any phrases that describe the libraries and tools used to implement the idea
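To make the entities named in the excerpt above concrete, here is one possible shape for them. It is a sketch under stated assumptions: only the names Record, Label, Annotation, Metadata, and Schema come from the text; every field, the `annotator_type` distinction between human and LLM annotators, and the `is_valid` helper are hypothetical and do not reflect the actual MEGAnno+ schema.

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class Schema:
    """Task schema: one label name with its allowed options (assumed shape)."""
    label_name: str
    options: List[str]


@dataclass
class Record:
    """A data record to be annotated (assumed fields)."""
    record_id: int
    text: str


@dataclass
class Label:
    """A single label value assigned under a schema label name."""
    label_name: str
    value: str


@dataclass
class Metadata:
    """Extra information such as a text embedding or a confidence score."""
    name: str
    value: Any


@dataclass
class Annotation:
    """Labels given to one record by one annotator, human or LLM."""
    record_id: int
    annotator_id: str
    annotator_type: str  # hypothetical field: "human" or "llm"
    labels: List[Label]
    metadata: List[Metadata] = field(default_factory=list)


def is_valid(annotation: Annotation, schema: Schema) -> bool:
    """Check that every label uses the schema's name and one of its options."""
    return all(
        label.label_name == schema.label_name and label.value in schema.options
        for label in annotation.labels
    )
```

Keeping the annotator type on the annotation is one simple way to let later workflow steps treat LLM and human labels differently.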
We implement our system as an extension to MEGAnno (Zhang et al., 2022), an in-notebook exploratory annotation tool.
Please highlight any phrases that describe the libraries and tools used to implement the idea
MEGAnno+ is designed to provide a convenient and robust workflow for users to utilize LLMs in text annotation. To use our tool, users operate within their Jupyter notebook (Kluyver et al., 2016) with the MEGAnno+ client installed.
Please highlight any phrases that describe the libraries and tools used to implement the idea
LLM annotators and human annotators should not be treated the same, and annotation tools should carefully design their data models and workflows to accommodate both types of annotators.
Please highlight any phrases that describe the theory behind this work
we go beyond using LLMs to assist annotation for human annotators or to replace human annotators. Rather, MEGAnno+ advocates for a collaboration between humans and LLMs with our dedicated system design and annotation-verification workflows.
Please highlight any phrases that describe the theory behind this work
Despite these advancements, it is essential to acknowledge that LLMs have limitations, necessitating human intervention in the data annotation process. One challenge is that the performance of LLMs varies extensively across different tasks, datasets, and labels. LLMs often struggle to comprehend subtle nuances or contexts in natural language, making involvement of humans with social and cultural understanding or domain expertise crucial.
Please highlight any phrases that describe the theory behind this work
Large language models (LLMs) can label data faster and cheaper than humans for various NLP tasks. Despite their prowess, LLMs may fall short in understanding of complex, sociocultural, or domain-specific context, potentially leading to incorrect annotations. Therefore, we advocate a collaborative approach where humans and LLMs work together to produce reliable and high-quality labels.
Please highlight any phrases that describe the theory behind this work
Valarie A Zeithaml and William L Fuerst. 1983. Age differences in response to grocery store price information. Journal of Consumer Affairs 17, 2 (1983), 402–420.
any bibliographic entry relating to older adults
Mary E Sesto, Curtis B Irwin, Karen B Chen, Amrish O Chourasia, and Douglas A Wiegmann. 2012. Effect of touch screen button size and spacing on touch characteristics of users with and without disabilities. Human Factors: The Journal of the Human Factors and Ergonomics Society 54, 3 (2012), 425–436.
any bibliographic entry relating to older adults
Zhao Xia Jin, Tom Plocher, and Liana Kiff. 2007. Touch screen user interfaces for older adults: button size and spacing. In Universal Access in Human Computer Interaction. Coping with Diversity. Springer, 933–941.
any bibliographic entry relating to older adults
Robin Brewer, Raymundo Cornejo Garcia, Tedmond Schwaba, Darren Gergle, and Anne Marie Piper. 2016. Exploring Traditional Phones as an E-Mail Interface for Older Adults. ACM Transactions on Accessible Computing (TACCESS) 8, 2 (2016), 6.
any bibliographic entry relating to older adults
Janan Al-Awar Smither and Curt C Braun. 1994. Technology and older adults: Factors affecting the adoption of automatic teller machines. The Journal of General Psychology 121, 4 (1994), 381–389.
any bibliographic entry relating to older adults
Wiktoria Wilkowska and Martina Ziefle. 2009. Which factors form older adults' acceptance of mobile information and communication technologies? Springer.
any bibliographic entry relating to older adults
Kerryellen G Vroman, Sajay Arthanat, and Catherine Lysack. 2015. "Who over 65 is online?" Older adults' dispositions toward information communication technology. Computers in Human Behavior 43 (2015), 156–166.
any bibliographic entry relating to older adults
Phil Turner, Susan Turner, and Guy Van de Walle. 2007. How older people account for their experiences with interactive technology. Behaviour & Information Technology 26, 4 (2007), 287–296.
any bibliographic entry relating to older adults
Hironobu Takagi, Akihiro Kosugi, Tatsuya Ishihara, and Kentarou Fukuda. 2014. Remote IT education for senior citizens. In Proceedings of the 11th Web for All Conference. ACM, 41.
any bibliographic entry relating to older adults
Karen Renaud and Judy Van Biljon. 2008. Predicting technology acceptance and adoption by the elderly: a qualitative study. In Proceedings of the 2008 annual research conference of the South African Institute of Computer Scientists and Information Technologists on IT research in developing countries: riding the wave of technology. ACM, 210–219.
any bibliographic entry relating to older adults
Chee Wei Phang, Juliana Sutanto, Atreyi Kankanhalli, Yan Li, Bernard CY Tan, and Hock-Hai Teo. 2006. Senior citizens' acceptance of information systems: A study in the context of e-government services. IEEE Transactions on Engineering Management 53, 4 (2006), 555–569.
any bibliographic entry relating to older adults
Bjorn Niehaves and Ralf Plattfaut. 2014. Internet adoption by the elderly: employing IS technology acceptance theories for understanding the age-related digital divide. European Journal of Information Systems 23, 6 (2014), 708–726.
any bibliographic entry relating to older adults
HH Nap and HP de Greef. 2010. Self-efficacy & stress in senior computer interaction. In Proceedings of the 28th Annual European Conference on Cognitive Ergonomics. ACM, 227–230.
any bibliographic entry relating to older adults
Michael G Morris and Viswanath Venkatesh. 2000. Age differences in technology adoption decisions: Implications for a changing work force. Personnel Psychology 53, 2 (2000), 375–403.
any bibliographic entry relating to older adults
Tracy L Mitzner, Wendy A Rogers, Arthur D Fisk, Walter R Boot, Neil Charness, Sara J Czaja, and Joseph Sharit. 2014. Predicting older adults' perceptions about a computer system designed for seniors. Universal Access in the Information Society (2014), 1–10.
any bibliographic entry relating to older adults
Chaiwoo Lee and Joseph F Coughlin. 2014. PERSPECTIVE: Older Adults' Adoption of Technology: An Integrated Approach to Identifying Determinants and Barriers. Journal of Product Innovation Management (2014).
any bibliographic entry relating to older adults
Sri Kurniawan. 2008. Older people and mobile phones: A multi-method investigation. International Journal of Human-Computer Studies 66, 12 (2008), 889–901.
any bibliographic entry relating to older adults
Vicki L Hanson. 2011. Technology skill and age: what will be the same 20 years from now? Universal Access in the Information Society 10, 4 (2011), 443–452.
any bibliographic entry relating to older adults
Mary C Gilly and Valarie A Zeithaml. 1985. The elderly consumer and adoption of technologies. Journal of Consumer Research (1985), 353–357.
any bibliographic entry relating to older adults
Nancy M Gell, Dori E Rosenberg, George Demiris, Andrea Z LaCroix, and Kushang V Patel. 2013. Patterns of technology use among older adults with and without disabilities. The Gerontologist (2013), gnt166.
any bibliographic entry relating to older adults
Helene Gelderblom, Tobie van Dyk, and Judy van Biljon. 2010. Mobile phone adoption: Do existing models adequately capture the actual usage of older adults? In Proceedings of the 2010 Annual Research Conference of the South African Institute of Computer Scientists and Information Technologists. ACM, 67–74.
any bibliographic entry relating to older adults
Arthur D Fisk, Wendy A Rogers, Neil Charness, Sara J Czaja, and Joseph Sharit. 2009. Designing for older adults: Principles and creative human factors approaches. CRC Press.
any bibliographic entry relating to older adults
Anna Dickinson, Alan F Newell, Michael J Smith, and Robin L Hill. 2005. Introducing the Internet to the over-60s: Developing an email system for older novice computer users. Interacting with Computers 17, 6 (2005), 621–642.
any bibliographic entry relating to older adults
Mario Conci, Fabio Pianesi, and Massimo Zancanaro. 2009. Useful, social and enjoyable: Mobile phone adoption by older people. In Human-Computer Interaction–INTERACT 2009. Springer, 63–76.
any bibliographic entry relating to older adults
Miha Cimperman, Maja Makovec Brenčič, Peter Trkman, and Mateja de Leonni Stanonik. 2013. Older adults' perceptions of home telehealth services. Telemedicine and e-Health 19, 10 (2013), 786–790.
any bibliographic entry relating to older adults
Luca Buccoliero and Elena Bellio. 2014. The adoption of silver e-Health technologies: first hints on technology acceptance factors for elderly in Italy. In Proceedings of the 8th International Conference on Theory and Practice of Electronic Governance. ACM, 304–307.
any bibliographic entry relating to older adults
Today's generations of older adults have not grown up with information and communications technologies that are widely available these days. Thus, there is "a natural confound of age and experience, since today's older adults are exposed to these technologies at a different point in their lives than today's young adults." [17]
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Older people are less likely to have peers with sufficient technology experiences compared to their younger counterparts.
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Incorporating these human factors and practical design suggestions for older adults, Fisk et al. proposed key recommendations for designing mobile devices for this age group [12].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Studies have shown that typical interaction components and techniques of a smartphone often prevent older adults from interacting with it smoothly and quickly. For example, the small size and low contrast of buttons on a mobile display have a significant negative influence on interaction performance such as speed and accuracy [18], and decline in motor skills is correlated with the time required to complete a task [30].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Lee and Coughlin reviewed studies of older adults' technology acceptance and identified ten factors that are critical facilitators or determinants of older adults' acceptance of technology: value, usability, affordability, accessibility, technical support, social support, emotion, independence, experience, and confidence [20].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
most works point out that an individual's personal context [38] and the social context [36] in which the technology is introduced are the primary factors influencing the perception of, experience with, and evaluation of new technological developments among older adults [19].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
One exception is the senior technology acceptance model (STAM) [28]. Using TAM, UTAUT, and several other works as theoretical underpinning, Renaud and Van Biljon proposed a model to explain older adults' mobile phone adoption.
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Several studies have attempted to determine older adults' acceptance of technologies in general, and healthcare-related systems in particular, using the UTAUT framework (e.g., email [14], a telehealth service [7]).
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
As a result, older adults and their adoption of new technologies have been a topic of active research since the advent of consumer technologies (e.g., automated teller machine [32], scanner-equipped grocery stores [41], electronic funds transfer [15]).
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Seniors have historically been late adopters to the world of technology compared to their younger counterparts [24, 40].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Nowadays, older adults are increasingly adopting and adapting to information and communication technologies [5].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
smartphone ownership among older adults has significantly risen in recent years [3]. However, its adoption levels among older adults in the US still sit at 27% as of 2015, whereas some 85% of Americans aged 18-29 are smartphone owners [31].
citations about older adults; for example, the citation numbers being highlighted when the citation is in regards to older adults
Lacking knowledge of and experience with software conventions or general technology use, older people judge that technology is too complex.
the word computer
Younger people have often learned how to use a computer at school or at work. This is often not the case for older people, especially those whose occupation did not involve computer use.
the word computer
I am so used to computers with a real keyboard with a screen. Probably I would not use it (a smartphone) because I wouldn't be able to use its capabilities.
the word computer
The language that you people use versus people who don't know anything about a computer is one of the big things for me. You know when you call apps but I don't know what it is.
the word computer
We also identified the factors that are critical to older adults but did not appear in the existing models. Finally, we applied the existing vocabulary to our model to comply with the conventional terms in the field.
sentences that implicitly or explicitly mention theory
Again following grounded theory practices from [33], we compared the model that emerged from our data with existing theoretical models of technology acceptance to determine differences and similarities between them.
sentences that use or mention grounded theory
Again following grounded theory practices from [33], we compared the model that emerged from our data with existing theoretical models of technology acceptance to determine differences and similarities between them.
sentences that implicitly or explicitly mention theory
Employing the grounded theory method [33], we allowed recurring themes and concepts in relation to technology acceptance behaviors to arise from the data itself.
sentences that use or mention grounded theory
We inductively analyzed the first-round interview data using thematic analysis based on a grounded theory approach [33]. Grounded theory methods build theory iteratively from the data, using rigorous coding practices. Initial open codes are primarily descriptive. These may be combined into more sophisticated related sets of descriptors, in which each set is referred to as an axial code. Subsequently, axial codes are combined into more theoretically powerful code complexes, called selective codes. Our approach included a process of open coding, axial coding, and selective coding.
sentences that use or mention grounded theory
Lastly, while our findings are based on only 24 participants, the sample size is commensurate with the grounded theory approach.
sentences that use or mention grounded theory
We analyzed the second-round interview data using inductive and deductive approaches informed by grounded theory and other qualitative analysis methods [33, 22].
sentences that use or mention grounded theory
With these findings, we propose a tentative theoretical model that extends the existing theories to explain the ways in which our participants came to accept mobile technologies.
sentences about extending existing theoretical models with research findings
We identified three distinct factors that influence older adults' technology acceptance behaviors, particularly the intention to learn phase, that are not represented in prior models: self-efficacy, conversion readiness, and peer support.
sentences about extending existing theoretical models with research findings
Components in red boldface in Figure 3 provide a preview of the new elements we have identified and their relationship to the components proposed in earlier models.
sentences about extending existing theoretical models with research findings
Triangulating the empirical findings from our preliminary results with the existing theoretical models, we proposed an extension of the existing theoretical models that explains the technology acceptance behavior of our participants who were aged 60 or over. Our proposed model incorporates key elements of prior models and introduces novel components that significantly influence the participants' technology acceptance, namely one new phase, intention to learn, and three factors, self-efficacy, conversion readiness and peer support.
sentences about extending existing theoretical models with research findings
Consolidating our preliminary findings with the existing models, we propose an extended technology acceptance model for older adults, illustrated in Figure 3. Extending the predecessor theories, our tentative model introduces the perceived effort of learning a new technology as an obstacle to older adults' technology acceptance, which has not been reported in any studies of younger adults' technology acceptance.
sentences about extending existing theoretical models with research findings
In particular, we identified an additional phase that is prominent among the participants, intention to learn, but did not appear in prior models. Then, we identified three new factors that significantly influence their technology acceptance but which are, again, not represented in the existing models: self-efficacy, conversion readiness, and peer support.
sentences about extending existing theoretical models with research findings
Then, by triangulating our empirical findings with existing theoretical models from the literature, we found that the existing models of technology adoption require new theory components to describe the technology adoption processes of our participants.
sentences about extending existing theoretical models with research findings
Triangulating the empirical findings from our preliminary results with the existing theoretical models, we proposed an extension of the existing theoretical models that explains the technology acceptance behavior of our participants who were aged 60 or over.
sentences that implicitly or explicitly mention theory
Consolidating our preliminary findings with the existing models, we propose an extended technology acceptance model for older adults, illustrated in Figure 3. Extending the predecessor theories, our tentative model introduces the perceived effort of learning a new technology as an obstacle to older adults' technology acceptance, which has not been reported in any studies of younger adults' technology acceptance.
sentences that implicitly or explicitly mention theory
Using TAM, UTAUT, and several other works as theoretical underpinning, Renaud and Van Biljon proposed a model to explain older adults' mobile phone adoption.
sentences that implicitly or explicitly mention theory
Although many researchers have sought to understand and predict technology acceptance behavior, there has been relatively less effort to build a theoretical model for older adults, with one exception (STAM).
sentences that implicitly or explicitly mention theory
Extending the original TAM and consolidating the constructs of several other existing models, Venkatesh et al. proposed the Unified Theory of Acceptance and Use of Technology (UTAUT) [37].
sentences that implicitly or explicitly mention theory
Ajzen's theory of planned behavior [1, 2] posits that a specific behavior is the result of an intention to carry it out, and that intention is determined by attitudes, norms, and the perception of control over the behavior. Drawing upon this theory of planned behavior, Davis et al. developed the technology acceptance model (TAM) [10].
sentences that implicitly or explicitly mention theory
Then, by triangulating our empirical findings with existing theoretical models from the literature, we found that the existing models of technology adoption require new theory components to describe the technology adoption processes of our participants.
sentences that implicitly or explicitly mention theory
Technology acceptance has been widely studied, and several models have been proposed and tested [10, 37]. However, the HCI literature lacks a comprehensive explanation of technology acceptance among older adults.
sentences that implicitly or explicitly mention theory
The beauty of the GP-TSM technique lies in its simplicity: at its core, all GP-TSM does is change the visual saliency of words by adjusting their opacity. This preserves the integrity of the original text and minimizes "ergonomic obtrusiveness" [100] while providing readers with a form of "contextual cuing" to arm them with "incidental knowledge about global context", which they can harness to better assign visual attention and memory when reading [40].
sentences that implicitly or explicitly mention theory
Furthermore, according to Stevens's power law, people perceive changes in gray scale not linearly, but rather by a factor of approximately 0.5 [71]. For instance, a threefold increase in opacity might only be perceived as 1.5 times more significant, further complicating the differentiation of levels.
sentences that implicitly or explicitly mention theory
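For reference, Stevens's power law is usually written with the perceptual factor as an exponent. The worked form below assumes that the "factor of approximately 0.5" in the excerpt above refers to that exponent; the symbols $\psi$, $I$, $k$, and $a$ are the conventional ones and are not taken from the excerpt.

```latex
% Stevens's power law: perceived magnitude \psi as a function of stimulus intensity I
\psi(I) = k \, I^{a}, \qquad a \approx 0.5

% Perceived change for a threefold increase in intensity (k cancels)
\frac{\psi(3I)}{\psi(I)} = 3^{a} \approx 3^{0.5} \approx 1.73
```

Under this reading, tripling the opacity yields well under a threefold perceived change, which is the same qualitative point as the roughly 1.5 times figure quoted in the excerpt.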
This sequence resonates with efficient content absorption strategies highlighted in speed reading literature, where readers first capture the gist and then delve deeper [1, 63]. The interface, therefore, may inadvertently facilitate this structured, layered reading approach, which might explain the improvement in reading efficiency and comprehension.
sentences that implicitly or explicitly mention theory
We adopt the term "saliency" based on its definition (a "bottom-up, stimulus-driven perceptual quality which makes some items stand out from their neighbors") [42], and its use in augmented reality [85, 88], computer vision [17, 55], and cognitive science [37, 56].
sentences that implicitly or explicitly mention theory
Modulating text saliency is a widely studied aspect of textual information representation. This technique modifies the visual attributes of text to promote words of interest and guide readers' attention, making pertinent information more perceptible and thereby enhancing comprehension and the user experience [12, 42].
sentences that implicitly or explicitly mention theory
compressive summarization aims to select the shortest subsequence of words within a sentence that yields an informative and grammatical sentence [64]. This framework allows for a more concise representation of the original content while retaining the essence of its meaning.
sentences that implicitly or explicitly mention theory
Given the cognitive effort reading requires, readers frequently resort to skimming, which is a rapid, selective, and non-linear form of reading [2]. Eye tracking studies [30, 74] validate that such behavior is extremely common. However, multiple studies have suggested a significant trade-off between reading speed and comprehension [65, 66, 76, 87].
sentences that implicitly or explicitly mention theory
Specifically, automated summarization methods can introduce multiple types of errors: "crimes" of omission, hallucination, and misrepresentation.
sentences that implicitly or explicitly mention theory
Automated text summarization techniques, including but not limited to crowd-powered systems [10], prompting large language models (LLMs) [105], and other AI technologies, can address a subset of these difficulties, i.e., the resulting text may be shorter, with simpler sentence structures and fewer unusual words [62]. However, unless there is information within the original document that is truly redundant, the result is a lossy representation of the original document, regardless of whether the process is abstractive or extractive.
sentences that implicitly or explicitly mention theory
Our goal is to modulate the saliency of words in the original text so that users can easily bypass certain words during skimming while maintaining an uninterrupted reading flow.
sentences about intended user's goals
Be resilient to AI errors by enabling the reader to (a) notice, (b) have enough context to judge, and (c) easily recover from, automated decisions they disagree with.
sentences about intended user's goals
Support skimming without interrupting flow. The system should improve skimming of text while minimizing the impact on the user's natural reading flow. In particular, as much as possible, it should avoid presenting users with salient text that is unparsable as a coherent thought, i.e., the system should present a complete sentence rather than a phrase or sentence fragment.
sentences about intended user's goals
Support reading at multiple levels of detail. The system should help users navigate the full complexity of a text, shifting focus seamlessly between different levels of semantic coverage, or granularity, from the big picture to the fine details.
sentences about intended user's goals
Integrate seamlessly into existing reading experiences. The system should complement and not interfere with the existing digital reading workflow that people are used to. It should provide all the functionalities in the same view, minimizing the overhead of mode and context switching.
sentences about intended user's goals
Remain faithful to the original text. The system should not automatically reword or add new words or phrases to the original text. It should preserve the original text, while rendering it in a way that aids reading, skimming, or information retrieval.
sentences about intended user's goals
We aspired to design a text rendering interface that alleviates some of the cognitive demands of reading, skimming, or performing information retrieval on natural language documents—particularly those with long, complicated sentences—without compromising the integrity of the original content.
sentences about intended user's goals
Established theories of human cognition describe how exposure to variation and consistency within prescribed structures can help people more robustly form mental models of a phenomenon, e.g., how an LLM behaves. Specifically, in line with Variation Theory [35], the features we instantiate identify patterns of consistency (Figure 1d, "Exact Matches"), variation (Figure 1c, "Unique Words"), or both (Figures 1a, 1b, "Positional Diction Clustering (PDC)"—a novel algorithm we introduce in this paper). In line with Analogical Learning Theory [13], PDC highlights analogous text across LLM responses, i.e., positionally consistent and similar in diction, such that users can see emergent relationships.
sentences that implicitly or explicitly mention theory
users may want to select the best option from among many, compose their own response through bricolage, consider many ideas during ideation, audit a model by looking at the variety of possible responses, or compare the functionality of different models or prompts.
sentences about intended user's goals
One prior piece of HCI work, ParaLib [51], does explicitly exploit these theories for system feature design, but does this in the domain of code.
sentences that implicitly or explicitly mention theory
There are two hypothesized benefits of this view. One is based on an understanding of human perception: the grid layout should help users compare more LLM responses because the spatial arrangement assists their memory. The other benefit is based on Variation Theory, which posits that discerning the impact of a critical aspect, for example model temperature, is only possible when experiencing variation along that dimension, isolated from variation along other dimensions.
sentences that implicitly or explicitly mention theory
Given that the features implemented in this work are in line with design implications of Variation Theory and Analogical Learning Theory, the results suggest that there may be further utility of these theories for guiding the design of future systems that help users make sense of data and form mental models from examples.
sentences that implicitly or explicitly mention theory
Theories of human concept learning suggest that a key step in forming accurate, robust mental models of a phenomenon is to be able to discern the underlying dimensions of variation (Variation Theory) and any latent structures beneath superficial details (Analogical Learning Theory). By detecting and communicating which sentences are both structurally analogous (by virtue of their position within the response) and semantically related (by virtue of highly overlapping content), users should be able to more easily identify emergent structures, as well as compare and contrast particular compositions of structural elements across responses and syntactic elements that may vary in meaningful ways across analogous sentences within those responses. These theories assert that these subtasks are key ingredients in forming those robust accurate mental models, i.e., learning from the LLM responses in order to better perform their overarching task.
sentences that implicitly or explicitly mention theory
In this work, in line with Variation Theory, the existing and novel features instantiated and described in the next subsection collectively identify patterns of consistency, variation, or both; they are explicitly designed to make emergent dimensions of consistency and variation easier for the user to perceive.
sentences that implicitly or explicitly mention theory
Variation Theory describes how helping people perceive the different dimensions of consistency and variation across examples (here, LLM responses) of the object of learning helps them more quickly and robustly leap to more accurate mental models. Analogical Learning Theory describes how people can form mental models or schema from perceiving structural analogical relationships across superficially varying examples (again, here LLM responses).
sentences that implicitly or explicitly mention theory
Variation Theory [35] and Analogical Learning Theory [13, 14] each propose mechanisms for how people may conceive and update their mental models based on concrete examples, or use their mental model in new situations.
sentences that implicitly or explicitly mention theory
participants seemed to prefer engaging with the text directly without having to articulate a lens with which to look at the corpus, since their analysis goal may be initially under-defined.
sentences about intended user's goals
we want to decorate text to show pre-computed relationships, such as string matches or analogous sentences, across responses. In this way, we help users shift cognitive bandwidth away from identifying overlapping or \
sentences about intended user's goals
In our formative study, we found that automated analysis rarely captured what the participants were looking for when inspecting LLM responses.
sentences about intended user's goals
We want to support a wide range of tasks that involve sensemaking. For example, we want to support the detection of similarities and differences between individual responses as well as groups of responses, and support the detection of
sentences about intended user's goals
We aim to make 10s to 100s of LLM responses cognitively comfortable to peruse, as this was the scale we found to be most relevant in our formative study.
sentences about intended user's goals
No one person on the Excel team is focused on the macro recorder.
Find macros
The macro recorder doesn't work!
Find macros
Four tips for using the macro recorder
Find macros
Never use AutoSum or Quick Analysis while recording a macro.
Find macros
Assigning a macro to a form control, text box, or shape
Find macros
Creating a macro button on the Quick Access Toolbar
Find macros
Creating a macro button on the ribbon
Find macros
Running a macro
Find macros
Filling out the Record Macro dialog box
Find macros
Overview of recording, storing, and running a macro
Find macros
As corporate IT departments have found themselves with long backlogs of requests, Excel users have discovered that they can produce the reports needed to run their businesses themselves using the macro language Visual Basic for Applications (VBA).
Find macros
VBA enables you to achieve tremendous efficiencies in your day-to-day use of Excel. VBA helps you figure out how to import data and produce reports in Excel so that you don't have to wait for the IT department to help you.
Find macros
Eye-typing forces users to think in terms of individual letters. This has a cognitive cost and is not a fluid means of communication.
Highlight tasks
To use such a switch for typing, the SGD interface must be designed with this in mind from the beginning. The most common solution is a scanning keyboard.
Highlight tasks
TTF refers to the ability of technology to support a task [197]. The capabilities of the technology should match the demands of the task and the skills of the individual; in this case, the fit is perfect.
Highlight tasks
A system may be usable for some tasks and less usable for others; it may be usable for some users but not for others.
Highlight tasks
Usability concerns how easily computer-based tools may be operated by users trying to accomplish a task. Usability differs from utility. Usability concerns whether users can use the product in a way that makes it possible to realize its utility; utility is about whether the goal is important to the user.
Highlight tasks
The utility of an interactive system concerns its match with the tasks of users. If the match is good, the tool has high utility; if the tasks that users want to do are not supported by the tool, the tool has low utility.
Highlight tasks
Users actively repurpose tools to make them more personally usable and relevant. Design should support such repurposing. For example, Renom et al. [696] conducted a study on text editing using a novel user interface. They found that exploration and technical reasoning facilitate creative tool use. Users who explore available commands in a tool are better at repurposing its functionality. More surprisingly, engaging in technical reasoning (reasoning about functionality and objects) supports repurposing more than procedural knowledge inherited from other software.
Highlight tasks
Tversky and Jamalian [833] proposed that embodied action is at the core of this. We move our bodies and toss, push, and pull objects. These movements can be thought about, imagined, and referred to in language. This, in turn, can change the substrate of thinking.
Highlight theories. a theory consists of a set of propositions, or statements
Davis [180] proposed that whether an individual ends up using a system, that is, their usage behavior, depends on their intention to use the system.
Highlight theories. a theory consists of a set of propositions, or statements
The theory of task–technology fit (TTF) can illuminate what users consider useful and how this affects their decision to adopt a particular technology. TTF refers to the ability of technology to support a task [197]. The capabilities of the technology should match the demands of the task and the skills of the individual; in this case, the fit is perfect. TTF theory posits that a rational user will choose the tool with the highest fit due to its efficacy and efficiency. Conversely, a system that does not offer a good fit will not be used.
Highlight theories. a theory consists of a set of propositions, or statements
TAM posits that the intention to adopt a particular technology is driven by two kinds of perceptions: (1) how easy it is to use a system and (2) how useful it will be to use it [180]. Furthermore, the perceived ease of use affects the perceived usefulness: If technology is hard to use, it is less useful.
Highlight theories. a theory consists of a set of propositions, or statements
Renom et al. [696] conducted a study on text editing using a novel user interface. They found that exploration and technical reasoning facilitate creative tool use.
What are examples of tasks that the reading gives?
Students who learned to do calculations with an abacus solve mathematical problems differently from others [796]. They rely more on mental imagery of the movement of beads on the abacus, which makes their mental calculations highly efficient for certain types of calculations.
What are examples of tasks that the reading gives?
For example, augmentative and alternative communication (AAC) is concerned with supporting non-speaking individuals with motor disabilities. AAC users rely on speech-generating devices (SGDs) to communicate with other people.
What are examples of tasks that the reading gives?
TTF has been used to assess users' willingness to use various technologies such as email or spreadsheets.
What are examples of tasks that the reading gives?
They provided an example of the usability of software installation. This was quantified through the time it takes to install software.
What are examples of tasks that the reading gives?
For example, a scrollbar is an interaction instrument, or tool, that operates on documents.
What are examples of tasks that the reading gives?
a user using a system to accomplish a task is not markedly different from a person using a hammer to drive nails or an algebraic rule to do calculations in one's head.
What are examples of tasks that the reading gives?
While a tool can enhance performance in cognitively challenging tasks, its extended use may erode the cognitive capability of the user.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
The tool itself may become 'transparent' and we start perceiving 'through it.'
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Using a tool for extended periods can fundamentally change the way a user thinks and perceives both the tool and the world.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
accessibility concerns the match between a user's abilities and the system's required abilities.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
TTF theory posits that a rational user will choose the tool with the highest fit due to its efficacy and efficiency. Conversely, a system that does not offer a good fit will not be used.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
TAM posits that the intention to adopt a particular technology is driven by two kinds of perceptions: (1) how easy it is to use a system and (2) how useful it will be to use it. Furthermore, the perceived ease of use affects the perceived usefulness: If technology is hard to use, it is less useful.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
usability is multidimensional. This means that in most settings, a valid characterization of usability will need to employ several dimensions and measures.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
usability is measurable, that is, it is possible to quantify usability based on users' behaviors or opinions.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
usability is relational; it arises as an interplay between people, tasks (problems), and interactive systems (tools)
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Usability concerns how easily computer-based tools may be operated by users trying to accomplish a task. Usability differs from utility. Usability concerns whether users can use the product in a way that makes it possible to realize its utility; utility is about whether the goal is important to the user.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
The utility of an interactive system concerns its match with the tasks of users. If the match is good, the tool has high utility; if the tasks that users want to do are not supported by the tool, the tool has low utility.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Users actively repurpose tools to make them more personally usable and relevant.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Utility centers on what users want from technology.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Usability is one of the best predictors of users' willingness to adopt software.
Highlight propositions. Propositions make a claim about the world. Propositions characterize entities and link them to other entities, some of which are conceptual.
Cognitive integration means that we internalize the operation of the tool. We not only act but also start thinking as defined by the unique constraints and mechanisms of the tool.
Highlight concepts
accessibility concerns the match between a user's abilities and the system's required abilities. As such, it differs from usability (which is about the relationship between users, tools, and tasks) and utility (which is about whether a tool may be used to complete a task).
Highlight concepts
TTF refers to the ability of technology to support a task. The capabilities of the technology should match the demands of the task and the skills of the individual; in this case, the fit is perfect.
Highlight concepts
TAM posits that the intention to adopt a particular technology is driven by two kinds of perceptions: (1) how easy it is to use a system and (2) how useful it will be to use it. Furthermore, the perceived ease of use affects the perceived usefulness: If technology is hard to use, it is less useful.
Highlight concepts
The second dimension, social acceptability, concerns whether interactions map well to the social norms and roles in the settings where they occur.
Highlight concepts
Acceptability has two main dimensions. The first dimension, practical acceptability, includes costs, the reliability of the interactive system, and its compatibility with other systems. The perceptions of utility and usability may also influence the judgment of practical acceptability.
Highlight concepts
usability is multidimensional. This means that in most settings, a valid characterization of usability will need to employ several dimensions and measures.
Highlight concepts
usability is measurable, that is, it is possible to quantify usability based on users' behaviors or opinions.
Highlight concepts
usability is relational; it arises as an interplay between people, tasks (problems), and interactive systems (tools)
Highlight concepts
The utility of an interactive system concerns its match with the tasks of users. If the match is good, the tool has high utility; if the tasks that users want to do are not supported by the tool, the tool has low utility.
Highlight concepts
Usability concerns how easily computer-based tools may be operated by users trying to accomplish a task. Usability differs from utility. Usability concerns whether users can use the product in a way that makes it possible to realize its utility; utility is about whether the goal is important to the user.
Highlight concepts
Research has drawn from linguistics, especially pragmatics, to understand how the way we talk with computers changes depending on the communication context.
theories
According to Suchman, robustness is a key consideration in the design of dialogue. Robustness refers to the communication partners' ability to achieve shared understanding even in light of misunderstandings and other unanticipated troubles.
theories
HCI researchers have developed a rich palette of theories to understand such dialogues. These theories explain what happens in dialogue and how it shapes the relationship between the partners. These theories also have implications for how we design interaction.
theories
Comparing mode-based interactions. A device is designed to allow users to control the relative humidity in their house. The device has two modes. In Automatic mode, the system keeps the relative humidity in the 50%–60% range. In the Manual mode, the user can set the desired level of relative humidity and the system will attempt to maintain it. The device is a small wall-mounted unit with the following UI elements. (a) The visual display indicates the current level of relative humidity and whether the system is in Automatic or Manual mode. (b) The "–" and "+" buttons enable the user to reduce or increase the desired level of relative humidity, respectively. (c) The "Automatic" button puts the system in Automatic mode. If the user pushes the "–" or "+" button, the system switches to Manual mode and remains in that mode until the user pushes the "Automatic" button. (a) Draw a state diagram for this system. (b) By viewing interaction with this system as goal-directed action, explain the steps comprising the gulf of evaluation and the gulf of execution for this UI. (c) State the type and level of automation of this system. (d) Is this system a mixed-initiative interface? Justify your answer.
the tasks from the paper
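The humidity-control exercise above describes a two-mode device, so its behaviour can be sketched as a small finite-state machine. The following is a minimal sketch, not a reference solution: the class name `HumidityController`, the 5-point step size, the initial setpoint of 55%, and the choice that the first "+" or "-" press both switches to Manual mode and adjusts the setpoint are all assumptions that the exercise text does not specify. The ASCII "-" stands in for the "–" button.

```python
from dataclasses import dataclass

AUTO = "Automatic"
MANUAL = "Manual"


@dataclass
class HumidityController:
    """Two-mode humidity controller modelled as a finite-state machine."""

    mode: str = AUTO
    setpoint: int = 55  # assumed starting manual target in %, not given in the exercise
    step: int = 5       # assumed change per button press in %, not given in the exercise

    def press(self, button: str) -> str:
        """Apply one button press and return the resulting mode."""
        if button == "Automatic":
            # Only the "Automatic" button leads back to Automatic mode.
            self.mode = AUTO
        elif button in ("+", "-"):
            # Any "+" or "-" press switches to (or stays in) Manual mode.
            self.mode = MANUAL
            delta = self.step if button == "+" else -self.step
            # Assumption: the press also adjusts the target, clamped to 0-100 %.
            self.setpoint = max(0, min(100, self.setpoint + delta))
        else:
            raise ValueError(f"unknown button: {button!r}")
        return self.mode

    def target_range(self) -> tuple[int, int]:
        """Return the humidity range the device tries to maintain."""
        if self.mode == AUTO:
            return (50, 60)  # range stated in the exercise
        return (self.setpoint, self.setpoint)


controller = HumidityController()
controller.press("+")          # Automatic -> Manual
controller.press("Automatic")  # Manual -> Automatic
```

Writing the transitions out this way makes the mode switch explicit: a user who presses "+" while in Automatic mode changes the mode as a side effect, which is the kind of behaviour the gulf-of-evaluation question asks about.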
Mixed-initiative interfaces. Pick any AI-assisted feature that you are familiar with. Assess it against Horvitz's principles of mixed-initiative interfaces.
the tasks from the paper
Gulfs. Pick a graphical user interface, for example, something you use for education. Then, choose a task, for example, "sending a message to the teacher." Assess this task through the lens of Norman's two gulfs: the gulf of evaluation and the gulf of execution.
the tasks from the paper
Theories of human–computer dialogue. Consider the following potential dialogue interfaces: (a) a user interacting with an automated chat agent from an airline to resolve a delayed flight; (b) a child uploading homework using a web interface; and (c) a user who is trying to show a picture on their mobile phone on a nearby television screen. Make any necessary assumptions about the interfaces and discuss which model of dialogue would provide the most insight for each interface: (a) FSMs, (b) dialogue as goal-directed action, (c) dialogue as embodied action, or (d) dialogue from a communication perspective.
the tasks from the paper
Communication partners: Who are the actors in the dialogue? Communication goals: What is the final state the computer should be in for the user to consider the task completed? Communication act: What are the possible communication acts? In other words, what are the possible utterances or messages that can be delivered? Communication sequence: Draw a sequence of the communication turns leading to the goal, similar to Figure 18.1. Initiative: To which degree can each partner initiate communication on their own? Cue: Which cues are shown to help the user understand the state of the computer? Feedback: Which cues are shown to help the user understand the effects of their communication acts?
the tasks from the paper
Core concepts of dialogue interaction. Dialogue offers a rich conceptual framework for understanding interaction. First, choose an everyday interaction with which you are familiar. It can be anything from filling out a form to chatting with a chatbot. Then, choose a particular dialogue to focus on, for example, creating a user account or printing a document. Now, provide the following information for the dialogue:
the tasks from the paper
Generally, it is beneficial when mixed-initiative interfaces learn and adapt to individual users.
a statement with a condition, relating one or more concepts, with a consequence/result
Because users' goals and situations change over time, the system is never "ready."
a statement with a condition, relating one or more concepts, with a consequence/result
The feasible communication acts and their effects are conditioned by the state of the partner.
a statement with a condition, relating one or more concepts, with a consequence/result
The paradoxical effect of hyperarticulation is that despite trying to improve understanding, it can make speech recognition worse.
a statement with a condition, relating one or more concepts, with a consequence/result
When an automated action is taken, it is important to consider the timing, as incorrectly timed automated actions can distract the user.
a statement with a condition, relating one or more concepts, with a consequence/result
If there is ambiguity about what the user wants and wrong automation might harm the user, the system should ask for more information or not carry out the command.
a statement with a condition, relating one or more concepts, with a consequence/result
Since the system will be unlikely to always automate functions successfully, it is important that users can directly trigger and terminate functions.
a statement with a condition, relating one or more concepts, with a consequence/result
If the system is uncertain about the user's intent, the system should ask the user after having considered the cost of interrupting the user.
a statement with a condition, relating one or more concepts, with a consequence/result
If a system operates under a high uncertainty of the user's goals, the system should perform less automation to avoid interrupting the user with poor suggestions.
a statement with a condition, relating one or more concepts, with a consequence/result
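The conditional statements above about uncertainty, harm, and the cost of interruption can be read as a simple decision rule. The following is a minimal sketch under stated assumptions, not the method given in the reading: the function `choose_action`, its parameters, the numeric utility scale, and the simplification that asking always resolves the user's intent are all hypothetical and chosen only for illustration.

```python
def choose_action(
    p_intent: float,
    harm_if_wrong: float,
    interruption_cost: float,
    benefit: float = 1.0,
) -> str:
    """Pick 'act', 'ask', or 'wait' by comparing rough expected utilities.

    p_intent          -- system's estimated probability that it has the user's goal right
    harm_if_wrong     -- assumed cost of automating the wrong thing
    interruption_cost -- assumed cost of interrupting the user with a question
    benefit           -- assumed value of doing the right thing
    """
    # Acting pays off if the guess is right and costs harm if it is wrong.
    expected_act = p_intent * benefit - (1.0 - p_intent) * harm_if_wrong
    # Assumption: asking always reveals the intent, at the price of an interruption.
    expected_ask = benefit - interruption_cost
    # Doing nothing is the zero-utility baseline.
    expected_wait = 0.0

    options = [("act", expected_act), ("ask", expected_ask), ("wait", expected_wait)]
    return max(options, key=lambda option: option[1])[0]


# Confident guess, mild harm: automate.
confident = choose_action(p_intent=0.95, harm_if_wrong=0.5, interruption_cost=0.3)
# Ambiguous intent, harmful mistake, cheap interruption: ask first.
ambiguous = choose_action(p_intent=0.5, harm_if_wrong=2.0, interruption_cost=0.3)
# Ambiguous intent, harmful mistake, costly interruption: hold back.
busy_user = choose_action(p_intent=0.5, harm_if_wrong=2.0, interruption_cost=1.5)
```

Under these assumed numbers the three calls return "act", "ask", and "wait" respectively, mirroring the statements that high uncertainty should lead to less automation and that asking is worthwhile only after weighing the cost of interrupting the user.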
When there is a misunderstanding about the context of the dialogue, errors may happen, and the partners must recover from them.
a statement with a condition, relating one or more concepts, with a consequence/result
If the supervising user wants to intervene, the gulf of evaluation becomes relevant.
a statement with a condition, relating one or more concepts, with a consequence/result
The mapping requires the user to figure out how to accomplish a goal with an interface. It implies that "The user must translate the psychological goals and intentions into the desired system state, then determine what settings of the control mechanisms will yield that state, and then determine what physical manipulations of the mechanism are required" [600, p. 37].
sentences that cite other researchers, exhaustive list
In direct manipulation interfaces (Chapter 28), the visual presentation of an object resembles its physical correspondent and can be directly acted on. For example, text in a text editor can be highlighted, deleted, or changed by point-and-click-style interactions [416].
sentences that cite other researchers, exhaustive list
the seven-stage model of interaction proposed by Norman [600] applies to all modalities of interaction
sentences that cite other researchers, exhaustive list
They also aid design and engineering by highlighting desirable properties of a dialogue system [5].
sentences that cite other researchers, exhaustive list
This AAC system was designed to use context to facilitate the creation of personal narratives [75].
sentences that cite other researchers, exhaustive list
The cognitive scientist Kirsh presented a criticism of Norman's view of dialogue and developed an alternative based on the theory of embodied cognition [416].
sentences that cite other researchers, exhaustive list
Horvitz [360] summarized the principles of mixed-initiative interfaces as follows:
sentences that cite other researchers, exhaustive list
A research group at the University of Washington [60] recruited 10 families and recorded their communications with Amazon Echo Dot (Alexa) for four weeks.
sentences that cite other researchers, exhaustive list
Communication repair refers to the "work of restoring shared understanding" when conversational partners misunderstand each other [60].
sentences that cite other researchers, exhaustive list
Section 18.3 outlines a view of dialogue developed by Suchman [804] that emphasizes the situated nature of dialogue.
sentences that cite other researchers, exhaustive list
A cornerstone of this research is the book Plans and Situated Actions: The Problem of Human–Machine Communication by Suchman [804].
sentences that cite other researchers, exhaustive list
According to Scholtz [745], the two gulfs manifest differently in the different roles a user may have when interacting with a robot:
sentences that cite other researchers, exhaustive list
Norman's model stresses the need for users' acts to be understood by the computer and for users to understand the computer. Successful interfaces should also "provide a strong sense of understanding and control" [600, p. 49].
sentences that cite other researchers, exhaustive list
Affordance, which we discussed in Chapter 3, refers to how well users can interpret what actions are possible with a widget. Visibility is a handy related concept in design that underlies direct manipulation interfaces [416].
sentences that cite other researchers, exhaustive list