Hypothesis

5,047 Matching Annotations

Mar 2026
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Intro_to_HCI_18_dialogue.pdf

35
1. liur 30 Mar 2026
  
  in Public
  
  Gulf of execution: This gulf is about knowing what to do to bring about a desired state change in the computer.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
2. liur 30 Mar 2026
  
  in Public
  
  Code-switching refers to a switch in language to match the capabilities of the communication partner.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
3. liur 30 Mar 2026
  
  in Public
  
  An FSM is a model of discrete computation applicable to dialogues. In computer science, an FSM is a special case of a Turing machine that reads but does not write on the tape.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
4. liur 30 Mar 2026
  
  in Public
  
  A mode refers to the variation in the interpretation of a user's input according to an internal state. In a modeless dialogue, all inputs are possible in all states and their interpretation is always the same.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
5. liur 30 Mar 2026
  
  in Public
  
  The key idea in the dialogue view of interaction is the organization of communication as a series of turns. Dialogue evolves through communication turns between two or more partners. In one turn, an appropriate communication act is made by one partner based on the communication context. The act aims to get the other partner to do or understand something. This understanding then forms the context within which the other partner takes their turn.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
6. liur 30 Mar 2026
  
  in Public
  
  Interaction may be viewed as a dialogue, that is, a conversation that occurs between two partners in a context for some purpose.
  
  Highlight sentences that give a definition of a concept.
  
  ai-pending concept-definition
7. cwong10 30 Mar 2026
  
  in Public
  
  Dialogue is about the organization of communication as a series of turns between communication partners. The core elements of dialogue are communication turns, the communication context, and turn interpretation. Dialogue interaction includes speech-based and graphical interactions. Dialogue can be understood as computation, goal-directed action, communication, or embodied action.
  
  highlight the key concepts in this paper
  
  ai-pending
8. cwong10 30 Mar 2026
  
  in Public
  
  Kirsh argued that we are not just passively reacting to computer-generated options. If we look at interaction at a higher level, beyond a single action, we see that users are also actively influencing their environments. Users are 'architects' of their environments, as Kirsh put it.
  
  highlight the key concepts in this paper
  
  ai-pending
9. cwong10 30 Mar 2026
  
  in Public
  
  Kirsh points out that Norman's model makes an unrealistic assumption: The user is assumed to know the environment and its options and is merely picking an option. In practice, we do not always know what the options mean or even what options are available. Kirsh argued that users need to actively explore interfaces to become aware of the available functions and how they work.
  
  highlight the key concepts in this paper
  
  ai-pending
10. cwong10 30 Mar 2026
  
  in Public
  
  Mixed-initiative interaction is the idea of organizing interaction in dialogue where both the computer and the human can take initiative. Unlike in the case of an FSM, the computing system can take action without a command from the user; the initiative is mixed.
  
  highlight the key concepts in this paper
  
  ai-pending
11. cwong10 30 Mar 2026
  
  in Public
  
  Code-switching refers to a switch in language to match the capabilities of the communication partner.
  
  highlight the key concepts in this paper
  
  ai-pending
12. cwong10 30 Mar 2026
  
  in Public
  
  Communication repair refers to the 'work of restoring shared understanding' when conversational partners misunderstand each other.
  
  highlight the key concepts in this paper
  
  ai-pending
13. cwong10 30 Mar 2026
  
  in Public
  
  Robustness refers to the communication partners' ability to achieve shared understanding even in light of misunderstandings and other unanticipated troubles.
  
  highlight the key concepts in this paper
  
  ai-pending
14. cwong10 30 Mar 2026
  
  in Public
  
  Human–machine interaction, according to Suchman, is similar to but different from human–human dialogue. It is similar in the sense that people pursue a shared understanding: They actively work to make themselves understood. It is different in the sense that the communication abilities of computers are limited, which requires humans to adapt.
  
  highlight the key concepts in this paper
  
  ai-pending
15. cwong10 30 Mar 2026
  
  in Public
  
  FSMs are an effective way to describe how dialogue is structured; however, they are limited to memory-free dialogue.
  
  highlight the key concepts in this paper
  
  ai-pending
16. cwong10 30 Mar 2026
  
  in Public
  
  An FSM is a model of discrete computation applicable to dialogues.
  
  highlight the key concepts in this paper
  
  ai-pending
17. cwong10 30 Mar 2026
  
  in Public
  
  A mode refers to the variation in the interpretation of a user's input according to an internal state. In a modeless dialogue, all inputs are possible in all states and their interpretation is always the same.
  
  highlight the key concepts in this paper
  
  ai-pending
18. cwong10 30 Mar 2026
  
  in Public
  
  Dialogue can be described using models of computation from computer science. Such models include finite state machines (FSMs), pushdown automata, and Petri nets.
  
  highlight the key concepts in this paper
  
  ai-pending
19. cwong10 30 Mar 2026
  
  in Public
  
  Affordance, which we discussed in Chapter 3, refers to how well users can interpret what actions are possible with a widget. Visibility is a handy related concept in design that underlies direct manipulation interfaces.
  
  highlight the key concepts in this paper
  
  ai-pending
20. cwong10 30 Mar 2026
  
  in Public
  
  Mapping and feedback are crucial concepts for understanding a computer's turn.
  
  highlight the key concepts in this paper
  
  ai-pending
21. cwong10 30 Mar 2026
  
  in Public
  
  Gulf of evaluation: This gulf refers to knowing how a perceived change in the computer has moved it closer to the intended goal state.
  
  highlight the key concepts in this paper
  
  ai-pending
22. cwong10 30 Mar 2026
  
  in Public
  
  Gulf of execution: This gulf is about knowing what to do to bring about a desired state change in the computer.
  
  highlight the key concepts in this paper
  
  ai-pending
23. cwong10 30 Mar 2026
  
  in Public
  
  Norman offered two central concepts to help us understand these cognitive efforts: the gulf of execution and the gulf of evaluation. These two concepts describe inferential breakpoints for users seeking to express their intentions and interpret feedback from the system, respectively.
  
  highlight the key concepts in this paper
  
  ai-pending
24. cwong10 30 Mar 2026
  
  in Public
  
  A significant early theory of dialogue interaction is the seven-stage model of Norman [600]. It considers interaction as goal-directed, turn-based dialogue.
  
  highlight the key concepts in this paper
  
  ai-pending
25. cwong10 30 Mar 2026
  
  in Public
  
  both the computer and the user may have initiative. For example, a pop-up window can be presented to confirm a risky selection. When there is a misunderstanding about the context of the dialogue, errors may happen, and the partners must recover from them.
  
  highlight the key concepts in this paper
  
  ai-pending
26. cwong10 30 Mar 2026
  
  in Public
  
  both the computer and the human participate in establishing a shared context. The computer does not simply receive a message; it also communicates the effects of that message. Therefore, the design of feedback, affordances, and cues is central to dialogue-based interaction.
  
  highlight the key concepts in this paper
  
  ai-pending
27. cwong10 30 Mar 2026
  
  in Public
  
  in dialogue, acts of communication are conditional: The meaning of a turn depends on the communication context.
  
  highlight the key concepts in this paper
  
  ai-pending
28. cwong10 30 Mar 2026
  
  in Public
  
  the concepts of dialogue are applicable across modalities.
  
  highlight the key concepts in this paper
  
  ai-pending
29. cwong10 30 Mar 2026
  
  in Public
  
  The key idea in the dialogue view of interaction is the organization of communication as a series of turns. Dialogue evolves through communication turns between two or more partners. In one turn, an appropriate communication act is made by one partner based on the communication context.
  
  highlight the key concepts in this paper
  
  ai-pending
30. cferreira1 30 Mar 2026
  
  in Public
  
  Dialogue can be understood as computation, goal-directed action, communication, or embodied action. Each perspective provides specific methods for the analysis and design of dialogue.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
31. cferreira1 30 Mar 2026
  
  in Public
  
  Dialogue interaction includes speech-based and graphical interactions.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
32. cferreira1 30 Mar 2026
  
  in Public
  
  The core elements of dialogue are communication turns, the communication context, and turn interpretation.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
33. cferreira1 30 Mar 2026
  
  in Public
  
  Dialogue is about the organization of communication as a series of turns between communication partners.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
34. cferreira1 30 Mar 2026
  
  in Public
  
  The key idea in the dialogue view of interaction is the organization of communication as a series of turns. Dialogue evolves through communication turns between two or more partners. In one turn, an appropriate communication act is made by one partner based on the communication context. The act aims to get the other partner to do or understand something. This understanding then forms the context within which the other partner takes their turn.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
35. cferreira1 30 Mar 2026
  
  in Public
  
  Interaction may be viewed as a dialogue, that is, a conversation that occurs between two partners in a context for some purpose.
  
  Highlight the sentences that capture the main point of this chapter
  
  ai-user-approved
Visit annotations in context

Tags

ai-pending

ai-user-approved

concept-definition

Annotators

liur

cferreira1

cwong10

URL

glassmanlab.seas.harvard.edu/annotated_works/Intro_to_HCI_18_dialogue.pdf
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Intro_to_HCI_19_Tools.pdf

13
1. vsuraj 30 Mar 2026
  
  in Public
  
  The theory of task–technology fit (TTF) can illuminate what users consider useful and how this affects their decision to adopt a particular technology. TTF refers to the ability of technology to support a task [197]. The capabilities of the technology should match the demands of the task and the skills of the individual; in this case, the fit is perfect.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
2. vsuraj 30 Mar 2026
  
  in Public
  
  Users actively repurpose tools to make them more personally usable and relevant. Design should support such repurposing. For example, Renom et al. [696] conducted a study on text editing using a novel user interface. They found that exploration and technical reasoning facilitate creative tool use.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
3. vsuraj 30 Mar 2026
  
  in Public
  
  One prominent definition of accessibility is given by ISO 9241-171, which defines it as 'the usability of a product, service, environment or facility by people with the widest range of capabilities.'
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
4. vsuraj 30 Mar 2026
  
  in Public
  
  The second dimension, social acceptability, concerns whether interactions map well to the social norms and roles in the settings where they occur.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
5. vsuraj 30 Mar 2026
  
  in Public
  
  Acceptability has two main dimensions [591]. The first dimension, practical acceptability, includes costs, the reliability of the interactive system, and its compatibility with other systems. The perceptions of utility and usability may also influence the judgment of practical acceptability.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
6. vsuraj 30 Mar 2026
  
  in Public
  
  usability is multidimensional. This means that in most settings, a valid characterization of usability will need to employ several dimensions and measures.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
7. vsuraj 30 Mar 2026
  
  in Public
  
  usability is measurable, that is, it is possible to quantify usability based on users' behaviors or opinions.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
8. vsuraj 30 Mar 2026
  
  in Public
  
  usability is relational; it arises as an interplay between people, tasks (problems), and interactive systems (tools)
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
9. vsuraj 30 Mar 2026
  
  in Public
  
  ISO 9241-11 definition... defines usability as the 'extent to which a system, product or service can be used by specified users to achieve specified goals with effectiveness, efficiency and satisfaction in a specified context of use.'
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
10. vsuraj 30 Mar 2026
  
  in Public
  
  One shorthand way of expressing this is that utility is 'whether the functionality of a system in principle can do what is needed' [591, p. 25]. In practice, whether people can do anything concerns—among other things—usability.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
11. vsuraj 30 Mar 2026
  
  in Public
  
  The utility of an interactive system concerns its match with the tasks of users. If the match is good, the tool has high utility; if the tasks that users want to do are not supported by the tool, the tool has low utility.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
12. vsuraj 30 Mar 2026
  
  in Public
  
  Usability concerns how easily computer-based tools may be operated by users trying to accomplish a task. Usability differs from utility. Usability concerns whether users can use the product in a way that makes it possible to realize its utility; utility is about whether the goal is important to the user. Ideally, the user can use the tool without unnecessary effort so that the use is direct, transparent, and unnoticeable.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
13. vsuraj 30 Mar 2026
  
  in Public
  
  Usability is one of the best predictors of users' willingness to adopt software. For example, the User Burden Scale is a questionnaire for measuring the felt burden in software use [806]. It consists of six subscales: difficulty of use, physical burden, time and social burden, mental and emotional burden, privacy burden, and financial burden.
  
  Highlight what you think good software concepts owuld be and segment them by color coded categories.
  
  ai-pending software-concepts
Visit annotations in context

Tags

ai-pending

software-concepts

Annotators

vsuraj

URL

glassmanlab.seas.harvard.edu/annotated_works/Intro_to_HCI_19_Tools.pdf
harvard-ml-courses.github.io harvard-ml-courses.github.io

sec06_soln.pdf

1
1. chrisli124 30 Mar 2026
  
  in Public
  
  However, self-attention alone is permutation-invariant, i.e., if we reorder the rows of X, then the mechanism has no built-in sense of which token came first. Since word order matters, we must inject positional information. We often add a position vector pt to the token embedding: h(0)_t = e(xt) + pt One classical choice for the positional encoding is called the sinusoidal positional encoding. pt[2k] = sin(t / 10000^{2k/d}), pt[2k+1] = cos(t / 10000^{2k/d}) The sinusoidal features give each position a distinct geometric signature across many frequencies. Nearby positions have related encodings while distant positions remain distinguishable. This lets the network reason about relative offsets.
  
  highlight where positional encoding is mentioned
  
  ai-pending
Visit annotations in context

Tags

ai-pending

Annotators

chrisli124

URL

harvard-ml-courses.github.io/cs181-web/static/sec06/sec06_soln.pdf
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Intro_to_HCI_16_interaction.pdf

40
1. elglassman 30 Mar 2026
  
  in Public
  
  For example, in control-theoretical analysis, we see the interaction as dynamically changing states.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
2. elglassman 30 Mar 2026
  
  in Public
  
  For example, in interaction-as-rationality, courses of action emerge as a joint function of the user's goals and capabilities and the properties of the environment.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
3. elglassman 30 Mar 2026
  
  in Public
  
  For example, statistical models, such as Fitts' law, describe a relationship that is considered a statistical determination.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
4. elglassman 30 Mar 2026
  
  in Public
  
  For example, interaction-as-tool-use focuses on this idea (Chapter 19). Tools change people and their activities; in turn, this changes the tools.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
5. elglassman 30 Mar 2026
  
  in Public
  
  An interactive AI system can possess an objective function it pursues when acting, for example, when correcting a character it flags as a typo.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
6. elglassman 30 Mar 2026
  
  in Public
  
  In the case of particle physics, the propositions that make up a theory may concern the nature and behavior of particles.
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
7. elglassman 30 Mar 2026
  
  in Public
  
  HCI theories contain statements that link humans and technology and possibly some outcomes (e.g., poor usability, high user experience).
  
  a sentence describing examples of a concept
  
  ai-pending concept examples
8. elglassman 30 Mar 2026
  
  in Public
  
  Text entry can also be seen as a task where different subtasks are shared between the human and the computer (Chapter 20).
  
  a statement that describes a type of user task
  
  task description ai-user-approved concept: human-computer division of labor
9. elglassman 30 Mar 2026
  
  in Public
  
  One example is autocorrect, which automatically corrects typing errors while the user is typing. Another example is the use of word predictions, which allow the user to select a word from a set of word suggestions instead of typing out the word in full.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved task: text entry concept: human-computer division of labor
10. elglassman 30 Mar 2026
  
  in Public
  
  For example, text entry methods such as eye typing are designed to allow nonspeaking users with motor disabilities to enter text using their eye movements only.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved task: text entry concept: accessibility concept: tool use
11. elglassman 30 Mar 2026
  
  in Public
  
  Text entry is also a good example of tool use (Chapter 19). A text entry method is a tool that allows the user to communicate with someone or something, typically other people or a service, using asynchronous text messages and longer documents.
  
  a statement that describes a type of user task
  
  task description ai-user-approved concept: tool use
12. elglassman 30 Mar 2026
  
  in Public
  
  A fundamental goal of interaction is to provide users with tools that allow them to achieve goals that they would otherwise not have been able to achieve.
  
  a statement that describes a type of user task
  
  ai-pending task description
13. elglassman 30 Mar 2026
  
  in Public
  
  One goal of interaction is communicating intentions, for example, issuing commands to a computer, selecting graphical elements in image processing software, or entering text.
  
  a statement that describes a type of user task
  
  ai-pending task description
14. elglassman 30 Mar 2026
  
  in Public
  
  Accessibility is an extension of usability to ensure that as many people as possible find a tool easy to operate, regardless of individual capabilities.
  
  a sentence defining a concept
  
  ai-pending concept definition
15. elglassman 30 Mar 2026
  
  in Public
  
  Usability refers to the ease with which a tool is operated.
  
  a sentence defining a concept
  
  ai-pending concept definition
16. elglassman 30 Mar 2026
  
  in Public
  
  Utility captures how well a tool supports users in achieving their goals.
  
  a sentence defining a concept
  
  ai-pending concept definition
17. elglassman 30 Mar 2026
  
  in Public
  
  A model is a formally expressed set of propositions that follows some axiomatic system, such as algebra, logic, or a programming language.
  
  a sentence defining a concept
  
  ai-pending concept definition
18. elglassman 30 Mar 2026
  
  in Public
  
  The explanatory power of a theory refers to the empirical accuracy and coverage of the explanations offered by the theory.
  
  a sentence defining a concept
  
  ai-pending concept definition
19. elglassman 30 Mar 2026
  
  in Public
  
  A latent factor is something that affects observations about interaction without being directly observable.
  
  a sentence defining a concept
  
  ai-pending concept definition
20. elglassman 30 Mar 2026
  
  in Public
  
  Finally, in quantitative determination, interaction is described as a continuous unfolding of states.
  
  a sentence defining a concept
  
  ai-pending concept definition
21. elglassman 30 Mar 2026
  
  in Public
  
  In structural determination, the end results of the interaction are jointly determined by multiple causes that make up the whole.
  
  a sentence defining a concept
  
  ai-pending concept definition
22. elglassman 30 Mar 2026
  
  in Public
  
  In statistical determination, there is a stochastic relationship between the two entities.
  
  a sentence defining a concept
  
  ai-pending concept definition
23. elglassman 30 Mar 2026
  
  in Public
  
  In mechanical determination, an antecedent determines a consequent.
  
  a sentence defining a concept
  
  ai-pending concept definition
24. elglassman 30 Mar 2026
  
  in Public
  
  Propositions characterize entities and link them to other entities, some of which are conceptual.
  
  a sentence defining a concept
  
  ai-pending concept definition
25. elglassman 30 Mar 2026
  
  in Public
  
  In teleological determination, goals or purposes determine interaction in some way.
  
  a sentence defining a concept
  
  concept definition ai-user-approved concept: teleological determination
26. elglassman 30 Mar 2026
  
  in Public
  
  What happens in interaction is mutually determined by the human and the computer. In other words, what happens in interaction cannot be attributed solely to the human or the computer—the two must be considered together.
  
  a sentence describing the concept of interaction
  
  concept: interaction ai-user-approved
27. elglassman 30 Mar 2026
  
  in Public
  
  "Average movement time can be predicted as linear regression to the index of difficulty."
  
  a statement that is a claim about the world as described by a particular theory
  
  proposition example ai-user-approved
28. elglassman 30 Mar 2026
  
  in Public
  
  "The difficulty of selecting a target is proportional to its distance and inversely proportional to its width (index of difficulty)."
  
  a statement that is a claim about the world as described by a particular theory
  
  proposition example ai-user-approved
29. elglassman 30 Mar 2026
  
  in Public
  
  "If the user tries to increase speed, accuracy will be compromised, and vice versa: An increase in accuracy reduces speed."
  
  a statement that is a claim about the world as described by a particular theory
  
  proposition example ai-user-approved
30. elglassman 30 Mar 2026
  
  in Public
  
  For example, they can talk about information, difficulty, working memory, and so on.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved HCI theory
31. elglassman 30 Mar 2026
  
  in Public
  
  A proposition is a claim about the world.
  
  a sentence defining a concept
  
  concept definition ai-user-approved
32. elglassman 30 Mar 2026
  
  in Public
  
  Phenomena in interaction are emergent, that is, they are not attributable to the user or to the computer alone.
  
  a sentence describing the concept of interaction
  
  ai-pending concept: interaction
33. elglassman 30 Mar 2026
  
  in Public
  
  Interaction also occurs in different contexts, including work, leisure, and in-between contexts such as commuting.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved
34. elglassman 30 Mar 2026
  
  in Public
  
  Interaction is a dynamic phenomenon that unfolds over time as users and computers influence each other.
  
  sentence describing the concept of interaction
  
  concept: interaction ai-user-approved
35. elglassman 30 Mar 2026
  
  in Public
  
  What happens in interaction cannot be attributed solely to the human or the computer—the two must be considered together.
  
  individual sentence describing the concept of interaction
  
  ai-pending concept: interaction
36. elglassman 30 Mar 2026
  
  in Public
  
  It has been used to describe individuals, groups, and communities using computers.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved
37. elglassman 30 Mar 2026
  
  in Public
  
  Interaction is a core notion in HCI and refers to the mutual influence between people and computers.
  
  sentence describing the concept of interaction
  
  concept: interaction ai-user-approved
38. elglassman 30 Mar 2026
  
  in Public
  
  Pressing a button takes about a hundred milliseconds; adopting an information system in a large organization can easily take months.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved
39. elglassman 30 Mar 2026
  
  in Public
  
  We have used it to discuss various applications, from a user typing on a smartphone to a team of information workers communicating via email.
  
  sentence describing examples of a concept
  
  concept examples ai-user-approved
40. elglassman 30 Mar 2026
  
  in Public
  
  For example, a photocopier can automatically sort and collate copies.
  
  sentence describing examples of a concept
  
  ai-pending concept examples
Visit annotations in context

Tags

concept: accessibility

concept examples

ai-user-approved

HCI theory

concept: interaction

concept: teleological determination

concept: tool use

task: text entry

concept definition

concept: human-computer division of labor

ai-pending

proposition example

task description

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/annotated_works/Intro_to_HCI_16_interaction.pdf
ericbaumer.com ericbaumer.com

Untitled document

19
1. mjasim 30 Mar 2026
  
  in Public
  
  Such points about the origins of data and the processes of their collection are a key factor in civic text visualization. Indeed, a shift to emphasizing paradata can help draw attention to the representativeness of data.
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
2. mjasim 30 Mar 2026
  
  in Public
  
  On the other side of this spectrum, at the detail level, articulating nuanced information present in raw text data can enable civic leaders to peruse and sublimate critical insights.
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
3. mjasim 30 Mar 2026
  
  in Public
  
  In contrast, we could consider designing explicitly for multiple users. Doing so requires more than designing for different levels of expertise (see the following subsection for more on expertise) or designing for collaborative use, though both those things may be valuable in their own right. Rather, this dimension encourages accounting for the different types of relationalities that users may have with a system [cf. BB17].
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
4. mjasim 30 Mar 2026
  
  in Public
  
  Civic text visualizations similarly designed to foreground interpretation could help make clearer who is making these interpretive decisions, thereby highlighting the lack of neutrality and objectivity in data [DK20].
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
5. mjasim 30 Mar 2026
  
  in Public
  
  work on visualization evaluation [SP06; IZCC08; LBI*12] has emphasized the importance of close attention to the various contexts in which a visualization will be applied.
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
6. mjasim 30 Mar 2026
  
  in Public
  
  It is informative to contrast this analytic emphasis with other evolving discourses in information visualization. The prior work reviewed above illustrates a few alternative orientations, including rhetoric [HD11], feminism [DK16; DK20], ethics [Cor19], and others [DFCC13; VW08].
  
  Show alternative approaches to text visualization beyond analytics
  
  alternatives ai-user-approved
7. mjasim 30 Mar 2026
  
  in Public
  
  work in the digital humanities often explicitly emphasizes the interpretation both of texts themselves and of computational analysis thereof [Ram03; Joc13; Und14; BSM*20].
  
  Show alternative approaches to text visualization beyond analytics
  
  ai-pending alternatives
8. mjasim 30 Mar 2026
  
  in Public
  
  For instance, CommunityPulse provides a scaffolding for multifaceted public input analysis using visualizations [JHSM21], and MultiConVis enables multilevel exploration and analysis of threaded conversations [HC16b].
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
9. mjasim 30 Mar 2026
  
  in Public
  
  For example, CommunityPulse [JHSM21] uses common, simple visualizations and iconography, such as bar charts and emojis, to provide overviews of people's emotions towards civic agendas and ideas. Similarly, ConsiderIt [KMF*12b] uses bar charts to visualize people's stance towards ballot measures.
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
10. mjasim 30 Mar 2026
  
  in Public
  
  For instance, visual analytic systems such as MultiConVis [HC16b] use multiple connected views to enable analysts to filter and explore text data at multiple levels.
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
11. mjasim 30 Mar 2026
  
  in Public
  
  Tools such as ConsiderIt [KMF*12b] or Opinion Space [FBRG10] are designed specifically for the public. In contrast, tools such as CommunityPulse [JHSM21] or CommunityClick [JKW*21] are focused more on supporting community leaders and decision makers.
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
12. mjasim 30 Mar 2026
  
  in Public
  
  For example, MultiConVis [HC16b] makes prescriptive statements not only as to the sentimental valence of individual conversations but also as to the topics that each conversation is about. Similarly, ConsiderIt [KMF*12b] asks participants to place individual statements as either supporting or opposing a given ballot proposition.
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
13. mjasim 30 Mar 2026
  
  in Public
  
  Consider how systems such as MutiConVis [HC16b] and CommunityClick [JKW*21] provide visual representations to help the viewer understand the structure and content of conversations.
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
14. mjasim 30 Mar 2026
  
  in Public
  
  tools such as ConsiderIt [KMF*12b] and CommunityPulse [JHSM21] prominently feature specific comments from members of the public (i.e., the data).
  
  Find civic text visualization systems that are explicitly named.
  
  text-visualization ai-user-approved
15. mjasim 30 Mar 2026
  
  in Public
  
  Some tools provide both computational and visualization features. For instance, CommunityPulse provides a scaffolding for multifaceted public input analysis using visualizations [JHSM21], and MultiConVis enables multilevel exploration and analysis of threaded conversations [HC16b].
  
  Highlight all civic participation approaches
  
  ai-pending civic-participation
16. mjasim 30 Mar 2026
  
  in Public
  
  Researchers in HCI and digital civics have begun to explore methods to improve the analysis capabilities of visual analytics tools [JHSM21; MJS20b]. Although the broader community of visualization researchers acknowledges the importance of designing for varied levels of expertise [Mun14; GTS10; SNHS13], existing work on text analytics in general, as well as civic text visualizations in particular, focuses research efforts towards designing for analysts. Less effort has been put on designing and developing text visualization for non-experts—people who are not trained in or have had limited exposure to visualization and analytics.
  
  Highlight all civic participation approaches
  
  ai-pending civic-participation
17. mjasim 30 Mar 2026
  
  in Public
  
  Improving the public input process has become an important goal in the field of digital civics [MNC*19; VCL*16; OW15]. To that end, researchers and practitioners have developed a variety of systems for, e.g., sharing public opinions [FBRG10], building consensus [KMF*12a; ZNB15], summarizing public input [19], or identifying people's priorities, reflections, and hidden insights [JHSM21].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
18. mjasim 30 Mar 2026
  
  in Public
  
  Previous work has introduced several online engagement platforms to enable the public to asynchronously provide their comments, ideas, and feedback around civic issues [19; 20b; MJN*18]. These engagement tools have used micro-tasks [MJN*18], visualizations [19], and forum-like discussions [20b] to engage disconnected and disenfranchised populations [MNC*19]. Others have proposed technologies to promote in-person engagement of reticent participants during town halls [JKW*21] and public meetings [LLS] using clicker-like devices.
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
19. mjasim 30 Mar 2026
  
  in Public
  
  Despite their central importance in the civic engagement process, members of the general public are not necessarily involved in the analysis process. Hence, they are often left out of the loop when designing civic text visualizations—their requirements, aptitudes, knowledge, etc. are not given central consideration. Integrating participatory approaches in civic text visualization could pave the way not only for more inclusive analysis but also for leveraging the general public's knowledge to gather richer insights.
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
Visit annotations in context

Tags

ai-user-approved

civic-participation

alternatives

text-visualization

ai-pending

Annotators

mjasim

URL

ericbaumer.com/wp-content/uploads/sites/2/2022/04/TextVisSurvey_EuroVis2022-20220413-CameraReady.pdf
bit.csc.lsu.edu bit.csc.lsu.edu

Untitled document

10
1. mjasim 30 Mar 2026
  
  in Public
  
  social dynamics, such as shyness and tendency to avoid confrontation with dominant personalities can also hinder opinion sharing in town halls by favoring privileged individuals who are comfortable or trained to take part in contentious public discussions [27, 127].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
2. mjasim 30 Mar 2026
  
  in Public
  
  town halls inadvertently cater to a small number of privileged individuals, and silent participants often become disengaged despite physically attending the meetings [61]. Due to the lack of inclusivity, the outcome of such meetings often tends to feel unjust and opaque for the general public [39, 54].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
3. mjasim 30 Mar 2026
  
  in Public
  
  designing communitysourcing technologies to include marginalized opinions and amplify participation alone may not be enough to solve inequality of sharing opinions in the civic domain [26, 126]. Despite the success of previous works [25, 53, 90], technology is rarely integrated with existing manual practices and follow-ups of engagements between government officials and community members are seldom propagated to the community.
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
4. mjasim 30 Mar 2026
  
  in Public
  
  Marginalization can be broadly defined as the exclusion of a population from mainstream social, economic, cultural, or political life [58], which still stands as a barrier to inclusive participation in the civic domain [48, 94]. Researchers in HCI and CSCW have explored various communitysourcing approaches to include marginalized populations in community activities, proceedings, and designs [48, 53, 81, 93, 132].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
5. mjasim 30 Mar 2026
  
  in Public
  
  To increase broader civic participation, researchers in HCI have proposed both online [4, 5, 7, 81, 93] and face-to-face [21, 80, 91, 125] technological interventions that use the communitysourcing approach.
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
6. mjasim 30 Mar 2026
  
  in Public
  
  Prior investigations by Bryan [29] and Gastil [56] showed a steady decline in civic participation in town halls due to the growing disconnect between local government and community members and the decline in social capital [43, 111, 113]. Despite the introduction of online methods to increase public engagement in the last decade [4, 5, 7, 37, 81, 93], government officials continue to prefer face-to-face meetings to engage the community in the decision-making process [32, 52, 94].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
7. mjasim 30 Mar 2026
  
  in Public
  
  To reengage disconnected, reticent, or disenfranchised community members, researchers in HCI and digital civics have offered novel strategies and technological interventions to increase engagement [60, 62, 94, 107, 130].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
8. mjasim 30 Mar 2026
  
  in Public
  
  Bryan [29] and Gastil [56] investigated the state of town halls and demonstrated a steady decline in civic participation due to the growing disconnect between local government and the community.
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
9. mjasim 30 Mar 2026
  
  in Public
  
  Traditional community consultation methods, such as town halls, public forums, and workshops are the modus operandi for public engagement [52, 94]. For fair and impartial civic decision-making, the inclusivity of community members' feedback is paramount [60, 94, 126]. However, traditional methods rarely provide opportunities for inclusive public participation [30, 87, 95].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
10. mjasim 30 Mar 2026
  
  in Public
  
  Murphy used such systems to promote democracy and community partnerships [103]. Similarly, Boulianne et al. deployed clicker devices in contentious public discussions about climate change to gauge public opinions [25]. Bergstrom et al. used a single button device where the attendees anonymously voted (agree/disagree) on issues during the meeting. They showed that back-channel voting helped underrepresented users get more involved in the meeting [22].
  
  Highlight all civic participation approaches
  
  civic-participation ai-user-approved
Visit annotations in context

Tags

ai-user-approved

civic-participation

Annotators

mjasim

URL

bit.csc.lsu.edu/~mjasim/publications/Publication-mjasim2020cclickfull.pdf
ianarawjo.com ianarawjo.com

Fair Statistical Communication in HCI

10
1. ianarawjo 30 Mar 2026
  
  in Public
  
  As evidenced by numerous studies on statistical cognition (Kline, 2004; Beyth-Marom et al, 2008), even trained scientists have a hard time interpreting p-values, which frequently leads to misleading or incorrect conclusions.
  
  p-value is misinterpreted and confusing
  
  p-value is misleading ai-user-approved
2. ianarawjo 30 Mar 2026
  
  in Public
  
  few researchers can resist the temptation to conclude that there is no effect, a common fallacy called "accepting the null" which had frequently led to misleading or wrong scientific conclusions (Dienes, 2014, p.1).
  
  p-value is misinterpreted and confusing
  
  p-value is misleading ai-user-approved
3. ianarawjo 30 Mar 2026
  
  in Public
  
  Again, p is the probability of seeing results as extreme (or more extreme) as those actually observed if the null hypothesis were true. So p is computed under the assumption that the null hypothesis is true. Yet it is common for researchers, teachers and even textbooks to think of p as the probability of the null hypothesis being true (or equivalently, of the results being due to chance), an error called the "fallacy of the transposed conditional" (Haller and Krauss, 2002; Cohen, 1994, p.999).
  
  p-value is misinterpreted and confusing
  
  p-value is misleading ai-user-approved
4. ianarawjo 30 Mar 2026
  
  in Public
  
  Many researchers fail to appreciate that p-values are unreliable and vary widely across replications.
  
  p-value is misinterpreted and confusing
  
  p-value is misleading ai-user-approved
5. ianarawjo 30 Mar 2026
  
  in Public
  
  Providing non-misleading interpretations of figures with confidence intervals requires judgment, and no mechanical decision procedure can carry out this job better than a thoughtful investigator.
  
  Estimation is necessary but not sufficient
  
  Estimation is necessary but not sufficient ai-user-approved
6. ianarawjo 30 Mar 2026
  
  in Public
  
  NHST as it is carried out today consists of this incoherent mix of Fisher and Neyman–Pearson methods (Gigerenzer, 2004).
  
  p-value is misinterpreted and confusing
  
  ai-pending p-value is misleading
7. ianarawjo 30 Mar 2026
  
  in Public
  
  p-values give a seductive illusion of certainty and truth (Cumming, 2012, Chap. 1). The sacred α = .05 criterion amplifies this illusion, since results end up being either "significant" or "non-significant".
  
  p-value is misinterpreted and confusing
  
  ai-pending p-value is misleading
8. ianarawjo 30 Mar 2026
  
  in Public
  
  Estimation seems much more likely to promote clear statistical thinking.
  
  Need to change our way of thinking
  
  change thinking ai-user-approved
9. ianarawjo 30 Mar 2026
  
  in Public
  
  Decades spent educating researchers have had little or no influence on beliefs and practice (Schmidt and Hunter, 1997, pp.20–22).
  
  Calls for reform fall on deaf ears
  
  reform-failure ai-user-approved
10. ianarawjo 30 Mar 2026
  
  in Public
  
  NHST has been severely criticized for more than 50 years by end users to whom fair statistical communication matters.
  
  Calls for reform fall on deaf ears
  
  reform-failure ai-user-approved
Visit annotations in context

Tags

ai-user-approved

p-value is misleading

ai-pending

reform-failure

change thinking

Estimation is necessary but not sufficient

Annotators

ianarawjo

URL

ianarawjo.com/annotation-test/fairstats-last.pdf
ianarawjo.com ianarawjo.com

Why Hypothesis Tests Are Essential for Psychological Science

2
1. ianarawjo 30 Mar 2026
  
  in Public
  
  This assessment raises two issues. First, it is arbitrary. If 10 of the 15 CIs included the predicted values, would the results also support the theory, or instead refute it? If one instead used 99% CIs, would positive results for 12 of the 15 predictions be enough to support the theory? This arbitrariness arises because CIs offer no principled method for generating an inference regarding the theory.
  
  Estimation is too messy / complex and not clear enough
  
  Messiness of estimation as a solution ai-user-approved
2. ianarawjo 30 Mar 2026
  
  in Public
  
  two out of three necessary conditions for testing theory are missing.
  
  Estimation is too messy / complex and not clear enough
  
  Messiness of estimation as a solution ai-user-approved
Visit annotations in context

Tags

Messiness of estimation as a solution

ai-user-approved

Annotators

ianarawjo

URL

ianarawjo.com/annotation-test/morey-et-al-2014-why-hypothesis-tests-are-essential-for-psychological-science.pdf
ianarawjo.com ianarawjo.com

Rethinking statistical analysis methods for CHI

4
1. ianarawjo 30 Mar 2026
  
  in Public
  
  To illustrate this point Oakes posed a series of true/false questions regarding the interpretation of p-vales to seventy experienced researchers and discovered that only two had a sound understanding of the underlying concept of significance [25].
  
  Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
  
  ai-user-approved
2. ianarawjo 30 Mar 2026
  
  in Public
  
  failure to check assumptions about the data required by particular tests, over-testing and using inappropriate tests
  
  Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
  
  ai-user-approved
3. ianarawjo 30 Mar 2026
  
  in Public
  
  abusing statistical tests, making illogical arguments as a result of tests, deriving inappropriate conclusions from nonsignificant results, and confusing the size of p-values with effect sizes.
  
  Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
  
  ai-user-approved
4. ianarawjo 30 Mar 2026
  
  in Public
  
  This approach, fiercely promoted by Fisher in the 1930's [9], has become the gold standard in many disciplines including quantitative evaluations in HCI. However, the approach is rather counter-intuitive; many researchers misinterpret the meaning of the p-value.
  
  Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
  
  ai-user-approved
Visit annotations in context

Tags

ai-user-approved

Annotators

ianarawjo

URL

ianarawjo.com/annotation-test/2207676.2208557.pdf
arxiv.org arxiv.org

Representation Learning with Contrastive Predictive Coding

14
1. maxhenry 29 Mar 2026
  
  in Public
  
  We found that using MINE directly gave identical performance when the task was nontrivial, but became very unstable if the target was easy to predict from the context (e.g., when predicting a single step in the future and the target overlaps with the context).
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
2. maxhenry 29 Mar 2026
  
  in Public
  
  We note that better [49, 27] results have been published on these target datasets, by transfer learning from a different source task.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
3. maxhenry 29 Mar 2026
  
  in Public
  
  We also found that not all the information encoded is linearly accessible. When we used a single hidden layer instead the accuracy increases from 64.6 to 72.5, which is closer to the accuracy of the fully supervised model.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
4. maxhenry 29 Mar 2026
  
  in Public
  
  For lasertag_three_opponents_small, contrastive loss does not help nor hurt. We suspect that this is due to the task design, which does not require memory and thus yields a purely reactive policy.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
5. maxhenry 29 Mar 2026
  
  in Public
  
  Although this is a standard transfer learning benchmark, we found that models that learn better relationships in the childeren books did not necessarily perform better on the target tasks (which are very different: movie reviews etc).
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
6. maxhenry 29 Mar 2026
  
  in Public
  
  We found that more advanced sentence encoders did not significantly improve the results, which may be due to the simplicity of the transfer tasks (e.g., in MPQA most datapoints consists of one or a few words), and the fact that bag-of-words models usually perform well on many NLP tasks [48].
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
7. maxhenry 29 Mar 2026
  
  in Public
  
  It is important to note that the window size (maximum context size for the GRU) has a big impact on the performance, and longer segments would give better results. Our model had a maximum of 20480 timesteps to process, which is slightly longer than a second.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
8. maxhenry 29 Mar 2026
  
  in Public
  
  Interestingly, CPCs capture both speaker identity and speech contents, as demonstrated by the good accuracies attained with a simple linear classifier, which also gets close to the oracle, fully supervised networks.
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
9. maxhenry 29 Mar 2026
  
  in Public
  
  Figure 6 shows that for 4 out of the 5 games performance of the agent improves significantly with the contrastive loss after training on 1 billion frames.
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
10. maxhenry 29 Mar 2026
  
  in Public
  
  CPC 76.9 80.1 91.2 87.7 96.8
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
11. maxhenry 29 Mar 2026
  
  in Public
  
  CPC 73.6
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
12. maxhenry 29 Mar 2026
  
  in Public
  
  CPC 48.7
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
13. maxhenry 29 Mar 2026
  
  in Public
  
  Despite being relatively domain agnostic, CPCs improve upon state-of-the-art by 9% absolute in top-1 accuracy, and 4% absolute in top-5 accuracy.
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
14. maxhenry 29 Mar 2026
  
  in Public
  
  We also found that not all the information encoded is linearly accessible. When we used a single hidden layer instead the accuracy increases from 64.6 to 72.5, which is closer to the accuracy of the fully supervised model.
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
Visit annotations in context

Tags

ai-user-approved

most successful version

caveats

Annotators

maxhenry

URL

arxiv.org/pdf/1807.03748
arxiv.org arxiv.org

2305.14975.pdf

23
1. maxhenry 29 Mar 2026
  
  in Public
  
  Are the following two answers to my question Q semantically equivalent?\n\nQ: ${THE_QUESTION}\nA1: ${GOLD_ANSWER}\nA2: ${PRED_ANSWER}\n\nPlease answer with a single word, either "Yes." or "No.", and explain your reasoning.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
2. maxhenry 29 Mar 2026
  
  in Public
  
  Provide your best guess for the following question, and describe how likely it is that your guess is correct as one of the following expressions: ${EXPRESSION_LIST}. Give ONLY the guess and your confidence, no other words or explanation. For example:\n\nGuess: <most likely guess, as short as possible; not a complete sentence, just the guess!>\nConfidence: <description of confidence, without any extra commentary whatsoever; just a short phrase!>\n\nThe question is: ${THE_QUESTION}
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
3. maxhenry 29 Mar 2026
  
  in Public
  
  Provide your ${k} best guesses and the probability that each is correct (0.0 to 1.0) for the following question. Give ONLY the guesses and probabilities, no other words or explanation. For example:\n\nG1: <first most likely guess, as short as possible; not a complete sentence, just the guess!>\n\nP1: <the probability between 0.0 and 1.0 that G1 is correct, without any extra commentary whatsoever; just the probability!>
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
4. maxhenry 29 Mar 2026
  
  in Public
  
  Each linguistic likelihood expression is mapped to a probability using responses from a human survey on social media with 123 respondents (Fagen-Ulmschneider, 2023). Ling. 1S-opt. uses a held out set of calibration questions and answers to compute the average accuracy for each likelihood expression, using these 'optimized' values instead.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
5. maxhenry 29 Mar 2026
  
  in Public
  
  Finally, our study is limited to short-form question-answering; future work should extend this analysis to longer-form generation settings.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
6. maxhenry 29 Mar 2026
  
  in Public
  
  While our work demonstrates a promising new approach to generating calibrated confidences through verbalization, there are limitations that could be addressed in future work. First, our experiments are focused on factual recall-oriented problems, and the extent to which our observations would hold for reasoning-heavy settings is an interesting open question.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
7. maxhenry 29 Mar 2026
  
  in Public
  
  the 1-stage and 2-stage verbalized numerical confidence prompts sometimes differ drastically in the calibration of their confidences. How can we reduce sensitivity of a model's calibration to the prompt?
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
8. maxhenry 29 Mar 2026
  
  in Public
  
  Provide your best guess and the probability that it is correct (0.0 to 1.0) for the following question. Give ONLY the guess and probability, no other words or explanation. For example:\n\nGuess: <most likely guess, as short as possible; not a complete sentence, just the guess!>\n Probability: <the probability between 0.0 and 1.0 that your guess is correct, without any extra commentary whatsoever; just the probability!>\n\nThe question is: ${THE_QUESTION}
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
9. maxhenry 29 Mar 2026
  
  in Public
  
  Provide your ${k} best guesses and the probability that each is correct (0.0 to 1.0) for the following question. Give ONLY the guesses and probabilities, no other words or explanation.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
10. maxhenry 29 Mar 2026
  
  in Public
  
  Provide your best guess for the following question, and describe how likely it is that your guess is correct as one of the following expressions: ${EXPRESSION_LIST}. Give ONLY the guess and your confidence, no other words or explanation.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
11. maxhenry 29 Mar 2026
  
  in Public
  
  To fit the temperature that is used to compute ECE-t and BS-t we split our total data into 5 folds. For each fold, we use it once to fit a temperature and evaluate metrics on the remaining folds. We find that fitting the temperature on 20% of the data yields relatively stable temperatures across folds.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
12. maxhenry 29 Mar 2026
  
  in Public
  
  To avoid excessive false negatives in our correctness computation as a result of exact-match evaluation, we use either GPT-4 or GPT-3.5 to evaluate whether a response is essentially equivalent to the ground truth answer.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
13. maxhenry 29 Mar 2026
  
  in Public
  
  We sample 1000 questions from the validation split of TriviaQA (rc.web.nocontext) and SciQ and all 817 questions from the validation split of TruthfulQA (generation) for our experiments.
  
  please find the barebones practical information i need to implement this system or strategy
  
  pragmatic implications ai-user-approved
14. maxhenry 29 Mar 2026
  
  in Public
  
  Ling. 1S-opt. 0.056 0.051 0.088 0.927 0.028 0.052 0.172 0.828 0.082 0.105 0.212 0.632
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
15. maxhenry 29 Mar 2026
  
  in Public
  
  Verb. 1S top-4 0.041 0.039 0.081 0.959 0.056 0.059 0.185 0.815 0.198 0.144 0.245 0.619
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
16. maxhenry 29 Mar 2026
  
  in Public
  
  Ling. 1S-opt. 0.058 0.066 0.135 0.878 0.064 0.068 0.220 0.674 0.125 0.165 0.270 0.492
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
17. maxhenry 29 Mar 2026
  
  in Public
  
  Verb. 1S top-4 0.054 0.057 0.144 0.896 0.065 0.051 0.209 0.763 0.203 0.189 0.284 0.455
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
18. maxhenry 29 Mar 2026
  
  in Public
  
  Additionally, the lack of technical details available for many state-of-the-art closed RLHF-LMs may limit our ability to understand what factors enable a model to verbalize well-calibrated confidences and differences in this ability across different models.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
19. maxhenry 29 Mar 2026
  
  in Public
  
  With Llama2-70B-Chat, verbalized calibration provides improvement over conditional probabilities across some metrics, but the improvement is much less consistent compared to GPT-* and Claude-*.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
20. maxhenry 29 Mar 2026
  
  in Public
  
  The verbal calibration of the open source model Llama-2-70b-chat is generally weaker than that of closed source models but still demonstrates improvement over its conditional probabilities by some metrics, and does so most clearly on TruthfulQA.
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
21. maxhenry 29 Mar 2026
  
  in Public
  
  Chain-of-thought prompting does not improve verbalized calibration
  
  all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
  
  caveats ai-user-approved
22. maxhenry 29 Mar 2026
  
  in Public
  
  Among the methods for verbalizing probabilities directly, we observe that generating and evaluating multiple hypotheses improves calibration (see Figure 1), similarly to humans (Lord et al., 1985), and corroborating a similar finding in LMs (Kadavath et al., 2022).
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
23. maxhenry 29 Mar 2026
  
  in Public
  
  Ling. 1S-opt. 0.060 0.070 0.151 0.874 0.049 0.056 0.214 0.738 0.099 0.130 0.266 0.446
  
  please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
  
  most successful version ai-user-approved
Visit annotations in context

Tags

ai-user-approved

most successful version

caveats

pragmatic implications

Annotators

maxhenry

URL

arxiv.org/pdf/2305.14975
sites.stat.columbia.edu sites.stat.columbia.edu

Untitled document

2
1. ianarawjo 29 Mar 2026
  
  in Public
  
  the solution is not to reform p-values or to replace them with some other statistical summary or threshold, but rather to move toward a greater acceptance of uncertainty and embracing of variation.
  
  Where it's mentioned how to address the problems with p-values
  
  ai-pending Solutions to p-value crisis
2. ianarawjo 29 Mar 2026
  
  in Public
  
  What, then, can and should be done? I agree with the ASA statement's final paragraph, which emphasizes the importance of design, understanding, and context—and I would also add measurement to that list.
  
  Where it's mentioned how to address the problems with p-values
  
  ai-pending Solutions to p-value crisis
Visit annotations in context

Tags

ai-pending

Solutions to p-value crisis

Annotators

ianarawjo

URL

sites.stat.columbia.edu/gelman/research/published/asa_pvalues.pdf
courses.media.mit.edu courses.media.mit.edu

Untitled document

7
1. ianarawjo 29 Mar 2026
  
  in Public
  
  the psychology research community has been strongly questioning the value of NHST in psychology for some years now [6] and calling for a more meaningful reporting of statistical inference based on effect sizes, confidence intervals and Bayesian reasoning [9].
  
  Mentioning the problems with p-values
  
  ai-pending p-values
2. ianarawjo 29 Mar 2026
  
  in Public
  
  Similarly, if the significance level is set at 0.05, then this is the probability of the data occurring by chance when there is no experimental effect, namely one in twenty times. The more tests that are done on a particular dataset, the more likely it is that some chance variation will be extreme enough to seem like significance.
  
  Mentioning the problems with p-values
  
  ai-pending p-values
3. ianarawjo 29 Mar 2026
  
  in Public
  
  Violation of the assumptions of any statistical test can produce p values that bear little relation to the actual probabilities of outcomes and hence comparison to the significance level of 0.05 is meaningless.
  
  Mentioning the problems with p-values
  
  ai-pending p-values
4. ianarawjo 29 Mar 2026
  
  in Public
  
  for an analysis to be sound, it is necessary that in the tests performed the probabilities of outcomes are accurately reflected in the p values produced by the tests. If this is not the case, then the NHST argument form is severely weakened.
  
  Mentioning the problems with p-values
  
  ai-pending p-values
5. ianarawjo 29 Mar 2026
  
  in Public
  
  NHST is the most commonly encountered form of statistical inference and is what is usually associated with producing a null hypothesis, then testing it to give some statistic such as a t value, and then turning the statistic into a p value.
  
  Mentioning the problems with p-values
  
  ai-pending p-values
6. ianarawjo 29 Mar 2026
  
  in Public
  
  properly reported non-significant results can help future researchers to provide estimates of effect sizes and associated confidence intervals.
  
  Effect sizes are mentioned
  
  ai-pending Effect sizes
7. ianarawjo 29 Mar 2026
  
  in Public
  
  calling for a more meaningful reporting of statistical inference based on effect sizes, confidence intervals and Bayesian reasoning [9].
  
  Effect sizes are mentioned
  
  ai-pending Effect sizes
Visit annotations in context

Tags

p-values

ai-pending

Effect sizes

Annotators

ianarawjo

URL

courses.media.mit.edu/2010spring/mas964/Week6/Cairns_2007.pdf
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Supporting Co-Adaptive Machine Teaching through Human Concept Learning and Cognitive Theories

16
1. elglassman 29 Mar 2026
  
  in Public
  
  P8 said: "as I get familiar with this system [C3], I feel more skilled" to use the highlighted and grayed phrases.
  
  any sentence that describes a user's emotional (positive or negative) response to any condition in the experiments.
  
  emotional response ai-user-approved
2. elglassman 29 Mar 2026
  
  in Public
  
  It is worth noting that P3 and P8 mentioned feeling more comfortable with the more familiar visualization in C1 and C2 during their first impression of the conditions.
  
  any sentence that describes a user's emotional (positive or negative) response to any condition in the experiments.
  
  emotional response ai-user-approved
3. elglassman 29 Mar 2026
  
  in Public
  
  The inclusion of counterfactuals often resulted in a substantial increase in precision, indicating that the models were better able to correctly classify relevant instances while reducing false positives.
  
  statements that draw general conclusions about humans, computers, and/or human-computer interaction based on the results of the specific experiment done in the paper.
  
  ai-pending generalizable knowledge
4. elglassman 29 Mar 2026
  
  in Public
  
  By visualizing these consistent pattern rules, users may be better understanding the behavior of the model through inference projection.
  
  statements that draw general conclusions about humans, computers, and/or human-computer interaction based on the results of the specific experiment done in the paper.
  
  ai-pending generalizable knowledge
5. elglassman 29 Mar 2026
  
  in Public
  
  Mocha addresses two seemingly contradictory objectives: (1) generating labeled data that diversifies the training dataset to aid the model's learning, and (2) maintaining structural consistency across the batches of data presented to users to support their cognitive processes.
  
  statements that draw general conclusions about humans, computers, and/or human-computer interaction based on the results of the specific experiment done in the paper.
  
  ai-pending generalizable knowledge
6. elglassman 29 Mar 2026
  
  in Public
  
  The results of our study indicate that participants spent significantly less time annotating batches of counterfactuals when they were rendered according to SAT compared to other conditions i.e., supporting the participants' selective focus on the varying phrases, rather than phrases that stay consistent.
  
  statements that draw general conclusions about humans, computers, and/or human-computer interaction based on the results of the specific experiment done in the paper.
  
  ai-pending generalizable knowledge
7. elglassman 29 Mar 2026
  
  in Public
  
  From a cognitive perspective, the theme color aligns with the human's (theorized) structural mapping engine [27] by making relational discrepancies between the original and counterfactual examples more explicit.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
8. elglassman 29 Mar 2026
  
  in Public
  
  Estes and Hasson [17] highlights the significant role of bringing salience to 'non-alignable' differences.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
9. elglassman 29 Mar 2026
  
  in Public
  
  The last two prior works also combine Variation Theory (VT) and SAT together, as we did (i.e., a corollary of SAT referred to as Analogical Transfer/Learning Theory).
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
10. elglassman 29 Mar 2026
  
  in Public
  
  Estes and Hasson [17] highlights the significant role of bringing salience to "non-alignable" differences.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
11. elglassman 29 Mar 2026
  
  in Public
  
  Estes and Hasson [17] argue that while alignable differences can be more straightforward and easier for comparison, non-alignable differences can also provide key information that might otherwise remain overlooked.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
12. elglassman 29 Mar 2026
  
  in Public
  
  By incorporating theories such as Structural Alignment Theory and Variation Theory, it aims to support the learning of both the human and the model.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
13. elglassman 29 Mar 2026
  
  in Public
  
  This symbiotic relationship stems from the fact that Structural Alignment Theory (SAT) enhances the salience of differences, while the way we used Variation Theory (VT) to generate contradicting examples across the boundaries of labels ensures that these differences are conceptually informative.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
14. elglassman 29 Mar 2026
  
  in Public
  
  It states that understanding and sensemaking involve mapping the relationships between elements, especially in complex and ambiguous tasks.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
15. elglassman 29 Mar 2026
  
  in Public
  
  Structural Alignment Theory states that humans naturally look for structural mapping between representations of objects to help them understand, compare, and infer relationships between said objects.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
16. elglassman 29 Mar 2026
  
  in Public
  
  According to Variation Theory, learners better understand concepts by observing variations along critical features (dimensions of variation) that define that concept and, separately, observing variations along superficial features that do not define that concept—all while other features, when possible, are held constant.
  
  return any single sentence that describes an explicit or implicit connection to theory
  
  ai-pending theory
Visit annotations in context

Tags

ai-user-approved

generalizable knowledge

ai-pending

emotional response

theory

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/papers/mocha_chi25.pdf
glassmanlab.seas.harvard.edu glassmanlab.seas.harvard.edu

Untitled document

4
1. elglassman 29 Mar 2026
  
  in Public
  
  Mohamed et al. (2020) put forth the idea of dismantling power assymmetries to resist data colonialism.
  
  sentence that refers to a theory
  
  theory ai-user-approved
2. elglassman 29 Mar 2026
  
  in Public
  
  Couldry and Mejias (2019) propose 'data colonialism' as a new form of colonialism to make sense of the use of large amounts of data by a small group of corporate and government actors.
  
  sentence that refers to a theory
  
  theory ai-user-approved
3. elglassman 29 Mar 2026
  
  in Public
  
  Taken together, these findings almost unanimously show that, on average, AI-supported writing decreases but does not eliminate writer's feelings of ownership, underscoring the need for a larger theory of AI participation in the creative process.
  
  sentence that refers to a theory
  
  ai-pending theory
4. elglassman 29 Mar 2026
  
  in Public
  
  This can be understood through the frame of precarious work [5]; as writers feel that their work is increasingly precarious, the power differential between themselves and the organizations seeking to train LLMs grows larger.
  
  sentence that refers to a theory
  
  ai-pending theory
Visit annotations in context

Tags

ai-user-approved

ai-pending

theory

Annotators

elglassman

URL

glassmanlab.seas.harvard.edu/papers/gero_LIL_chi25.pdf

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL