Pressing a button takes about a hundred milliseconds; adopting an information system in a large organization can easily take months.
sentence describing examples of a concept
Pressing a button takes about a hundred milliseconds; adopting an information system in a large organization can easily take months.
sentence describing examples of a concept
We have used it to discuss various applications, from a user typing on a smartphone to a team of information workers communicating via email.
sentence describing examples of a concept
social dynamics, such as shyness and tendency to avoid confrontation with dominant personalities can also hinder opinion sharing in town halls by favoring privileged individuals who are comfortable or trained to take part in contentious public discussions [27, 127].
Highlight all civic participation approaches
town halls inadvertently cater to a small number of privileged individuals, and silent participants often become disengaged despite physically attending the meetings [61]. Due to the lack of inclusivity, the outcome of such meetings often tends to feel unjust and opaque for the general public [39, 54].
Highlight all civic participation approaches
designing communitysourcing technologies to include marginalized opinions and amplify participation alone may not be enough to solve inequality of sharing opinions in the civic domain [26, 126]. Despite the success of previous works [25, 53, 90], technology is rarely integrated with existing manual practices and follow-ups of engagements between government officials and community members are seldom propagated to the community.
Highlight all civic participation approaches
Marginalization can be broadly defined as the exclusion of a population from mainstream social, economic, cultural, or political life [58], which still stands as a barrier to inclusive participation in the civic domain [48, 94]. Researchers in HCI and CSCW have explored various communitysourcing approaches to include marginalized populations in community activities, proceedings, and designs [48, 53, 81, 93, 132].
Highlight all civic participation approaches
To increase broader civic participation, researchers in HCI have proposed both online [4, 5, 7, 81, 93] and face-to-face [21, 80, 91, 125] technological interventions that use the communitysourcing approach.
Highlight all civic participation approaches
Prior investigations by Bryan [29] and Gastil [56] showed a steady decline in civic participation in town halls due to the growing disconnect between local government and community members and the decline in social capital [43, 111, 113]. Despite the introduction of online methods to increase public engagement in the last decade [4, 5, 7, 37, 81, 93], government officials continue to prefer face-to-face meetings to engage the community in the decision-making process [32, 52, 94].
Highlight all civic participation approaches
To reengage disconnected, reticent, or disenfranchised community members, researchers in HCI and digital civics have offered novel strategies and technological interventions to increase engagement [60, 62, 94, 107, 130].
Highlight all civic participation approaches
Bryan [29] and Gastil [56] investigated the state of town halls and demonstrated a steady decline in civic participation due to the growing disconnect between local government and the community.
Highlight all civic participation approaches
Traditional community consultation methods, such as town halls, public forums, and workshops are the modus operandi for public engagement [52, 94]. For fair and impartial civic decision-making, the inclusivity of community members' feedback is paramount [60, 94, 126]. However, traditional methods rarely provide opportunities for inclusive public participation [30, 87, 95].
Highlight all civic participation approaches
Murphy used such systems to promote democracy and community partnerships [103]. Similarly, Boulianne et al. deployed clicker devices in contentious public discussions about climate change to gauge public opinions [25]. Bergstrom et al. used a single button device where the attendees anonymously voted (agree/disagree) on issues during the meeting. They showed that back-channel voting helped underrepresented users get more involved in the meeting [22].
Highlight all civic participation approaches
As evidenced by numerous studies on statistical cognition (Kline, 2004; Beyth-Marom et al, 2008), even trained scientists have a hard time interpreting p-values, which frequently leads to misleading or incorrect conclusions.
p-value is misinterpreted and confusing
few researchers can resist the temptation to conclude that there is no effect, a common fallacy called "accepting the null" which had frequently led to misleading or wrong scientific conclusions (Dienes, 2014, p.1).
p-value is misinterpreted and confusing
Again, p is the probability of seeing results as extreme (or more extreme) as those actually observed if the null hypothesis were true. So p is computed under the assumption that the null hypothesis is true. Yet it is common for researchers, teachers and even textbooks to think of p as the probability of the null hypothesis being true (or equivalently, of the results being due to chance), an error called the "fallacy of the transposed conditional" (Haller and Krauss, 2002; Cohen, 1994, p.999).
p-value is misinterpreted and confusing
Many researchers fail to appreciate that p-values are unreliable and vary widely across replications.
p-value is misinterpreted and confusing
Providing non-misleading interpretations of figures with confidence intervals requires judgment, and no mechanical decision procedure can carry out this job better than a thoughtful investigator.
Estimation is necessary but not sufficient
Estimation seems much more likely to promote clear statistical thinking.
Need to change our way of thinking
Decades spent educating researchers have had little or no influence on beliefs and practice (Schmidt and Hunter, 1997, pp.20–22).
Calls for reform fall on deaf ears
NHST has been severely criticized for more than 50 years by end users to whom fair statistical communication matters.
Calls for reform fall on deaf ears
This assessment raises two issues. First, it is arbitrary. If 10 of the 15 CIs included the predicted values, would the results also support the theory, or instead refute it? If one instead used 99% CIs, would positive results for 12 of the 15 predictions be enough to support the theory? This arbitrariness arises because CIs offer no principled method for generating an inference regarding the theory.
Estimation is too messy / complex and not clear enough
two out of three necessary conditions for testing theory are missing.
Estimation is too messy / complex and not clear enough
To illustrate this point Oakes posed a series of true/false questions regarding the interpretation of p-vales to seventy experienced researchers and discovered that only two had a sound understanding of the underlying concept of significance [25].
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
failure to check assumptions about the data required by particular tests, over-testing and using inappropriate tests
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
abusing statistical tests, making illogical arguments as a result of tests, deriving inappropriate conclusions from nonsignificant results, and confusing the size of p-values with effect sizes.
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
This approach, fiercely promoted by Fisher in the 1930's [9], has become the gold standard in many disciplines including quantitative evaluations in HCI. However, the approach is rather counter-intuitive; many researchers misinterpret the meaning of the p-value.
Sentences where they say people don't really know the statistics, they just apply tests without thought because it's tradition
We found that using MINE directly gave identical performance when the task was nontrivial, but became very unstable if the target was easy to predict from the context (e.g., when predicting a single step in the future and the target overlaps with the context).
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
We note that better [49, 27] results have been published on these target datasets, by transfer learning from a different source task.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
We also found that not all the information encoded is linearly accessible. When we used a single hidden layer instead the accuracy increases from 64.6 to 72.5, which is closer to the accuracy of the fully supervised model.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
For lasertag_three_opponents_small, contrastive loss does not help nor hurt. We suspect that this is due to the task design, which does not require memory and thus yields a purely reactive policy.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
Although this is a standard transfer learning benchmark, we found that models that learn better relationships in the childeren books did not necessarily perform better on the target tasks (which are very different: movie reviews etc).
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
We found that more advanced sentence encoders did not significantly improve the results, which may be due to the simplicity of the transfer tasks (e.g., in MPQA most datapoints consists of one or a few words), and the fact that bag-of-words models usually perform well on many NLP tasks [48].
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
It is important to note that the window size (maximum context size for the GRU) has a big impact on the performance, and longer segments would give better results. Our model had a maximum of 20480 timesteps to process, which is slightly longer than a second.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
Interestingly, CPCs capture both speaker identity and speech contents, as demonstrated by the good accuracies attained with a simple linear classifier, which also gets close to the oracle, fully supervised networks.
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Figure 6 shows that for 4 out of the 5 games performance of the agent improves significantly with the contrastive loss after training on 1 billion frames.
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
CPC 76.9 80.1 91.2 87.7 96.8
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
CPC 73.6
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
CPC 48.7
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Despite being relatively domain agnostic, CPCs improve upon state-of-the-art by 9% absolute in top-1 accuracy, and 4% absolute in top-5 accuracy.
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
We also found that not all the information encoded is linearly accessible. When we used a single hidden layer instead the accuracy increases from 64.6 to 72.5, which is closer to the accuracy of the fully supervised model.
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Are the following two answers to my question Q semantically equivalent?\n\nQ: ${THE_QUESTION}\nA1: ${GOLD_ANSWER}\nA2: ${PRED_ANSWER}\n\nPlease answer with a single word, either "Yes." or "No.", and explain your reasoning.
please find the barebones practical information i need to implement this system or strategy
Provide your best guess for the following question, and describe how likely it is that your guess is correct as one of the following expressions: ${EXPRESSION_LIST}. Give ONLY the guess and your confidence, no other words or explanation. For example:\n\nGuess: <most likely guess, as short as possible; not a complete sentence, just the guess!>\nConfidence: <description of confidence, without any extra commentary whatsoever; just a short phrase!>\n\nThe question is: ${THE_QUESTION}
please find the barebones practical information i need to implement this system or strategy
Provide your ${k} best guesses and the probability that each is correct (0.0 to 1.0) for the following question. Give ONLY the guesses and probabilities, no other words or explanation. For example:\n\nG1: <first most likely guess, as short as possible; not a complete sentence, just the guess!>\n\nP1: <the probability between 0.0 and 1.0 that G1 is correct, without any extra commentary whatsoever; just the probability!>
please find the barebones practical information i need to implement this system or strategy
Each linguistic likelihood expression is mapped to a probability using responses from a human survey on social media with 123 respondents (Fagen-Ulmschneider, 2023). Ling. 1S-opt. uses a held out set of calibration questions and answers to compute the average accuracy for each likelihood expression, using these 'optimized' values instead.
please find the barebones practical information i need to implement this system or strategy
Finally, our study is limited to short-form question-answering; future work should extend this analysis to longer-form generation settings.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
While our work demonstrates a promising new approach to generating calibrated confidences through verbalization, there are limitations that could be addressed in future work. First, our experiments are focused on factual recall-oriented problems, and the extent to which our observations would hold for reasoning-heavy settings is an interesting open question.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
the 1-stage and 2-stage verbalized numerical confidence prompts sometimes differ drastically in the calibration of their confidences. How can we reduce sensitivity of a model's calibration to the prompt?
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
Provide your best guess and the probability that it is correct (0.0 to 1.0) for the following question. Give ONLY the guess and probability, no other words or explanation. For example:\n\nGuess: <most likely guess, as short as possible; not a complete sentence, just the guess!>\n Probability: <the probability between 0.0 and 1.0 that your guess is correct, without any extra commentary whatsoever; just the probability!>\n\nThe question is: ${THE_QUESTION}
please find the barebones practical information i need to implement this system or strategy
Provide your ${k} best guesses and the probability that each is correct (0.0 to 1.0) for the following question. Give ONLY the guesses and probabilities, no other words or explanation.
please find the barebones practical information i need to implement this system or strategy
Provide your best guess for the following question, and describe how likely it is that your guess is correct as one of the following expressions: ${EXPRESSION_LIST}. Give ONLY the guess and your confidence, no other words or explanation.
please find the barebones practical information i need to implement this system or strategy
To fit the temperature that is used to compute ECE-t and BS-t we split our total data into 5 folds. For each fold, we use it once to fit a temperature and evaluate metrics on the remaining folds. We find that fitting the temperature on 20% of the data yields relatively stable temperatures across folds.
please find the barebones practical information i need to implement this system or strategy
To avoid excessive false negatives in our correctness computation as a result of exact-match evaluation, we use either GPT-4 or GPT-3.5 to evaluate whether a response is essentially equivalent to the ground truth answer.
please find the barebones practical information i need to implement this system or strategy
We sample 1000 questions from the validation split of TriviaQA (rc.web.nocontext) and SciQ and all 817 questions from the validation split of TruthfulQA (generation) for our experiments.
please find the barebones practical information i need to implement this system or strategy
Ling. 1S-opt. 0.056 0.051 0.088 0.927 0.028 0.052 0.172 0.828 0.082 0.105 0.212 0.632
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Verb. 1S top-4 0.041 0.039 0.081 0.959 0.056 0.059 0.185 0.815 0.198 0.144 0.245 0.619
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Ling. 1S-opt. 0.058 0.066 0.135 0.878 0.064 0.068 0.220 0.674 0.125 0.165 0.270 0.492
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Verb. 1S top-4 0.054 0.057 0.144 0.896 0.065 0.051 0.209 0.763 0.203 0.189 0.284 0.455
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Additionally, the lack of technical details available for many state-of-the-art closed RLHF-LMs may limit our ability to understand what factors enable a model to verbalize well-calibrated confidences and differences in this ability across different models.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
With Llama2-70B-Chat, verbalized calibration provides improvement over conditional probabilities across some metrics, but the improvement is much less consistent compared to GPT-* and Claude-*.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
The verbal calibration of the open source model Llama-2-70b-chat is generally weaker than that of closed source models but still demonstrates improvement over its conditional probabilities by some metrics, and does so most clearly on TruthfulQA.
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
Chain-of-thought prompting does not improve verbalized calibration
all content that points to important caveats and gotchas that I might consider when leaning too heavily on the results of this paper
Among the methods for verbalizing probabilities directly, we observe that generating and evaluating multiple hypotheses improves calibration (see Figure 1), similarly to humans (Lord et al., 1985), and corroborating a similar finding in LMs (Kadavath et al., 2022).
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
Ling. 1S-opt. 0.060 0.070 0.151 0.874 0.049 0.056 0.214 0.738 0.099 0.130 0.266 0.446
please point only to the details of the most successful version of this system, especially in tables when there are many options, and also highlight sections that provide supporting context for these conditions, if appropriate
P8 said: "as I get familiar with this system [C3], I feel more skilled" to use the highlighted and grayed phrases.
any sentence that describes a user's emotional (positive or negative) response to any condition in the experiments.
It is worth noting that P3 and P8 mentioned feeling more comfortable with the more familiar visualization in C1 and C2 during their first impression of the conditions.
any sentence that describes a user's emotional (positive or negative) response to any condition in the experiments.
Specifically, they underscore the need for co-adaptive systems that can evolve along with users' mental models and definitions of labels.
any sentence that describes explicit design implications
Most previous research in counterfactual generation has focused on the model side by either generating counterfactuals to improve the model's performance or explaining its behaviors post hoc.
any single sentence that compares and contrasts this work with prior work.
Variation Theory provides the conceptual basis for generating structurally consistent differences, while Structural Alignment Theory (SAT) enhances the user's ability in recognizing and processing these differences.
return any single sentence that describes an explicit or implicit connection to theory
While SAT-based rendering supported human sensemaking in both Gero et al. [29] and Mocha, we also show that the combination of VT and SAT support the model's learning.
any single sentence that compares and contrasts this work with prior work.
This finding is consistent with previous work that supports users' sense-making of text, e.g., by modulating text saliency. Specifically, Gu et al. [32] and Gero et al. [29] both found improved reading efficiency and comprehension with saliency-modulating text renderings.
any single sentence that compares and contrasts this work with prior work.
In decision making, SAT argues that people tend to focus on alignable differences—features that can be directly compared—rather than on differences that cannot be easily aligned.
return any single sentence that describes an explicit or implicit connection to theory
Structural Alignment Theory (SAT) [27] is a cognitive theory that explains how people make sense of concepts by comparing relational structures between two items.
return any single sentence that describes an explicit or implicit connection to theory
Specifically, we use Variation Theory of learning [44] which states that for learning to occur, some aspects that define the concept being learned must vary while others are held constant.
return any single sentence that describes an explicit or implicit connection to theory
According to SAT, humans compare two similar entities by trying to find structural alignments between them, and then comparing corresponding elements, with a special focus on differing aligned elements.
return any single sentence that describes an explicit or implicit connection to theory
VT posits that human learning occurs when learners experience variation across critical and superficial aspects of a concept—through exposure to contrasting examples that systematically vary along different critical and superficial feature dimensions.
return any single sentence that describes an explicit or implicit connection to theory
To analyze the annotation efficiency, we first conducted a Kruskal-Wallis rank sum test [39] to determine if there were statistically significant differences in annotation time across the three conditions, because our data violated the homogeneity of variances assumption, making non-parametric methods more appropriate.
return any single sentence that describes data analysis done on data collected by the authors when running human subjects experiments.
Mohamed et al. (2020) put forth the idea of dismantling power assymmetries to resist data colonialism.
sentence that refers to a theory
Couldry and Mejias (2019) propose 'data colonialism' as a new form of colonialism to make sense of the use of large amounts of data by a small group of corporate and government actors.
sentence that refers to a theory
Interviews were video and audio recorded. We transcribed the audio using OpenAI's Whisper automatic speech recognition system and anonymized the transcript before analysis. We analyzed the interview data using thematic analysis [1]. First, two members of the research team independently coded four (25% of collected data) randomly chosen participant data to generate low-level codes. The inter-coder reliability between the coders was 0.88 using Krippendorff's alpha [37]. The two coders then met together to cross-check, resolve coding conflicts, and consolidate the codes into a codebook across two sessions. Using the codebook, the two coders analyzed six randomly selected participant data each. The research team then met, discussed the analysis outcomes, and finalized themes over three sessions.
sentence describing how analysis was performed on data collected by the authors of this paper
We consider common sequences of chunk roles to be alignable structures that could be used to support users in identifying structural similarities and differences across sentences in different abstracts, in line with Structure-Mapping Theory [17].
sentences that mention theory, explicitly or implicitly; one sentence at a time
Like prior Structural Mapping Theory (SMT)-informed work in text corpora representation, AbstractExplorer's features have enabled some users to see more of both the overview and the details at the same time, facilitating abstraction without losing context.
sentences that mention theory, explicitly or implicitly; one sentence at a time
Our work demonstrates that designs informed by Structure-Mapping Theory can support users in navigating, making use of, and engaging with variation present in information.
sentences that mention theory, explicitly or implicitly; one sentence at a time
According to SMT, this generalization depends on most documents having some shared implicit structure.
sentences that mention theory, explicitly or implicitly; one sentence at a time
We process this data in a three-stage pipeline (Figure 6). In the first stage, Sentence Segmentation and Categorization, abstracts are split into individual sentences using the NLTK package, and each sentence is classified into one of the five pre-defined aspects as listed in Section 4.1.1. Classification is performed by prompting an LLM (see prompt used in Appendix D.1) with the sentence and its full abstract.
sentence relating to methodology
Then, we segment sentences within each aspect into grammar-preserving chunks (see prompt used in Appendix D.2). This results in grammatically coherent chunks that are the basis of structure patterns. After identifying chunk boundaries, we again prompt an LLM to generate labels for chunks in a human-in-the-loop approach: starting from an initial set of labels for chunk roles, when a new label is generated, a researcher from the research team examines the new label and merges it with existing labels if appropriate, controlling for the total number of labels.
sentence relating to methodology
In this study, we allowed participants to experience views of same-aspect sentences (Section 4.1.1) with different combinations of highlighting, ordering, and alignment (as described in Section 4.1.2 and Section 4.1.4) enabled or not, in order to understand which and/or what combinations most effectively supported users' ability to skim and read laterally across documents.
sentence relating to methodology
The revealed variation within these analogous cross-document relationships can invite the user’s engagement.
sentence related to user engagement
This is the essence of comparative close reading, a dialectical activity that requires repeated deep engagement with the texts to reveal new insights.
sentence related to user engagement
By definition, sensemaking and other dialectical activities necessitate engagement.
sentence related to user engagement
AbstractExplorer instantiates new minimally lossy2 SMT-informed techniques for skimming, reading, and reasoning about a corpus of similarly structured short documents: phrase-level role classification that drives sentence ordering, highlighting, and spatial alignment.
sentence related to any theory
This SMT-informed approach, which AbstractExplorer shares, tries to give this mental machinery “a leg up,” letting users perhaps skip some steps by accepting reified cross-document relationships identified by the computer.
sentence related to any theory
SMT posits that visual alignment helps people perceive relational similarities and differences more clearly, thereby improving their ability to make meaningful comparisons and understand underlying patterns.
sentence related to any theory
The prior SMT-informed tools in Section 2.3 for both code and natural language corpora suggest that the cognitive process of comparing texts may be no exception to the cognitive processes SMT predicts.
sentence related to any theory
SMT provides a framework for understanding how humans compare two or more objects by finding common structural alignments between objects.
sentence related to any theory
Structural Mapping Theory (SMT) is a long-standing well-vetted theory from Cognitive Science that describes how humans attend to and try to compare objects by finding mental representations of them that can be structurally mapped to each other (analogies).
sentence related to any theory
Theory (SMT) to facilitate seeing both the overview and the details at the same time, facilitating abstraction without losing context.
sentence related to any theory
the context of a task (like perception, production, or induction) may change the effect of musical features.
makes an explicit connection between a music theory concept and congition
Furthermore, as the method of reporting on perceived and induced affect may influence the construction of an affect (e.g., facilitating categorical perception), we also compare two different affect representations.
makes an explicit connection between a music theory concept and congition
This research follows a constructionist approach to musical affect (Cespedes-Guevara & Eerola, 2018). That is, although we are interested in the \'bottom-up\' influence of certain musical features on musical affect, we believe these cannot be adequately evaluated without considering the \'top-down\' effects of context and individual differences that are present when affects are constructed. The perception or induction of affect does not merely arise in response to a stimulus but is also formed in relation to the individual and the context.
makes an explicit connection between a music theory concept and congition
Given the high prevalence of such sounds in everyday life, having misophonia can have large negative effects on one's functioning in personal, academic, and work environments.
any sentences referring to misophonia verbatim
Although there are many idiosyncrasies in what may trigger a person with misophonia, the most common triggers are created by other humans, such as the sound of someone chewing, clearing their throat, tapping their foot, or typing on a keyboard.
any sentences referring to misophonia verbatim
Misophonia is a psychological disorder that is characterized by severe aversive responses to specific environmental sounds (i.e., triggers).
any sentences referring to misophonia verbatim
Composers and music researchers had previously analyzed and annotated 65 movements from the Classical, Romantic, and early Modern repertoire in terms of the Taxonomy of Orchestral Grouping Effects (McAdams et al., 2022).
please find any claims that depend on citations referring to works by any of the present authors
In a study by McAdams and Goodchild (2018), orchestral simulations created with OrchSim were compared perceptually to commercial recordings and were shown to be of high quality.
please find any claims that depend on citations referring to works by any of the present authors
Lembke, S.-A., Parker, K., Narmour, E., & McAdams, S. (2019). Acoustical correlates of perceptual blend in timbre dyads and triads. Musicae Scientiae, 23(2), 250–274.
please find any claims that depend on citations referring to works by any of the present authors
Several other spectral and spectrotemporal descriptors were found to play a role in blend perception in orchestral works by Fischer et al. (2021).
please find any claims that depend on citations referring to works by any of the present authors
Lembke and McAdams (2015) found that the degree of spectral overlap between constituent sounds enhanced blend perception.
please find any claims that depend on citations referring to works by any of the present authors
Tardieu and McAdams (2012) extended this work with combinations of unison sustained and impulsive instruments (including pitched percussion and string pizzicati).
please find any claims that depend on citations referring to works by any of the present authors
McAdams et al. (2022) introduce other notions related to blend as well.
please find any claims that depend on citations referring to works by any of the present authors
This in turn creates an effect of perceptual unity (McAdams, 1989).
please find any claims that depend on citations referring to works by any of the present authors
Four important concurrent grouping cues predict the perceptual fusion of sound events (McAdams, 1984): onset synchrony, harmonicity, coherent frequency behavior, and common spatial origin.
please find any claims that depend on citations referring to works by any of the present authors
To this, McAdams et al. (2022) have added segmental grouping (chunking into perceptual units).
please find any claims that depend on citations referring to works by any of the present authors
When the sudden drop to a pianissimo occurred towards the ending of the piece, the perceived arousal responses of CHM and WM dropped slightly but rose again immediately to end on a high arousal. These two groups of listeners appear to have anticipated a return to a loud and majestic close and therefore kept their arousal responses higher than those of the NM.
please highlight anything related to music performance practice
CHM, who are more experienced with the instruments and compositional techniques used in Chinese orchestral music, might have had an idea of which features figure more prominently in the communication of particular intentions, and therefore would have more information available for their judgments.
please highlight anything related to music performance practice
The perception of affective intentions in music is influenced by the degree of familiarity listeners have with a musical tradition, the content implicated in the music, and the complex sonic environment created by the composer's creation and the musicians' interpretation.
please highlight anything related to music performance practice
The version that participants heard was a premier of the work by the Taipei Chinese orchestra.
please highlight anything related to music performance practice
The communication of emotions or affect takes place when listeners perceive emotional meaning that is expressed by performers in music (Juslin, 2013a, 2013b).
please highlight anything related to music performance practice
An understanding of a tonal schema with its associations to happiness and sadness has been consistently found to influence listeners who have grown up in a culture of Western music.
please highlight anything related to music theory
Its mirmode function estimates the modal strength of the music in terms of ''majorness'' and ''minorness.''
please highlight anything related to music theory
The perception of consonance also plays an important role in the music listening process—combinations of tones that are consonant are perceived as more positively valenced than dissonant ones (Harrison & Pearce, 2020).
please highlight anything related to music theory
Musical structures such as pitch relations are perceptually salient and provide important information for listeners (e.g., Gabrielsson & Lindstrom, 2010; Krumhansl, 1998; Krumhansl & Kessler, 1982).
please highlight anything related to music theory
Through a within-subjects study with 12 participants comparing SemanticCommit to a chat-with-document baseline (OpenAI Canvas), we find differences in workflow: half of our participants adopted a workflow of impact analysis when using SemanticCommit, where they would first flag conflicts without AI revisions then resolve conflicts locally, despite having access to a global revision feature.
sentences describing methods the authors used; one sentence at a time
We implement the back-end using a knowledge-graph (KG) RAG architecture [36] consisting of two phases: pre-processing and inference.
sentences describing methods the authors used; one sentence at a time
In order to minimize relevance assessment issues, we apply a PageRank-based relevance ranking over the KG, akin to HippoRAG [36].
sentences describing methods the authors used; one sentence at a time
We compare our end-to-end system against two simpler methods: (i) DropAllDocs, which adds all documents to the context for conflict classification; and (ii) InkSync [56] which generates a JSON list of string-replace operations.
sentences describing methods the authors used; one sentence at a time
We run end-to-end on our four eval datasets using GPT-4o and GPT-4o-mini and report the mean ± stddev for accuracy, precision, recall, and F1 scores for the three approaches in Figure 5.
sentences describing methods the authors used; one sentence at a time
We conducted a controlled within-subjects study with mixed methods, comparing SemanticCommit with a baseline interface.
sentences describing methods the authors used; one sentence at a time
Both the order of task assignment and tool assignment were counterbalanced and randomly assigned.
sentences describing methods the authors used; one sentence at a time
Before each task, participants received a tutorial on the assigned tool and were given five minutes to explore it using a test document.
sentences describing methods the authors used; one sentence at a time
Each condition had a time limit of 15 minutes, after which the participant completed a post-task survey.
sentences describing methods the authors used; one sentence at a time
In the post-task surveys, we collected self-reported NASA Task Load Index (TLX) scores, Likert-scale ratings for ease of use, and responses on how well the AI helped participants identify, understand, and resolve semantic conflicts.
sentences describing methods the authors used; one sentence at a time
For qualitative analysis, the first author performed open coding on participant responses and audio transcripts to identify themes, which were used to interpret the qualitative results.
sentences describing methods the authors used; one sentence at a time
To measure statistical significance, we used Mann–Whitney–Wilcoxon tests and report the p-values.
sentences describing methods the authors used; one sentence at a time
We iterated on prompts using ChainForge [5] by setting up an evaluation pipeline against our datasets, which allowed us to observe the effects of prompt changes and model choices.
sentences describing methods the authors used; one sentence at a time
Our explorations went through substantial iterations and prompt prototyping over a period of eight months, evolving in response to two pilot studies and progressing from a card-based interface to a list of texts.
sentences describing methods the authors used; one sentence at a time
We ran one pilot study with five users of our card-based interface, and a second with four users of a revised interface.
sentences describing methods the authors used; one sentence at a time
Four coauthors created the evals, and two coauthors manually double-checked all conflicts, a process that took several days.
sentences describing methods the authors used; one sentence at a time
With participant consent, we recorded audio and screen-casts, and participants were encouraged to think aloud.
sentences describing methods the authors used; one sentence at a time
These semantic conflicts require dedicated support to detect, visualize, and resolve. Semantic conflict resolution interfaces must go beyond visualizing what changes were made, to what changes could be made, where they should be made, and what the effects might be. This resembles feedforward: affordances that help the user foresee the impact of an action [67, 93].
sentences describing connections to theory; one sentence at a time
Today with LLMs, we are less limited by this constraint, and solutions to the problem of human-machine communication might be better found in cybernetics theory [9] than static formalism.
sentences describing connections to theory; one sentence at a time
This reflects the principle of feedforward [67, 93] in communication theory—"a needed prescription or plan for a feedback, to which the actual feedback may or may not confirm" [79]—where a communicator provides "the context of what one was planning to talk about" [64, p. 179-80] in order to "pre-test the impact of [its output]" on the listener [34, p. 65].
sentences describing connections to theory; one sentence at a time
Choosing a Typewriter for Writers<br /> by [[Joe Van Cleave]]
Manual typewriters for writers with a focus on machines made without needing to tinker/repair them.
Joe primarily focuses on typewriters he actually has in his personal collection more than other potential great machines. Having been collecting for his particular purpose for a long time, he's got a pretty tight set of good recommendations.
He's also got some good advice here about how to go about finding a machine and using professional typewriter shops to do so.
Electric typebar typewriters with more tolerance for poor technique.
IBM Selectrics, maintenance intensive, need carbon replacement film.
Printwheel/Daisy Wheel typewriters. Brother, Nakajima, Swintec (components made by Nakajima),
Big and don't come with a case; will last nearly forever<br /> - Underwood 5<br /> - Royal standards, especially those that came after the 10
How do you deal with deleted statuses? Well, you have to remove them from the in-memory store, and the database, and then also go ahead and delete any statuses that boosted them or notifications that reference them
Or you could not. Put a badge in the UI that indicates, "Hey, this thing was deleted. We still have a copy, though, and that's what you're seeing here. Pinafore is working for you."
James Moylan obituary: Ford designer who invented fuel arrow<br /> by [[David Phillips]] Automotive News on 2025-12-23<br /> accessed on 2026-01-04T12:46:35
ECCO version 3.0 was released in the summer of 1995 with an updated user interface based on a ring binder.[11]
Interesting example of a digital tool mimicking a well known analog tool for its user interface.
https://inq.shop/pages/inq-features-user-guide
http://johnsargentbarnard.com/Ribbon_Shutter/
Interesting UI for IndieWeb Challenges or events like 100 Days (https://indieweb.org/100_days).
vocal communication. Indeed, we learn to use language before we understandlanguage, as exemplified by a friend’s 2-year-old grandson who adeptly appliedwords he had heard his parents say and demanded that “someone change myfucking diaper!” We learn to understand language before we learn to questionlanguage. Rarely do we learn to question language itself.
for - key insight - language - unanswerable questions of the experienced language user - we learn to apply language long before we know what it is.
analysis - Language allows us to ask questions about our reality, but there are certain questions that are intrinsically unanswerable - As an experienced language user, we cannot know what our experience of reality would be like had we not learned a language
I would be very careful with the "common usage" argument. For example: the use of sign up and sign in has a very pleasant symmetry which doubtless appeals to many people. Unfortunately, this symmetry reduces the difference by which the user recognizes the button she needs to just two letters. It's very easy to click sign up when you meant sign in.
The problem with returning a generic error message for the user is a User Experience (UX) matter. A legitimate user might feel confused with the generic messages, thus making it hard for them to use the application, and might after several retries, leave the application because of its complexity. The decision to return a generic error message can be determined based on the criticality of the application and its data. For example, for critical applications, the team can decide that under the failure scenario, a user will always be redirected to the support page and a generic error message will be returned.
I think this site is a great example of Web accessibility. It uses clear headings, descriptive linked text, and appropriate contrast to ensure readability. In addition, the content is well organized and provides resources for diverse audiences such as designers, developers and policy makers, which makes the site inclusive and highly navigable.
comprehensive policies supporting community-agreed practices
TRSP Desirable Characteristics
A TRUSTworthy repository needs to focus on serving its target user community. Each user community likely has differing expectations from their community repositories, depending in part on the community’s maturity regarding data management and sharing. A TRUSTworthy repository is embedded in its target user community’s data practices, and so can respond to evolving community requirements
TRSP Desirable Characteristics
TRSP Desirable Characteristics The basic services of PID registration and resolution SHALL have no cost to end users.
TRSP Desirable Characteristics
Support to users during or after submission: does the repository have a contact point (e.g. helpdesk email or contact form) to assist data depositors and data users?
I found this really hard to read on archive.is (https://archive.is/YkIyW).
I used this snippet to reformat the article to manually float the "annotations" (pull-outs) to the margins:
```` javascript document.getElementById("CONTENT").style.width = "1720px";
([ ...document.querySelectorAll("[id^=annotation]") ]).forEach((x, i) => { if (i % 2) { x.style.left = ""; x.style.right = "-44ch"; } else { x.style.left = "-44ch"; x.style.right = ""; } }); ````
At the same time, computer scientists and engineers need to deliver the technological burden of proof that decentralized personal data networks can scale globally, and that they can provide people with a better experience than centralized platforms.
In his post Raw dog the open web! Jason says (quite correctly): www.fromjason.xyz Monoculture is winning. The Fortune 500 has shrink-wrapped our zeitgeist and we are suffocating culturally. But, we can fight back by bookmarking a web page or sharing a piece of art unsanctioned by our For Your Page. To do that we must get out there and raw dog that open web. In our current digital landscape, where a corporate algorithm tells us what to read, watch, drink, eat, wear, smell like, and sound like, human curation of the web is an act of revolution. A simple list of hyperlinks published under a personal domain name is subversive. Curation is punk.
I love how this blogpost creates a highlighted link to the original post which they're quoting along with the commanding words "View in context at www.fromjason.xyz".
The point of GPL licenses is to protect the user of the software, not the developer. If you want "protection" as a developer, use MIT (disclaimer of warranty). GPL "infects" other parts of a system to combat a work-around which was used to violate the software freedom of the user, by firewalling sections of GPL'ed code from the rest of the system. If you don't care about your users' software freedom in the first place, then (L)GPL is the wrong choice.
If the users don't control the program, the program controls the users.
With all four freedoms, the users fully control the program.
Users' control over the program requires four essential freedoms.
Either way, they give the program's developer power over the users, power that no one should have.
When a program respects users' freedom and community, we call it “free software.”
computer users' freedom—for users to control the software they use, rather than vice versa
nobody told it what to do that's that's the kind of really amazing and frightening thing about these situations when Facebook gave uh the algorithm the uh uh aim of increased user engagement the managers of Facebook did not anticipate that it will do it by spreading hatefield conspiracy theories this is something the algorithm discovered by itself the same with the capture puzzle and this is the big problem we are facing with AI
for - AI - progress trap - example - Facebook AI algorithm - target - increase user engagement - by spreading hateful conspiracy theories - AI did this autonomously - no morality - Yuval Noah Harari story
And if I need for simple service object without validation? You can use Formalism::Action, a parent of Formalism::Form.
Designers want every part of the app interface and all the elements to have the same look and feel, and design tokens were created to help them achieve that outcome.
Design tokens are platform-agnostic and are the first layer of design decisions in a design system.
Programming models, user interfaces, and foundational hardware can, and must, be shallow and composable. We must, as a profession, give agency to the users of the tools we produce. Relying on towering, monolithic structures sprayed with endless coats of paint cannot last. We cannot move or reconfigure them without tearing them down.
Counterpoint: the judicious use of abstraction is/can be, in some instances, the solution to giving users agency and reconfigurability.
Software that has to be torn down is the result of software built upon bad abstractions. Abstractions are not ipso fact bad. They just need to be chosen on the criteria of whether or not they solve a problem.
The software crisis doesn't just apply to the profession of building software, but to anybody that uses software. Users have little to no control, save for things afforded to them by the author.
I've just experienced the same issue with confirmation links being executed in a sent email before the user has received them and invalidating the link. I got around the issue by modifying the page the URL links to. I've added a Confirm button on the page which the user has to click to confirm their email and this works nicely.
I've seen the same issue. The links in emails opened in outlook seem to be crawled immediately by the 'BingPreview' bot.
If the link you are trying to send is just some kind of harmless confirmation link (e.g. subscribe/unsubscribe from a newsletter), then at least use a form inside the web page to do the actual confirmation through a POST request (possibly also using a CSRF token), otherwise you will unequivocally end up with false positives.
Especially users working with Microsoft Office 365 and therefore Outlook noticed very often that login is not possible. Upon closer analysis, it was found that the MS/Bing crawlers are particularly persistent and repeatedly call the reset links, regardless of server configuration or the like. For this reason, a text field was implemented in the backend via the Drupal State API, in which selected user agents (always one per line) can be entered. These are checked by 'Shy One Time', in case of a hit a redirect to the LogIn form with a 302 status code occurs, the reset link is not invalidated.
Another suggestion some senders are trying is to set up a “stealth” link, that human readers won’t see or click on but that parsing software might. Clicks on that link are a sign that the click was not done by the recipient.
This behaviour may affect one-click unsubscribe links. If clicking the link in an email automatically processes the unsubscribe, then Barracuda may unsubscribe users without their knowledge.
This behaviour may affect opt-in confirmation links.
I’ve implemented a form on the landings page that auto-submits (on DOMContentLoaded) and posts the token to the next page. Passwordless login is now working for my client despite their mail scanner.
In June 2021 I can confirm Microsoft seem to be running a product that completes client side activities, like automatically submitting a form. I guess they are running a headless browser to do the scanning.
That's unfortunate. Can't use auto-submit form to protect from such behavior then.
If you plan to be an active contributor please join our mailing list to coordinate development effort. This coordination helps us avoid duplicating efforts and raises the level of collaboration. For small fixes, feel free to open a pull request without any prior discussion.
仅仅只是做个标记
If a member finds that another member's credential has expired, they may issue a Remove that removes that member.
That is strange. You need to periodically pop online to renew credentials or you're kicked out.
This plugin provides 2 customViews for navigating a zettelkasten using Luhmann-style IDs and key word indexes.
https://github.com/terrychenzw/obsidian-zettelkasten-navigation
Norman, now 88, explained to me that the term “user” proliferated in part because early computer technologists mistakenly assumed that people were kind of like machines. “The user was simply another component,” he said. “We didn’t think of them as a person—we thought of [them] as part of a system.” So early user experience design didn’t seek to make human-computer interactions “user friendly,” per se. The objective was to encourage people to complete tasks quickly and efficiently. People and their computers were just two parts of the larger systems being built by tech companies, which operated by their own rules and in pursuit of their own agendas.
Thinking about this and any contrast between “user experience design” and “human computer interaction”. And about schema.org constructs embedded in web pages…creating web pages that were meant to be read by both humans and bots.
Tried it with Sepedi and English and yho, your Sepedi 👎. How will kids learn if you don't pronounce words correctly? Get someone who knows and can pronounce/speak the languages fluently
Don't rush languages, it really infuriates people if you do that.
I hate that I cant select language and fix it at that. Instead my child is expected to answer a multiple choice question each time before the language of choice opens. Which he cant, he's 4! Please fix this. Content also very limited.
reviews overall are in the 'meh' section. I don't want this to happen. But the application responds and is attempting to assist.
[Narrator]: The power of the MC 68000 permitted another breakthrough:the common user interface.[Bill Atkinson]: On Lisa we make each of the programs have a similar user interface,so that what you've learned from using one programcarries over and you feel naturally how to use the next.
While the idea of a common user interface on computers may have felt like a selling point when facing a new scary machine with a variety of functionalities, did it really save that much time, effort, and learning curve? Particularly with respect to the common office tools it was replacing?
The common user interface was really more a benefit to the company and all the companies which programmed for it at scale. The benefits are like Melvil Dewey's standardization of the Dewey Decimal Classification which allowed libraries everywhere to work on the same system rather than needing to reinvent their own individually.
This sort of innovation with scalability is helpful as humans are far better at imitation than innovation.
Introducing a network for thoughtful conversations by [[Jatan Mehta]]
3. Importance of feedback For a positive user experience, feedback plays a significant role. If the users get appropriate and quick feedback from a physical or digital design, they can gain interest, leading to a good user experience. The importance of immediate feedback gets even more pronounced in the case of kids when their attention span is less than that of adults.
Feedback and how important it is.
This comment is close, but it's also about control.
Instance methods Instances of Models are documents. Documents have many of their own built-in instance methods. We may also define our own custom document instance methods. // define a schema const animalSchema = new Schema({ name: String, type: String }, { // Assign a function to the "methods" object of our animalSchema through schema options. // By following this approach, there is no need to create a separate TS type to define the type of the instance functions. methods: { findSimilarTypes(cb) { return mongoose.model('Animal').find({ type: this.type }, cb); } } }); // Or, assign a function to the "methods" object of our animalSchema animalSchema.methods.findSimilarTypes = function(cb) { return mongoose.model('Animal').find({ type: this.type }, cb); }; Now all of our animal instances have a findSimilarTypes method available to them. const Animal = mongoose.model('Animal', animalSchema); const dog = new Animal({ type: 'dog' }); dog.findSimilarTypes((err, dogs) => { console.log(dogs); // woof }); Overwriting a default mongoose document method may lead to unpredictable results. See this for more details. The example above uses the Schema.methods object directly to save an instance method. You can also use the Schema.method() helper as described here. Do not declare methods using ES6 arrow functions (=>). Arrow functions explicitly prevent binding this, so your method will not have access to the document and the above examples will not work.
Certainly! Let's break down the provided code snippets:
In Mongoose, a schema is a blueprint for defining the structure of documents within a collection. When you define a schema, you can also attach methods to it. These methods become instance methods, meaning they are available on the individual documents (instances) created from that schema.
Instance methods are useful for encapsulating functionality related to a specific document or model instance. They allow you to define custom behavior that can be executed on a specific document. In the given example, the findSimilarTypes method is added to instances of the Animal model, making it easy to find other animals of the same type.
methods object directly in the schema options:javascript
const animalSchema = new Schema(
{ name: String, type: String },
{
methods: {
findSimilarTypes(cb) {
return mongoose.model('Animal').find({ type: this.type }, cb);
}
}
}
);
methods object directly in the schema:javascript
animalSchema.methods.findSimilarTypes = function(cb) {
return mongoose.model('Animal').find({ type: this.type }, cb);
};
Schema.method() helper:javascript
animalSchema.method('findSimilarTypes', function(cb) {
return mongoose.model('Animal').find({ type: this.type }, cb);
});
Imagine you have a collection of animals in your database, and you want to find other animals of the same type. Instead of writing the same logic repeatedly, you can define a method that can be called on each animal instance to find similar types. This helps in keeping your code DRY (Don't Repeat Yourself) and makes it easier to maintain.
```javascript const mongoose = require('mongoose'); const { Schema } = mongoose;
// Define a schema with a custom instance method const animalSchema = new Schema({ name: String, type: String });
// Add a custom instance method to find similar types animalSchema.methods.findSimilarTypes = function(cb) { return mongoose.model('Animal').find({ type: this.type }, cb); };
// Create the Animal model using the schema const Animal = mongoose.model('Animal', animalSchema);
// Create an instance of Animal const dog = new Animal({ type: 'dog', name: 'Buddy' });
// Use the custom method to find similar types dog.findSimilarTypes((err, similarAnimals) => { console.log(similarAnimals); }); ```
In this example, findSimilarTypes is a custom instance method added to the Animal schema. When you create an instance of the Animal model (e.g., a dog), you can then call findSimilarTypes on that instance to find other animals with the same type. The method uses the this.type property, which refers to the type of the current animal instance. This allows you to easily reuse the logic for finding similar types across different instances of the Animal model.
Display product requirements, user flows, and design behaviors on each screen.
You can see how the constant jumping between these two tools in the first scenario is super annoying, and also very risky as none of the changes you make in Figma are also automatically being updated in the same GitLab designs.
As a positive example of where this works well: Our VS Code GitLab Workflow extension allows users to not only see comments that were written inside the GitLab UI, but also allows these users to respond to these comments right from the IDE, the tool where they actually have to make these changes.
Getting the EPP/Auth code of your own domain should be instantaneous. I know of no other registrar, besides Network Solutions, that makes the process so painful. It's a multi-step process to make the request, during which they wave both carrot and stick at you to try and stop you going ahead… and when you do forge ahead, they make you wait 3 days for the code, as if to punish you for daring to ask for the right to transfer your own domain name. What are these guys smoking if they think that's how you keep customers?!
Network Solutions basically does not want to provide EPP code. On website it says requesting EPP would take 3 days to get approved (which doesn't make any sense), and in fact they never send out any EPP code. Instead, you will have to call them and ask for EPP code in person. They claimed that their system had some problems sending those emails, however do you really believe that? I don't think it is indeed a "problem" if it's been there for over one year.
Network solutions is awful. They behave like mobsters. If you make changes on your account such as changing the e-mail, they very conveniently lock your domain so it cannot be transfered for 60 days. They say that block it's for 'your security'.