448 Matching Annotations
  1. May 2022
    1. n modern terms, we could say that the Llullian art had been designed to function as a type of search engine. As Llull himself admitted at the beginning of his Ars brevis, the purpose of the mechanism was to provide the user with the opportunity to have a prompt answer to any question, provided that the user knew the meaning of the word used for the query.51 This definition could describe Google as well.

      I've long been contemplating the modern-day equivalents of the Llullian combinatorial arts and the relationship to Google's search engine is an interesting one.

      I've always said that knowing the name of the thing you're searching for (especially in technical settings) will aid you immeasurably in finding it. For example, if you're searching for content management systems and don't know that as their name or even the name of an example, you're unlikely to be very successful.

    Tags

    Annotators

    1. Also, keep in mind that you don’t need to find a single app to fulfill all your needs. You might use more than one tool at a time depending on the use case.

      It's true that each note taking application may be purpose fit for a particular use, but having a single store for all of your notes is incredibly important for future search and re-discovery. Keeping one's notes across a range of applications is disaster waiting to happen, at least until there is a bigger aggregate search function that can search across multiple platforms.

    1. For the first time in history, we have instantaneous access to theworld’s knowledge.

      While we may have the impression of instant access to the world's knowledge, this is really far from the truth. It's all there, but being able to search through it for what we want or being able to find or generate insight from it involves a massive mountain of hidden work that no one really wants to do in practice.

    Tags

    Annotators

  2. Apr 2022
    1. Open Knowledge Maps, meanwhile, is built on top of the open-source Bielefeld Academic Search Engine, which boasts more than 270 million documents, including preprints, and is curated to remove spam.

      Open Knowledge Maps uses the open-source Bielefeld Academic Search Engine and in 2021 indicated that it covers 270 million documents including preprints. Open Knowledge Maps also curates its index to remove spam.


      How much spam is included in the journal article space? I've heard of incredibly low quality and poorly edited journals, so filtering those out may be fairly easy to do, but are there smaller levels of individual spam below that?

    2. Another visual-mapping tool is Open Knowledge Maps, a service offered by a Vienna-based not-for-profit organization of the same name. It was founded in 2015 by Peter Kraker, a former scholarly-communication researcher at Graz University of Technology in Austria.

      https://openknowledgemaps.org/

      Open Knowledge maps is a visual literature search tool that is based on keywords rather than on a paper's title, author, or DOI. The service was founded in 2015 by Peter Kraker, a former scholarly communication researcher at Graz University of Technology.

    3. In 2019, Smolyansky co-founded Connected Papers, one of a new generation of visual literature-mapping and recommendation tools.

      https://www.connectedpapers.com/

      https://twitter.com/ConnectedPapers


      Something about the name Connected Papers reminds me of the same sort of linking name that Manfred Kuehn gave to his note taking software ConnectedText.

    1. Algospeak refers to code words or turns of phrase users have adopted in an effort to create a brand-safe lexicon that will avoid getting their posts removed or down-ranked by content moderation systems. For instance, in many online videos, it’s common to say “unalive” rather than “dead,” “SA” instead of “sexual assault,” or “spicy eggplant” instead of “vibrator.”

      Definition of "Algospeak"

      In order to get around algorithms that demote content in social media feeds, communities have coined new words or new meanings to existing words to communicate their sentiment.

      This is affecting TikTok in particular because its algorithm is more heavy-handed in what users see. This is also causing people who want to be seen to tailor their content—their speech—to meet the algorithms needs. It is like search engine optimization for speech.

      Article discovered via Cory Doctorow at The "algospeak" dialect

    1. 3. Who are you annotating with? Learning usually needs a certain degree of protection, a safe space. Groups can provide that, but public space often less so. In Hypothes.is who are you annotating with? Everybody? Specific groups of learners? Just yourself and one or two others? All of that, depending on the text you’re annotating? How granular is your control over the sharing with groups, so that you can choose your level of learning safety?

      This is a great question and I ask it frequently with many different answers.

      I've not seen specific numbers, but I suspect that the majority of Hypothes.is users are annotating in small private groups/classes using their learning management system (LMS) integrations through their university. As a result, using it and hoping for a big social experience is going to be discouraging for most.

      Of course this doesn't mean that no one is out there. After all, here you are following my RSS feed of annotations and asking these questions!

      I'd say that 95+% or more of my annotations are ultimately for my own learning and ends. If others stumble upon them and find them interesting, then great! But I'm not really here for them.

      As more people have begun using Hypothes.is over the past few years I have slowly but surely run into people hiding in the margins of texts and quietly interacted with them and begun to know some of them. Often they're also on Twitter or have their own websites too which only adds to the social glue. It has been one of the slowest social media experiences I've ever had (even in comparison to old school blogging where discovery is much higher in general use). There has been a small uptick (anecdotally) in Hypothes.is use by some in the note taking application space (Obsidian, Roam Research, Logseq, etc.), so I've seen some of them from time to time.

      I can only think of one time in the last five or so years in which I happened to be "in a text" and a total stranger was coincidentally reading and annotating at the same time. There have been a few times I've specifically been in a shared text with a small group annotating simultaneously. Other than this it's all been asynchronous experiences.

      There are a few people working at some of the social side of Hypothes.is if you're searching for it, though even their Hypothes.is presences may seem as sparse as your own at present @tonz.

      Some examples:

      @peterhagen Has built an alternate interface for the main Hypothes.is feed that adds some additional discovery dimensions you might find interesting. It highlights some frequent annotators and provide a more visual feed of what's happening on the public Hypothes.is timeline as well as data from HackerNews.

      @flancian maintains anagora.org, which is like a planet of wikis and related applications, where he keeps a list of annotations on Hypothes.is by members of the collective at https://anagora.org/latest

      @tomcritchlow has experimented with using Hypothes.is as a "traditional" comments section on his personal website.

      @remikalir has a nice little tool https://crowdlaaers.org/ for looking at documents with lots of annotations.

      Right now, I'm also in an Obsidian-based book club run by Dan Allosso in which some of us are actively annotating the two books using Hypothes.is and dovetailing some of this with activity in a shared Obsidian vault. see: https://boffosocko.com/2022/03/24/55803196/. While there is a small private group for our annotations a few of us are still annotating the books in public. Perhaps if I had a group of people who were heavily interested in keeping a group going on a regular basis, I might find the value in it, but until then public is better and I'm more likely to come across and see more of what's happening out there.

      I've got a collection of odd Hypothes.is related quirks, off label use cases, and experiments: https://boffosocko.com/tag/hypothes.is/ including a list of those I frequently follow: https://boffosocko.com/about/following/#Hypothesis%20Feeds

      Like good annotations and notes, you've got to put some work into finding the social portion what's happening in this fun little space. My best recommendation to find your "tribe" is to do some targeted tag searches in their search box to see who's annotating things in which you're interested.

    1. Yeshiva teaching in the modern period famously relied on memorization of the most important texts, but a few medieval Hebrew manu-scripts from the twelfth or thirteenth centuries include examples of alphabetical lists of words with the biblical phrases in which they occurred, but without pre-cise locations in the Bible—presumably because the learned would know them.

      Prior to concordances of the Christian Bible there are examples of Hebrew manuscripts in the twelfth and thirteenth centuries that have lists of words and sentences or phrases in which they occurred. They didn't include exact locations with the presumption being that most scholars would know the texts well enough to quickly find them based on the phrases used.


      Early concordances were later made unnecessary as tools as digital search could dramatically decrease the load. However these tools might miss the value found in the serendipity of searching through broad word lists.

      Has anyone made a concordance search and display tool to automatically generate concordances of any particular texts? Do professional indexers use these? What might be the implications of overlapping concordances of seminal texts within the corpus linguistics space?

      Fun tools like the Bible Munger now exist to play around with find and replace functionality. https://biblemunger.micahrl.com/munge

      Online tools also have multi-translation versions that will show translational differences between the seemingly ever-growing number of English translations of the Bible.

  3. Mar 2022
    1. Human minds are made of memories, and today those memories have competition. Biological memory capacities are being supplanted, or at least supplemented, by digital ones, as we rely on recording—phone cameras, digital video, speech-to-text—to capture information we’ll need in the future and then rely on those stored recordings to know what happened in the past. Search engines have taken over not only traditional reference materials but also the knowledge base that used to be encoded in our own brains. Google remembers, so we don’t have to. And when we don’t have to, we no longer can. Or can we? Remembering and Forgetting in the Age of Technology offers concise, nontechnical explanations of major principles of memory and attention—concepts that all teachers should know and that can inform how technology is used in their classes. Teachers will come away with a new appreciation of the importance of memory for learning, useful ideas for handling and discussing technology with their students, and an understanding of how memory is changing in our technology-saturated world.

      How much history is covered here?

      Will mnemotechniques be covered here? Spaced repetition? Note taking methods in the commonplace book or zettelkasten traditions?

    1. Improved search engines will help for devoted searchers, but promoting a new contribution in the marketplace of ideas still takes effort. Improved methods to reach the right audiences will begin with email and newsgroups, but should be extended by nov el group formation methods.

      Search and discovery are hard problems in an ever increasing informational overload riddled society.

    1. Ask and ye shall receive!

      https://uxdesign.cc/the-power-of-seeing-only-the-questions-in-a-piece-of-writing-8f486d2c6d7d

      I made a web tool that takes a piece of writing and strips out everything but the questions: https://t.co/i8FsoFwPt4<br><br>Here's the first chapter of Moby Dick<br><br>Quite fascinating to see this aspect of a literary style pic.twitter.com/tHrHA7jsdW

      — Clive Thompson (@pomeranian99) February 27, 2022
      <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>

      Link to [[searching for questions while reading]]

  4. Feb 2022
  5. Jan 2022
    1. Budak, C., Soroka, S., Singh, L., Bailey, M., Bode, L., Chawla, N., Davis-Kean, P., Choudhury, M. D., Veaux, R. D., Hahn, U., Jensen, B., Ladd, J., Mneimneh, Z., Pasek, J., Raghunathan, T., Ryan, R., Smith, N. A., Stohr, K., & Traugott, M. (2021). Modeling Considerations for Quantitative Social Science Research Using Social Media Data. PsyArXiv. https://doi.org/10.31234/osf.io/3e2ux

    1. search can’t be fixed solely by focusing on the interface or engine, because it depends upon the foundation of content and metadata, which in turn are shaped by governance, incentives, and metrics.

      Metadata supports search

    1. https://www.goedel.io/p/tools-for-thought-but-not-for-search

      Searching for two ingredients in an effort to find a recipe that will allow their use should be de rigueur in a personal knowledge manager, sadly it doesn't appear to be the case.


      This sort of simple search not working in these tools is just silly.

      They should be able to search across blocks, pages, and even provide graph views to help in this process. Where are all the overlaps of these words within one's database?

    1. Twitter search filter Search results filter:follows tweets only from accounts you follow filter:news tweets containing news filter:links tweets containing links filter:images tweets containing images filter:videos tweets containing videos filter:periscope tweets containing Periscope videos filter:retweets classic RT retweets or quote tweets filter:nativeretweets retweets via the retweet button filter:safe tweets excluding adult content filter:verified tweets from verified accounts

      These are generally useful. I've used most of them regularly for the past three years. In particular one of my primary modes of reading Twitter is with the link: https://twitter.com/search?q=filter%3Afollows%20-filter%3Areplies&src=typed_query&f=live

    1. But Google also uses optical character recognition to produce a second version, for its search engine to use, and this double process has some quirks. In a scriptorium lit by the sun, a scribe could mistakenly transcribe a “u” as an “n,” or vice versa. Curiously, the computer makes the same mistake. If you enter qualitas—an important term in medieval philosophy—into Google Book Search, you’ll find almost two thousand appearances. But if you enter “qnalitas” you’ll be rewarded with more than five hundred references that you wouldn’t necessarily have found.

      I wonder how much Captcha technology may have helped to remedy this in the intervening years?

    1. Serious reading will require just as much effort as it has always required.

      Reading is hard to disrupt.

      Speeding up and dramatically improving the reading process is incredibly difficult. No one has yet made really huge strides in this space. Google has made it imminently more accessible to the masses, but it still requires a lot of physical work and processing on our part.

    1. The Notetab-Zettelkasten has several major advantages over the paper-implementation: 1. It is much more difficult to misplace slips 1. It has a powerful search function

      Most digital note taking systems have two major advantages of paper versions:

      • It's harder to misplace material unless one's system has major flaws or one accidentally deletes content
      • Digital search is far more powerful and efficient than manual search
  6. Dec 2021
  7. Nov 2021
    1. I also did a bit of web and JSTOR research, and started a new Zotero folder called World History Comparison. Research Rabbit found a bunch of similar titles, but it will be a while before I can get to many of them. I DID, however, ask some people and groups such as the OE Global community on Twitter, and I want to extend that request to anyone who watches this video. I know a number of my subs and viewers are in India and I've noticed on Twitter and on Abhijit Chavda's channel that there's quite a bit of controversy about the way Indian History is taught to Indian students.

      Methods for attacking a research problem about history used here:

      • Web research
      • Journal database research
      • Zotero reference manager stub
      • Research Rabbit (AI search)
      • Reach out on various social media channels

      Not mentioned, but perhaps useful:

      • Standard library search (WorldCat)
      • Internet Archive search (scanned historical textbooks)
      • Off-label and dark web services (Library Genesis, Pirate Bay, etc.)
      • Open access and OER sources (this will probably find newer perspectives and newer texts which sometimes have philosophical outlines of what they're trying to change for the future versus the pedagogies of the past)
      • Current curricula and recommended textbooks at major universities on particular books and potential comparisons to those of the past (perhaps via Internet Archive).
    1. A fluorescence of note taking tools

      What is missing in this train of thought is search. The real challenge is recalling of information easily, whether that is traditional search or something more AI-ish that can uncover connections between items that I don't see myself. What I want is a tool that can search across all my repositories, and that requires either APIs for communication or standard data storage formats. I prefer APis.

    1. But dig into the story of anyone who has been a genuine victim of modern mob justice and you will often find not an obvious argument between “woke” and “anti-woke” perspectives but rather incidents that are interpreted, described, or remembered by different people in different ways, even leaving aside whatever political or intellectual issue might be at stake.

      Cancel culture and modern mob justice are possible as the result of volumes of more detail and data as well as large doses of context collapse.

      In some cases, it's probably justified to help level the playing field for those in power who are practicing hypocrisy, but in others, it's simply a lack of context by broader society who have kneejerk reactions which have the ability to be "remembered" by broader society with search engines.

      How might Google allow the right to forget to serve as a means of restorative justice?

    1. wn written cultures material is typically sorted alphabeticallySor by some other method of linguistic ordering such as the number ofstrokes in qhinese charactersTW or systematicallyW according to various sysXtems that strive to map or hierarchize the relations between the items storedSincluding those of uoogle or ffiahooTW or miscellaneouslyY

      What about the emergence of non-hierarchal methods? (Can these logically be sorted somehow without this structure?)

      With digital commonplacing methods, I find that I can sort and search for things temporally by date and time as well as by tag/heading.

      Cross reference:

  8. Oct 2021
  9. Sep 2021
    1. paradigmatic

      typical answer to something

    2. elucidatory

      give a clarifying expression

    3. Pragmatic

      to deal with something in a practical non-theoretical way.

    4. elucidation.

      another word for clarification

    1. The question is similar but its in a Rails context. The solutions would answer my question, but I'm almost certain that he could probably leverage Arel to solve his problem. The question I posted was designed purely as a Ruby question so that it was easier to search for. You might want to suggest an edit of the title of his question because it didn't show up when I searched for a solution to my problem.
  10. Aug 2021
  11. Jul 2021
    1. The idea behind the reference interview is for the librarian to respond to questions by asking questions in order to supply the best possible resource. Frequently, this is really useful because people will ask for the kind of information they think the librarian can find, rather than what they actually want.

      This is closely related to solving a particular well-defined problem rather than solving a more general problem.

    1. Ebooks don’t have those limitations, both because of how readily new editions can be created and how simple it is to push “updates” to existing editions after the fact. Consider the experience of Philip Howard, who sat down to read a printed edition of War and Peace in 2010. Halfway through reading the brick-size tome, he purchased a 99-cent electronic edition for his Nook e-reader:As I was reading, I came across this sentence: “It was as if a light had been Nookd in a carved and painted lantern …” Thinking this was simply a glitch in the software, I ignored the intrusive word and continued reading. Some pages later I encountered the rogue word again. With my third encounter I decided to retrieve my hard cover book and find the original (well, the translated) text. For the sentence above I discovered this genuine translation: “It was as if a light had been kindled in a carved and painted lantern …”A search of this Nook version of the book confirmed it: Every instance of the word kindle had been replaced by nook, in perhaps an attempt to alter a previously made Kindle version of the book for Nook use. Here are some screenshots I took at the time:It is only a matter of time before the retroactive malleability of these forms of publishing becomes a new area of pressure and regulation for content censorship. If a book contains a passage that someone believes to be defamatory, the aggrieved person can sue over it—and receive monetary damages if they’re right. Rarely is the book’s existence itself called into question, if only because of the difficulty of putting the cat back into the bag after publishing.

      This story of find and replace has chilling future potential. What if a dictatorial government doesn't like your content. It can be all to easy to remove the digital versions and replace them whole hog for "approved" ones.

      Where does democracy live in such a world? Consider similar instances when the Trump administration forced the disappearance of government websites and data.

    1. Finding these kinds of sites can be tough, especially if you’re looking for authentic 1990s sites and not retro callbacks, since Google seems to refuse to show you pages from over 10 years ago.

      I think I've read this bit about Google forgetting from Tim Bray(?) before. Would be useful to have additional back up for it.

      Not being able to rely on Google means that one's on personal repositories of data in their commonplace book becomes far more valuable in the search proposition. This means that Google search is more of a discovery mechanism rather than having the value of the sort of personalized search people may be looking for.

  12. Jun 2021
    1. One thing that should be learned from the bitter lesson is the great power of general purpose methods, of methods that continue to scale with increased computation even as the available computation becomes very great. The two methods that seem to scale arbitrarily in this way are search and learning

      This is a big lesson. As a field, we still have not thoroughly learned it, as we are continuing to make the same kind of mistakes. To see this, and to effectively resist it, we have to understand the appeal of these mistakes. We have to learn the bitter lesson that building in how we think we think does not work in the long run. The bitter lesson is based on the historical observations that 1) AI researchers have often tried to build knowledge into their agents, 2) this always helps in the short term, and is personally satisfying to the researcher, but 3) in the long run it plateaus and even inhibits further progress, and 4) breakthrough progress eventually arrives by an opposing approach based on scaling computation by search and learning. The eventual success is tinged with bitterness, and often incompletely digested, because it is success over a favored, human-centric approach.

  13. May 2021
    1. A relatively comprehensive view of Wouter Groeneveld's commonplacing workflow. There are a few bits missing here and there, but he's got most of the bigger basics down that a majority of people seem to have found and discovered.

      He's got a strong concept of indexing, search, and even some review, which many miss. There's some organic work toward combinatorial thought, but only via the search piece.

      I should make a list of the important pieces for more advanced versions to have. I've yet to see any articles or work on this.

    1. So the proper unit for this kind of exploratory, semantic search is not the file, but rather something else, something I don't quite have a word for: a chunk or cluster of text, something close to those little quotes that I've assembled in DevonThink. If I have an eBook of Manual DeLanda's on my hard drive, and I search for "urban ecosystem" I don't want the software to tell me that an entire book is related to my query. I want the software to tell me that these five separate paragraphs from this book are relevant. Until the tools can break out those smaller units on their own, I'll still be assembling my research library by hand in DevonThink.

      Search on documents returning something in the neighborhood of 500 words or so seems to be the right amount of information. One wants a few paragraphs related to an idea and not an entire book which takes longer to scan.

      Google search does this type of search and it's also what Google Books attempts to do as well when searching specifically there.

    1. An interesting take from a significant modern researcher/writer about commonplaces in the digital era. He's particularly enamoured of the fact that Evernote dovetails with Google searches to show details from his own notebooks which he's saved in the past.

      Search in commonplace books is definitely a must have feature.

    1. I cannot speak for the editor but I don’t see why Searchmysite would not also accept and crawl static HTML websites (ie. Retro or vintage HTML sites) so long as the site has some value and content that it can index, but they might not.

      They certainly could. I've seen the author haunting the IndieWeb chat in the past and they've mentioned that crawling and saving data can tend to be a bit on the expensive side, so they're trying to do more targeted search/save when they can. Perhaps as the project matures, it will add these sorts of functionalities.

  14. Apr 2021
    1. I'm posting this question to help other users, it was not easy to find a useful answer on UNIX SE until I started to type this question.
    1. Exercise can tackle symptoms of schizophrenia

      Not only am I unsurprised by this, but I'd be surprised if it were otherwise. The logic is that schizophrenia is a sleep disorder, and exercise enhances sleep. Additionally, lack of movement is one of the negative symptoms of schizophrenia. Therefore, this poverty of movement may play a role in the pathogenesis of schizophrenia symptoms.

      I need to start a google search document with predictions prior to actually searching. It will slow down my research speed, but it is necessary in order to provide unbiased data on my intuitive understanding of diseases. It seems like the majority of my strong intuitions are true. Edit: I'll just record the search phrase in my hypothes.is notes. This one was "exercise schizophrenia"

  15. Mar 2021
    1. Furthermore, web annotation also affords curation, creating a static but unstable record of this emergent and dynamic performance, accenting via hypertext particular ideas and moments from a malleable document.

      One of the pieces missing from Hypothes.is is the curateable notebook which more easily allows one to create new content from one's annotations.

      Search is certain there, but being able to move the pieces about and re-synthesize them into new emergent pieces is the second necessary step.

    1. I searched for a replacement, but the list of plug-ins had 5000 items and the search function couldn't find anything of the same kind...
    1. [...document.querySelectorAll("*")].filter(e => e.childNodes && [...e.childNodes].find(n => n.nodeValue?.match("❤")))
    2. function contains(selector, text) { var elements = document.querySelectorAll(selector); return [].filter.call(elements, function(element){ return RegExp(text).test(element.textContent); }); }
    1. Fexeel ba kër gi bañ ñàkk alkol.

      Veille à ce qu'il ne manque pas d'alcool à la maison.

      fexe+el (fexe) v. -- search/seek by all means.

      ba -- the (?).

      kër gi -- house; family.

      gi -- the (indicates nearness).

      bañ v. -- refuse, resist, refuse to; to hate; verb marking the negation in subordinate clauses.

      ñàkk v. / ñàkk bi -- vaccinate / vaccine (not sure exactly how this fits in the sentence if it's even the right translation -- perhaps it has to do with surgical alcohol rather than drinking alcohol).

      alkol ji -- (French) surgical alcohol. (I'm certain this is also used for the type of alcohol you drink -- but sangara is probably the most used term).

      https://www.youtube.com/watch?v=SsUjvAItysA

    1. So when I’m searching for information in this space, I’m much less interested in asking “what is this thing?” than I am in asking “what do the people who know a lot about this thing think about it?” I want to read what Vitalik Buterin has recently proposed regarding Ethereum scalability, not rote definitions of Layer 2 scaling solutions. Google is extraordinarily good at answering the “what is this thing?” question. It’s less good at answering the “what do the people who know about the thing think about it?” question. Why? 

      According to Devin Google is good at answering a question such as "what is this thing?", but not good at answering a questions "what do people who know a lot about this thing say about it?"

      This reminds me of social search

    1. a data donation platform that allows users of browsers to donate data on their usage of specific services (eg Youtube, or Facebook) to a platform.

      This seems like a really promising pattern for many data-driven problems. Browsers can support opt-in donation to contribute their data to improve Web search, social media, recommendations, lots of services that implicitly require lots of operational data.

  16. Feb 2021
    1. Along the right side of the page, we have short snippets of text written by five advertisers, mostly journalism schools as it happens, though they are in a silent competition with other snippets of text created by other advertisers bidding to be on this page.

      Reframing:

      SEO is really just various commonplace books competing to be the most important zettels in the world for particular taxonomies.

    2. What you see on this page is, in a very real sense, textual play: the recombining of words into new forms and associations that their original creators never dreamed of.

      What if we add in a dose of Llull's combinatorial thought to the idea of a search engine? What if the search engine can remember the top 50 categories in my personal commonplace book and show overlapping searches of those terms? What if it's even more combinatorial and randomly chooses overlaps from any words in my commonplace? Is that more or less valuable as an idea generator?

      Is it more fruitful to randomly work on various entries every day in an effort to tie them into our other thoughts?

    3. What I want to suggest to you is that, in some improbable way, this page is as much of an heir to the structure of a commonplace book as the most avant-garde textual collage. Who is the “author” of this page? There are, in all likelihood, thousands of them. It has been constructed, algorithmically, by remixing small snippets of text from diverse sources, with diverse goals, and transformed into something categorically different and genuinely valuable. In the center column, we have short snippets of text written by ten individuals or groups, though of course, Google reports that it has 32 million more snippets to survey if we want to keep clicking. The selection of these initial ten links is itself dependant on millions of other snippets of text that link to these and other journalism-related pages on the Web.

      Google search is just an algorithmic search version of John Locke's commonplace book index iterated across millions of individual commonplace books.

    4. In a certain sense, this is a search algorithm, a defined series of steps that allows the user to index the text in a way that makes it easier to query.

      Indices are simply a physical manifestation of metadata upon which we built a rudimentary search algorithm.

  17. Jan 2021
    1. hundreds of these free little apps

      Whoa! Super organized site with lots of ways to find these little apps.

  18. Dec 2020
  19. Nov 2020
    1. At the same time, use of the web is now ubiquitous,and ”Google” is a verb. With the advent of search en-gines, users have learned to find data by describing whatthey want (e.g., various characteristics of a photo) insteadof where it lives (i.e., the full pathname of the photo inthe filesystem). This can be seen in the popularity ofsearch as a modern desktop paradigm in such products asWindows Desktop Search (WDS) [26]; MacOS X Spot-light [21], which fully integrates search with the Mac-intosh journaled HFS+ file system [7]; and the variousdesktop search engines for Linux [4, 27]. Indeed, MacOS X in particular goes one step further and exports APIsto developers allowing applications to directly access themeta-data store and content index.

      With the advent of search engines, search as a paradigm for retrieving files has become ubiquitous.

  20. Oct 2020
    1. Most people seem to follow one of two strategies - and these strategies come under the umbrella of tree-traversal algorithms in computer science.

      Deciding whether you want to go deep into one topic, or explore more topics, can be seen as a choice between two types of tree-traversal algorithms: depth-first and breadth-first.

      This also reminds me of the Explore-Exploit problem in machine learning, which I believe is related to the Multi-Armed Bandit Problem.

    1. I don’t want to build yet another Podcast player app. I don’t want to trap listeners to Listen Notes. You come to Listen Notes and find the Podcasts or Podcast Episodes that you want to listen, then you leave Listen Notes to use your favorite Podcast player app to listen. Under this principle, Listen Notes shows RSS & brings traffic back to official websites of Podcasts. Many Podcast-related sites don’t show RSS, because they want to build a walled garden to make visitors stay there as long as possible.
    1.  I often find myself saying, “I saved something about that…somewhere…<img class="emoji" role="img" draggable="false" src="https://s.w.org/images/core/emoji/12.0.0-1/svg/1f914.svg" alt="🤔">“ Here’s what that looks like:  <img width="777" height="468" src="http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2.jpg" class="attachment-large size-large" alt="Roam Digital Garden Second Brain" srcset="http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2.jpg 777w, http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2-300x181.jpg 300w, http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2-768x463.jpg 768w" sizes="(max-width: 777px) 100vw, 777px">

      I have had this issue with my data spread over multiple sites and services and not easily searchable.

      Unless one can buy into a single system, it may be an eternal problem...

    1. Mastodon deliberately does not support arbitrary search. If someone wants their message to be discovered, they can use a hashtag, which can be browsed. What does arbitrary search accomplish? People and brands search for their own name to self-insert into conversations they were not invited to. What you can do, however, is search messages you posted, received or favourited. That way you can find that one message on the tip of your tongue.
  21. Sep 2020
    1. Leuker, C., Hertwig, R., Gumenik, K., Eggeling, L. M., Hechtlinger, S., Kozyreva, A., Samaan, L., & Fleischhut, N. (2020). Wie informiert sich die Bevölkerung in Deutschland rund um das Coronavirus? Umfrage zu vorherrschenden Themen und Gründen, dem Umgang mit Fehlinformationen, sowie der Risikowahrnehmung und dem Wissen der Bevölkerung rund um das Coronavirus (Version 5, p. 966670) [Application/pdf]. Max-Planck-Institut für Bildungsforschung. https://doi.org/10.17617/2.3247925

    1. Bavadekar, Shailesh, Andrew Dai, John Davis, Damien Desfontaines, Ilya Eckstein, Katie Everett, Alex Fabrikant, et al. ‘Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (Version 1.0)’. ArXiv:2009.01265 [Cs], 2 September 2020. http://arxiv.org/abs/2009.01265.

  22. Aug 2020
    1. Research shows that people are highly likely to revisit information they have viewed in the past and to re-issue queries that they have written in the past (Jones et al., 2002, Milic-Frayling et al., 2004). In one large study, 40% of people's search results clicks were on pages that they had clicked on before over the course of a year, with 71% of these using the identical query string as before (Teevan et al., 2006a). In a survey associated with this study, 17% of interviewees reported “not being able to return to a page I once visited” as one of the “biggest problems in using the web.” Therefore, allowing search over recently viewed information can improve a user's productivity (Dumais et al., 2003). Web browsers, as opposed to search engines, can provide much of this functionality. For example, the Chrome Web browser supports information revisiting by showing a grid of thumbnail images representing a user's most frequently visited web pages, and the drop-down menu from the many browser Web address bars shows recently visited pages. Search engines themselves can provide query history, as well as history of previously selected pages if the user agrees to having that information recorded. The PubMed bioscience journal service shows recently issued queries and visited documents in a simple history display (see Figure 1.6). Similarly, many shopping Web site show recently viewed items in a graphical form. Thumbnail images have also been experimented with in search results listing, both for reminding searchers of previously visited pages and for suggesting information about the hit, such as its genre.
    1. Look, nobody was flipping through pages on your blog to find anything anyway, so it's fine.

      so true...

      but I do search on my own site frequently...

    1. What is Search, Anyway?Steam Search does more than just looking up games; it's a powerful tool that drives many of our discoverability features, including Top Sellers and Specials pages. Today's features are available anywhere the Search tool is used across the store.
  23. Jul 2020
  24. Jun 2020
    1. ow many hours did you spend customizing, configuring, or otherwise optimizing your site search in the past month?
  25. May 2020
    1. Cloze deletion is, of course, just a fancy way of saying fill in the blank. This might sound trivial, but the simple act forces you to consider the surrounding context and search your mind for an answer. This, in turn, is scientifically proven to form stronger memories enabling you to remember profoundly more of what you've read.
  26. Apr 2020
  27. Mar 2020
    1. the link operator was only designed to return a small sampling of backlinks to prevent SEOs from reverse engineering another site’s rankings
    2. To find a sampling of links to any site, you can perform a Google search using the link: operator. For instance, [link:www.google.com] will list a selection of the web pages that have links pointing to the Google home page
    1. Yes, it’s been deprecated. Why? Because too few people were using it to make it worth the time, money, and energy to maintain. In truth, although I sometimes disagree with the operator changes, I happen to agree with this one. Maintaining ALL of the synonyms takes real time and costs us real money. Supporting this operator also increases the complexity of the code base. By dropping support for it we can free up a bunch of resources that can be used for other, more globally powerful changes.
    1. The <meta name="robots" content="noindex" /> tag or directive applies to search engine crawlers. To block non-search crawlers, such as AdsBot-Google, you might need to add directives targeted to the specific crawler
    1. For the noindex directive to be effective, the page must not be blocked by a robots.txt file. If the page is blocked by a robots.txt file, the crawler will never see the noindex directive, and the page can still appear in search results, for example if other pages link to it.
  28. Jan 2020
    1. Now, Google has to change its practices and prompt users to choose their own default search engine when setting up a European Android device that has the Google Search app built in. Not all countries will have the same options, however, as the choices included in Google’s new prompt all went to the highest bidders.As it turns out, DuckDuckGo must have bid more aggressively than other Google competitors, as it’s being offered as a choice across all countries in the EU.
  29. Dec 2019
    1. Note, too, the implications extending from Bush's mention of one user duplicating a trail (a portion of his structure) and giving it to a friend who can put it into his Memex and integrate it into his own trail (structure).
  30. Aug 2019
  31. Jul 2019
    1. I do not actually know of a real findability index, but tools in the field of information retrieval could be applied to develop one. One of the unsolved problems in the field is how to help the searcher to determine if the information simply is not available.
    2. Although some have written about information overload, data smog, and the like, my view has always been the more information online, the better, so long as good search tools are available. Sometimes this information is found by directed search using a web search engine, sometimes by serendipty by following links, and sometimes by asking hundreds of people in our social network or hundreds of thousands of people on a question answering website such as Answers.com, Quora, or Yahoo Answer
    1. Compared with neural networks configured by a pure grid search,we find that random search over the same domain is able to find models that are as good or betterwithin a small fraction of the computation time.
  32. citeseerx.ist.psu.edu citeseerx.ist.psu.edu
    1. faceted sea

      it's a technique which involves augmenting traditional search techniques with a faceted navigation system, allowing users to narrow down search results by applying multiple filters based on faceted classification of the items

  33. Jun 2019
  34. mitpressonpubpub.mitpress.mit.edu mitpressonpubpub.mitpress.mit.edu
    1. This is why annotation matters.

      Google has accelerated this by using search to better link pieces of knowledge in the modern world, but scholars have been linking thoughts manually for centuries.

  35. May 2019
    1. mightthosewhoareintheminorityeverbeabletoinfluenceorcontrolthewaytheyarerepresentedinasearchengine?

      If the majority rules search engine results, the majority could also rule over the content. if the majority of the online population are sexist, then the majority of the results when searching even a simple/general term such as "cars" could be more searched by men, but also have more content created by men, especially men who are "middle class" or higher who could afford to purchase a car and own a technology capable of using a search engine.

      It makes me think... What searches would come from people who were struggling financially and did not usually have access to a computer? What would they search first? And how would that impact the popular searches and content?

      The majority of Canada's populace are not low income, but what would happen to results in other countries (or even ours)where low income is the majority, and they were all given access to search technologies?

  36. Feb 2019
  37. Oct 2018
  38. Sep 2018
    1. Then, venturing further into the store: this is what we happen to have.

      I'm also reminded here of the idea of serendipity. Perhaps I go into a library looking for a specific topic and browse to that section via Dewey decimal. What about the serendipity of finding something interesting in that same section (or even nearby sections) that I wasn't specifically looking for?

      Google's search ranking rarely if ever unearths this sort of serendipitous material.

  39. Jul 2018
    1. A good resource to keep in mind for future lessons:) Found this website using a custom google search engine

    1. Using information technology has become an important skill for students and employees. As a teacher wishing to use the Internet your options are typically to either provide students with specific links or have them “Google” to find information on the Internet. Using Google can yield interesting and unexpected results. Creating a list of specific links is time consuming and does not teach the students how to search the web.

      This is a good point- typically teachers either give students a list of links or let them use google free reign. Creating a custom google search engine for the class may help

  40. Jun 2018
    1. (Remember when every news site published the piece, “What Time Is the Super Bowl?”)

      This is a great instance for Google's box that simply provides the factual answer instead of requiring a click through.

    1. "Information retrieval is not about finding how much tannin there is in an apple," he declared in his San Francisco office. "It's about letting everyone publish." With that, he was off on a long rant about how organizing the Web matters, because, as Architext's Spencer had told me, "it's about people finding people, not people finding information." Indexing the Web allows the 40 people interested in Bulgarian folksinging to find each other, it allows fans of long-forgotten TV shows to get together and reminisce. It creates communities.
  41. May 2018
    1. Ironically, DWYL reinforces exploitation even within the so-called lovable professions where off-the-clock, underpaid, or unpaid labor is the new norm:

      Doing what you love, isolates and degrades other workforces and elevates others and more so, the ones of higher economic class. One should be paid fair dues as per their work and have good working conditions. She wants people to realize that they deserve goods jobs and they should never settle for less in the name of doing what they love.

    2. limits

      The article argues different theories regarding doing what is right, with the bigger question being is it wrong for people to strive to be able to do what they love. But, at the same time, what is the limit of people seeking what they love without overstepping the boundaries?

  42. Apr 2018
    1. It’s striking that we likely have author information provided as metadata on the majority of articles published today, but almost none of our reading tools expose this information in useful ways, or let us search or explore using the metadata.

      This is a really really important point.

  43. Mar 2018
  44. Feb 2018
  45. Dec 2017
    1. Still at a starting point, we have neither ‘advanced’ nor remained unmoved.

      The cover of Ken Macrorie's textbook, The I-Search Paper, has a moebius strip on it, reflecting this sentiment nicely.<br>

      But the conceit of the moebius strip extends from the back cover as well

      So, you start reading this book on the back cover, nice twist, just like in a moebius strip. You work through the text by doing. Then when you are done, you get to the end and out the back of the book you go onto the back cover, to enter again. Escher would approve. That is what I see happening here. In the end this process serves the product, the text that is produced for anyone concerned.

  46. Nov 2017
    1. having students do a basic Google image search for terms like “doctor” “teacher” “baby”

      It may sound obvious but it actually works. Just did it with each of these three words (on DuckDuckGo) and the results, though unsurprising, bring home the point. Tried switching on the Canadian filter, to check if their might be a difference, and it mostly reorders the results, for some reason. Also tried “student” and “musician” which provide an interesting contrast. Doing this exercise in class, would probably start by asking learners to write down what they expect to get. (Might even do it in my applied anthro class, tomorrow.)

    2. In this particular case, Google worked as a kind of amplifier of distortion.
    1. As an index, people have different expectations on search result neutrality. Some want Google Search to be entirely neutral, some demand immediate action to remove some results. The European Union has both demanded GOOG to comply with removal requests, and fined GOOG for not being neutral in shopping queries. It is not beneficial for GOOG to assume the role of an impartial arbiter of content, since it’s not supporting their business model. Quite the contrary, they are under public scrutiny from multiple governments, potentially risking their reputation.