416 Matching Annotations
  1. Jan 2022
    1. But Google also uses optical character recognition to produce a second version, for its search engine to use, and this double process has some quirks. In a scriptorium lit by the sun, a scribe could mistakenly transcribe a “u” as an “n,” or vice versa. Curiously, the computer makes the same mistake. If you enter qualitas—an important term in medieval philosophy—into Google Book Search, you’ll find almost two thousand appearances. But if you enter “qnalitas” you’ll be rewarded with more than five hundred references that you wouldn’t necessarily have found.

      I wonder how much Captcha technology may have helped to remedy this in the intervening years?

    1. Serious reading will require just as much effort as it has always required.

      Reading is hard to disrupt.

      Speeding up and dramatically improving the reading process is incredibly difficult. No one has yet made really huge strides in this space. Google has made it imminently more accessible to the masses, but it still requires a lot of physical work and processing on our part.

    1. The Notetab-Zettelkasten has several major advantages over the paper-implementation: 1. It is much more difficult to misplace slips 1. It has a powerful search function

      Most digital note taking systems have two major advantages of paper versions:

      • It's harder to misplace material unless one's system has major flaws or one accidentally deletes content
      • Digital search is far more powerful and efficient than manual search
  2. Dec 2021
  3. Nov 2021
    1. I also did a bit of web and JSTOR research, and started a new Zotero folder called World History Comparison. Research Rabbit found a bunch of similar titles, but it will be a while before I can get to many of them. I DID, however, ask some people and groups such as the OE Global community on Twitter, and I want to extend that request to anyone who watches this video. I know a number of my subs and viewers are in India and I've noticed on Twitter and on Abhijit Chavda's channel that there's quite a bit of controversy about the way Indian History is taught to Indian students.

      Methods for attacking a research problem about history used here:

      • Web research
      • Journal database research
      • Zotero reference manager stub
      • Research Rabbit (AI search)
      • Reach out on various social media channels

      Not mentioned, but perhaps useful:

      • Standard library search (WorldCat)
      • Internet Archive search (scanned historical textbooks)
      • Off-label and dark web services (Library Genesis, Pirate Bay, etc.)
      • Open access and OER sources (this will probably find newer perspectives and newer texts which sometimes have philosophical outlines of what they're trying to change for the future versus the pedagogies of the past)
      • Current curricula and recommended textbooks at major universities on particular books and potential comparisons to those of the past (perhaps via Internet Archive).
    1. A fluorescence of note taking tools

      What is missing in this train of thought is search. The real challenge is recalling of information easily, whether that is traditional search or something more AI-ish that can uncover connections between items that I don't see myself. What I want is a tool that can search across all my repositories, and that requires either APIs for communication or standard data storage formats. I prefer APis.

    1. But dig into the story of anyone who has been a genuine victim of modern mob justice and you will often find not an obvious argument between “woke” and “anti-woke” perspectives but rather incidents that are interpreted, described, or remembered by different people in different ways, even leaving aside whatever political or intellectual issue might be at stake.

      Cancel culture and modern mob justice are possible as the result of volumes of more detail and data as well as large doses of context collapse.

      In some cases, it's probably justified to help level the playing field for those in power who are practicing hypocrisy, but in others, it's simply a lack of context by broader society who have kneejerk reactions which have the ability to be "remembered" by broader society with search engines.

      How might Google allow the right to forget to serve as a means of restorative justice?

    1. wn written cultures material is typically sorted alphabeticallySor by some other method of linguistic ordering such as the number ofstrokes in qhinese charactersTW or systematicallyW according to various sysXtems that strive to map or hierarchize the relations between the items storedSincluding those of uoogle or ffiahooTW or miscellaneouslyY

      What about the emergence of non-hierarchal methods? (Can these logically be sorted somehow without this structure?)

      With digital commonplacing methods, I find that I can sort and search for things temporally by date and time as well as by tag/heading.

      Cross reference:

  4. Oct 2021
  5. Sep 2021
    1. paradigmatic

      typical answer to something

    2. elucidatory

      give a clarifying expression

    3. Pragmatic

      to deal with something in a practical non-theoretical way.

    4. elucidation.

      another word for clarification

    1. The question is similar but its in a Rails context. The solutions would answer my question, but I'm almost certain that he could probably leverage Arel to solve his problem. The question I posted was designed purely as a Ruby question so that it was easier to search for. You might want to suggest an edit of the title of his question because it didn't show up when I searched for a solution to my problem.
  6. Aug 2021
  7. Jul 2021
    1. The idea behind the reference interview is for the librarian to respond to questions by asking questions in order to supply the best possible resource. Frequently, this is really useful because people will ask for the kind of information they think the librarian can find, rather than what they actually want.

      This is closely related to solving a particular well-defined problem rather than solving a more general problem.

    1. Ebooks don’t have those limitations, both because of how readily new editions can be created and how simple it is to push “updates” to existing editions after the fact. Consider the experience of Philip Howard, who sat down to read a printed edition of War and Peace in 2010. Halfway through reading the brick-size tome, he purchased a 99-cent electronic edition for his Nook e-reader:As I was reading, I came across this sentence: “It was as if a light had been Nookd in a carved and painted lantern …” Thinking this was simply a glitch in the software, I ignored the intrusive word and continued reading. Some pages later I encountered the rogue word again. With my third encounter I decided to retrieve my hard cover book and find the original (well, the translated) text. For the sentence above I discovered this genuine translation: “It was as if a light had been kindled in a carved and painted lantern …”A search of this Nook version of the book confirmed it: Every instance of the word kindle had been replaced by nook, in perhaps an attempt to alter a previously made Kindle version of the book for Nook use. Here are some screenshots I took at the time:It is only a matter of time before the retroactive malleability of these forms of publishing becomes a new area of pressure and regulation for content censorship. If a book contains a passage that someone believes to be defamatory, the aggrieved person can sue over it—and receive monetary damages if they’re right. Rarely is the book’s existence itself called into question, if only because of the difficulty of putting the cat back into the bag after publishing.

      This story of find and replace has chilling future potential. What if a dictatorial government doesn't like your content. It can be all to easy to remove the digital versions and replace them whole hog for "approved" ones.

      Where does democracy live in such a world? Consider similar instances when the Trump administration forced the disappearance of government websites and data.

    1. Finding these kinds of sites can be tough, especially if you’re looking for authentic 1990s sites and not retro callbacks, since Google seems to refuse to show you pages from over 10 years ago.

      I think I've read this bit about Google forgetting from Tim Bray(?) before. Would be useful to have additional back up for it.

      Not being able to rely on Google means that one's on personal repositories of data in their commonplace book becomes far more valuable in the search proposition. This means that Google search is more of a discovery mechanism rather than having the value of the sort of personalized search people may be looking for.

  8. Jun 2021
    1. One thing that should be learned from the bitter lesson is the great power of general purpose methods, of methods that continue to scale with increased computation even as the available computation becomes very great. The two methods that seem to scale arbitrarily in this way are search and learning

      This is a big lesson. As a field, we still have not thoroughly learned it, as we are continuing to make the same kind of mistakes. To see this, and to effectively resist it, we have to understand the appeal of these mistakes. We have to learn the bitter lesson that building in how we think we think does not work in the long run. The bitter lesson is based on the historical observations that 1) AI researchers have often tried to build knowledge into their agents, 2) this always helps in the short term, and is personally satisfying to the researcher, but 3) in the long run it plateaus and even inhibits further progress, and 4) breakthrough progress eventually arrives by an opposing approach based on scaling computation by search and learning. The eventual success is tinged with bitterness, and often incompletely digested, because it is success over a favored, human-centric approach.

  9. May 2021
    1. A relatively comprehensive view of Wouter Groeneveld's commonplacing workflow. There are a few bits missing here and there, but he's got most of the bigger basics down that a majority of people seem to have found and discovered.

      He's got a strong concept of indexing, search, and even some review, which many miss. There's some organic work toward combinatorial thought, but only via the search piece.

      I should make a list of the important pieces for more advanced versions to have. I've yet to see any articles or work on this.

    1. So the proper unit for this kind of exploratory, semantic search is not the file, but rather something else, something I don't quite have a word for: a chunk or cluster of text, something close to those little quotes that I've assembled in DevonThink. If I have an eBook of Manual DeLanda's on my hard drive, and I search for "urban ecosystem" I don't want the software to tell me that an entire book is related to my query. I want the software to tell me that these five separate paragraphs from this book are relevant. Until the tools can break out those smaller units on their own, I'll still be assembling my research library by hand in DevonThink.

      Search on documents returning something in the neighborhood of 500 words or so seems to be the right amount of information. One wants a few paragraphs related to an idea and not an entire book which takes longer to scan.

      Google search does this type of search and it's also what Google Books attempts to do as well when searching specifically there.

    1. An interesting take from a significant modern researcher/writer about commonplaces in the digital era. He's particularly enamoured of the fact that Evernote dovetails with Google searches to show details from his own notebooks which he's saved in the past.

      Search in commonplace books is definitely a must have feature.

    1. I cannot speak for the editor but I don’t see why Searchmysite would not also accept and crawl static HTML websites (ie. Retro or vintage HTML sites) so long as the site has some value and content that it can index, but they might not.

      They certainly could. I've seen the author haunting the IndieWeb chat in the past and they've mentioned that crawling and saving data can tend to be a bit on the expensive side, so they're trying to do more targeted search/save when they can. Perhaps as the project matures, it will add these sorts of functionalities.

  10. Apr 2021
    1. I'm posting this question to help other users, it was not easy to find a useful answer on UNIX SE until I started to type this question.
    1. Exercise can tackle symptoms of schizophrenia

      Not only am I unsurprised by this, but I'd be surprised if it were otherwise. The logic is that schizophrenia is a sleep disorder, and exercise enhances sleep. Additionally, lack of movement is one of the negative symptoms of schizophrenia. Therefore, this poverty of movement may play a role in the pathogenesis of schizophrenia symptoms.

      I need to start a google search document with predictions prior to actually searching. It will slow down my research speed, but it is necessary in order to provide unbiased data on my intuitive understanding of diseases. It seems like the majority of my strong intuitions are true. Edit: I'll just record the search phrase in my hypothes.is notes. This one was "exercise schizophrenia"

  11. Mar 2021
    1. Furthermore, web annotation also affords curation, creating a static but unstable record of this emergent and dynamic performance, accenting via hypertext particular ideas and moments from a malleable document.

      One of the pieces missing from Hypothes.is is the curateable notebook which more easily allows one to create new content from one's annotations.

      Search is certain there, but being able to move the pieces about and re-synthesize them into new emergent pieces is the second necessary step.

    1. I searched for a replacement, but the list of plug-ins had 5000 items and the search function couldn't find anything of the same kind...
    1. [...document.querySelectorAll("*")].filter(e => e.childNodes && [...e.childNodes].find(n => n.nodeValue?.match("❤")))
    2. function contains(selector, text) { var elements = document.querySelectorAll(selector); return [].filter.call(elements, function(element){ return RegExp(text).test(element.textContent); }); }
    1. Fexeel ba kër gi bañ ñàkk alkol.

      Veille à ce qu'il ne manque pas d'alcool à la maison.

      fexe+el (fexe) v. -- search/seek by all means.

      ba -- the (?).

      kër gi -- house; family.

      gi -- the (indicates nearness).

      bañ v. -- refuse, resist, refuse to; to hate; verb marking the negation in subordinate clauses.

      ñàkk v. / ñàkk bi -- vaccinate / vaccine (not sure exactly how this fits in the sentence if it's even the right translation -- perhaps it has to do with surgical alcohol rather than drinking alcohol).

      alkol ji -- (French) surgical alcohol. (I'm certain this is also used for the type of alcohol you drink -- but sangara is probably the most used term).

      https://www.youtube.com/watch?v=SsUjvAItysA

    1. So when I’m searching for information in this space, I’m much less interested in asking “what is this thing?” than I am in asking “what do the people who know a lot about this thing think about it?” I want to read what Vitalik Buterin has recently proposed regarding Ethereum scalability, not rote definitions of Layer 2 scaling solutions. Google is extraordinarily good at answering the “what is this thing?” question. It’s less good at answering the “what do the people who know about the thing think about it?” question. Why? 

      According to Devin Google is good at answering a question such as "what is this thing?", but not good at answering a questions "what do people who know a lot about this thing say about it?"

      This reminds me of social search

    1. a data donation platform that allows users of browsers to donate data on their usage of specific services (eg Youtube, or Facebook) to a platform.

      This seems like a really promising pattern for many data-driven problems. Browsers can support opt-in donation to contribute their data to improve Web search, social media, recommendations, lots of services that implicitly require lots of operational data.

  12. Feb 2021
    1. Along the right side of the page, we have short snippets of text written by five advertisers, mostly journalism schools as it happens, though they are in a silent competition with other snippets of text created by other advertisers bidding to be on this page.

      Reframing:

      SEO is really just various commonplace books competing to be the most important zettels in the world for particular taxonomies.

    2. What you see on this page is, in a very real sense, textual play: the recombining of words into new forms and associations that their original creators never dreamed of.

      What if we add in a dose of Llull's combinatorial thought to the idea of a search engine? What if the search engine can remember the top 50 categories in my personal commonplace book and show overlapping searches of those terms? What if it's even more combinatorial and randomly chooses overlaps from any words in my commonplace? Is that more or less valuable as an idea generator?

      Is it more fruitful to randomly work on various entries every day in an effort to tie them into our other thoughts?

    3. What I want to suggest to you is that, in some improbable way, this page is as much of an heir to the structure of a commonplace book as the most avant-garde textual collage. Who is the “author” of this page? There are, in all likelihood, thousands of them. It has been constructed, algorithmically, by remixing small snippets of text from diverse sources, with diverse goals, and transformed into something categorically different and genuinely valuable. In the center column, we have short snippets of text written by ten individuals or groups, though of course, Google reports that it has 32 million more snippets to survey if we want to keep clicking. The selection of these initial ten links is itself dependant on millions of other snippets of text that link to these and other journalism-related pages on the Web.

      Google search is just an algorithmic search version of John Locke's commonplace book index iterated across millions of individual commonplace books.

    4. In a certain sense, this is a search algorithm, a defined series of steps that allows the user to index the text in a way that makes it easier to query.

      Indices are simply a physical manifestation of metadata upon which we built a rudimentary search algorithm.

  13. Jan 2021
    1. hundreds of these free little apps

      Whoa! Super organized site with lots of ways to find these little apps.

  14. Dec 2020
  15. Nov 2020
    1. At the same time, use of the web is now ubiquitous,and ”Google” is a verb. With the advent of search en-gines, users have learned to find data by describing whatthey want (e.g., various characteristics of a photo) insteadof where it lives (i.e., the full pathname of the photo inthe filesystem). This can be seen in the popularity ofsearch as a modern desktop paradigm in such products asWindows Desktop Search (WDS) [26]; MacOS X Spot-light [21], which fully integrates search with the Mac-intosh journaled HFS+ file system [7]; and the variousdesktop search engines for Linux [4, 27]. Indeed, MacOS X in particular goes one step further and exports APIsto developers allowing applications to directly access themeta-data store and content index.

      With the advent of search engines, search as a paradigm for retrieving files has become ubiquitous.

  16. Oct 2020
    1. Most people seem to follow one of two strategies - and these strategies come under the umbrella of tree-traversal algorithms in computer science.

      Deciding whether you want to go deep into one topic, or explore more topics, can be seen as a choice between two types of tree-traversal algorithms: depth-first and breadth-first.

      This also reminds me of the Explore-Exploit problem in machine learning, which I believe is related to the Multi-Armed Bandit Problem.

    1. I don’t want to build yet another Podcast player app. I don’t want to trap listeners to Listen Notes. You come to Listen Notes and find the Podcasts or Podcast Episodes that you want to listen, then you leave Listen Notes to use your favorite Podcast player app to listen. Under this principle, Listen Notes shows RSS & brings traffic back to official websites of Podcasts. Many Podcast-related sites don’t show RSS, because they want to build a walled garden to make visitors stay there as long as possible.
    1.  I often find myself saying, “I saved something about that…somewhere…<img class="emoji" role="img" draggable="false" src="https://s.w.org/images/core/emoji/12.0.0-1/svg/1f914.svg" alt="🤔">“ Here’s what that looks like:  <img width="777" height="468" src="http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2.jpg" class="attachment-large size-large" alt="Roam Digital Garden Second Brain" srcset="http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2.jpg 777w, http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2-300x181.jpg 300w, http://bjosephburch.com/wp-content/uploads/2020/04/secondbrain-2-768x463.jpg 768w" sizes="(max-width: 777px) 100vw, 777px">

      I have had this issue with my data spread over multiple sites and services and not easily searchable.

      Unless one can buy into a single system, it may be an eternal problem...

    1. Mastodon deliberately does not support arbitrary search. If someone wants their message to be discovered, they can use a hashtag, which can be browsed. What does arbitrary search accomplish? People and brands search for their own name to self-insert into conversations they were not invited to. What you can do, however, is search messages you posted, received or favourited. That way you can find that one message on the tip of your tongue.
  17. Sep 2020
    1. Leuker, C., Hertwig, R., Gumenik, K., Eggeling, L. M., Hechtlinger, S., Kozyreva, A., Samaan, L., & Fleischhut, N. (2020). Wie informiert sich die Bevölkerung in Deutschland rund um das Coronavirus? Umfrage zu vorherrschenden Themen und Gründen, dem Umgang mit Fehlinformationen, sowie der Risikowahrnehmung und dem Wissen der Bevölkerung rund um das Coronavirus (Version 5, p. 966670) [Application/pdf]. Max-Planck-Institut für Bildungsforschung. https://doi.org/10.17617/2.3247925

    1. Bavadekar, Shailesh, Andrew Dai, John Davis, Damien Desfontaines, Ilya Eckstein, Katie Everett, Alex Fabrikant, et al. ‘Google COVID-19 Search Trends Symptoms Dataset: Anonymization Process Description (Version 1.0)’. ArXiv:2009.01265 [Cs], 2 September 2020. http://arxiv.org/abs/2009.01265.

  18. Aug 2020
    1. Research shows that people are highly likely to revisit information they have viewed in the past and to re-issue queries that they have written in the past (Jones et al., 2002, Milic-Frayling et al., 2004). In one large study, 40% of people's search results clicks were on pages that they had clicked on before over the course of a year, with 71% of these using the identical query string as before (Teevan et al., 2006a). In a survey associated with this study, 17% of interviewees reported “not being able to return to a page I once visited” as one of the “biggest problems in using the web.” Therefore, allowing search over recently viewed information can improve a user's productivity (Dumais et al., 2003). Web browsers, as opposed to search engines, can provide much of this functionality. For example, the Chrome Web browser supports information revisiting by showing a grid of thumbnail images representing a user's most frequently visited web pages, and the drop-down menu from the many browser Web address bars shows recently visited pages. Search engines themselves can provide query history, as well as history of previously selected pages if the user agrees to having that information recorded. The PubMed bioscience journal service shows recently issued queries and visited documents in a simple history display (see Figure 1.6). Similarly, many shopping Web site show recently viewed items in a graphical form. Thumbnail images have also been experimented with in search results listing, both for reminding searchers of previously visited pages and for suggesting information about the hit, such as its genre.
    1. Look, nobody was flipping through pages on your blog to find anything anyway, so it's fine.

      so true...

      but I do search on my own site frequently...

    1. What is Search, Anyway?Steam Search does more than just looking up games; it's a powerful tool that drives many of our discoverability features, including Top Sellers and Specials pages. Today's features are available anywhere the Search tool is used across the store.
  19. Jul 2020
  20. Jun 2020
    1. ow many hours did you spend customizing, configuring, or otherwise optimizing your site search in the past month?
  21. May 2020
    1. Cloze deletion is, of course, just a fancy way of saying fill in the blank. This might sound trivial, but the simple act forces you to consider the surrounding context and search your mind for an answer. This, in turn, is scientifically proven to form stronger memories enabling you to remember profoundly more of what you've read.
  22. Apr 2020
  23. Mar 2020
    1. the link operator was only designed to return a small sampling of backlinks to prevent SEOs from reverse engineering another site’s rankings
    2. To find a sampling of links to any site, you can perform a Google search using the link: operator. For instance, [link:www.google.com] will list a selection of the web pages that have links pointing to the Google home page
    1. Yes, it’s been deprecated. Why? Because too few people were using it to make it worth the time, money, and energy to maintain. In truth, although I sometimes disagree with the operator changes, I happen to agree with this one. Maintaining ALL of the synonyms takes real time and costs us real money. Supporting this operator also increases the complexity of the code base. By dropping support for it we can free up a bunch of resources that can be used for other, more globally powerful changes.
    1. The <meta name="robots" content="noindex" /> tag or directive applies to search engine crawlers. To block non-search crawlers, such as AdsBot-Google, you might need to add directives targeted to the specific crawler
    1. For the noindex directive to be effective, the page must not be blocked by a robots.txt file. If the page is blocked by a robots.txt file, the crawler will never see the noindex directive, and the page can still appear in search results, for example if other pages link to it.
  24. Jan 2020
    1. Now, Google has to change its practices and prompt users to choose their own default search engine when setting up a European Android device that has the Google Search app built in. Not all countries will have the same options, however, as the choices included in Google’s new prompt all went to the highest bidders.As it turns out, DuckDuckGo must have bid more aggressively than other Google competitors, as it’s being offered as a choice across all countries in the EU.
  25. Dec 2019
    1. Note, too, the implications extending from Bush's mention of one user duplicating a trail (a portion of his structure) and giving it to a friend who can put it into his Memex and integrate it into his own trail (structure).
  26. Aug 2019
  27. Jul 2019
    1. I do not actually know of a real findability index, but tools in the field of information retrieval could be applied to develop one. One of the unsolved problems in the field is how to help the searcher to determine if the information simply is not available.
    2. Although some have written about information overload, data smog, and the like, my view has always been the more information online, the better, so long as good search tools are available. Sometimes this information is found by directed search using a web search engine, sometimes by serendipty by following links, and sometimes by asking hundreds of people in our social network or hundreds of thousands of people on a question answering website such as Answers.com, Quora, or Yahoo Answer
    1. Compared with neural networks configured by a pure grid search,we find that random search over the same domain is able to find models that are as good or betterwithin a small fraction of the computation time.
  28. citeseerx.ist.psu.edu citeseerx.ist.psu.edu
    1. faceted sea

      it's a technique which involves augmenting traditional search techniques with a faceted navigation system, allowing users to narrow down search results by applying multiple filters based on faceted classification of the items

  29. Jun 2019
  30. mitpressonpubpub.mitpress.mit.edu mitpressonpubpub.mitpress.mit.edu
    1. This is why annotation matters.

      Google has accelerated this by using search to better link pieces of knowledge in the modern world, but scholars have been linking thoughts manually for centuries.

  31. May 2019
    1. mightthosewhoareintheminorityeverbeabletoinfluenceorcontrolthewaytheyarerepresentedinasearchengine?

      If the majority rules search engine results, the majority could also rule over the content. if the majority of the online population are sexist, then the majority of the results when searching even a simple/general term such as "cars" could be more searched by men, but also have more content created by men, especially men who are "middle class" or higher who could afford to purchase a car and own a technology capable of using a search engine.

      It makes me think... What searches would come from people who were struggling financially and did not usually have access to a computer? What would they search first? And how would that impact the popular searches and content?

      The majority of Canada's populace are not low income, but what would happen to results in other countries (or even ours)where low income is the majority, and they were all given access to search technologies?

  32. Feb 2019
  33. Oct 2018
  34. Sep 2018
    1. Then, venturing further into the store: this is what we happen to have.

      I'm also reminded here of the idea of serendipity. Perhaps I go into a library looking for a specific topic and browse to that section via Dewey decimal. What about the serendipity of finding something interesting in that same section (or even nearby sections) that I wasn't specifically looking for?

      Google's search ranking rarely if ever unearths this sort of serendipitous material.

  35. Jul 2018
    1. A good resource to keep in mind for future lessons:) Found this website using a custom google search engine

    1. Using information technology has become an important skill for students and employees. As a teacher wishing to use the Internet your options are typically to either provide students with specific links or have them “Google” to find information on the Internet. Using Google can yield interesting and unexpected results. Creating a list of specific links is time consuming and does not teach the students how to search the web.

      This is a good point- typically teachers either give students a list of links or let them use google free reign. Creating a custom google search engine for the class may help

  36. Jun 2018
    1. (Remember when every news site published the piece, “What Time Is the Super Bowl?”)

      This is a great instance for Google's box that simply provides the factual answer instead of requiring a click through.

    1. "Information retrieval is not about finding how much tannin there is in an apple," he declared in his San Francisco office. "It's about letting everyone publish." With that, he was off on a long rant about how organizing the Web matters, because, as Architext's Spencer had told me, "it's about people finding people, not people finding information." Indexing the Web allows the 40 people interested in Bulgarian folksinging to find each other, it allows fans of long-forgotten TV shows to get together and reminisce. It creates communities.
  37. May 2018
    1. Ironically, DWYL reinforces exploitation even within the so-called lovable professions where off-the-clock, underpaid, or unpaid labor is the new norm:

      Doing what you love, isolates and degrades other workforces and elevates others and more so, the ones of higher economic class. One should be paid fair dues as per their work and have good working conditions. She wants people to realize that they deserve goods jobs and they should never settle for less in the name of doing what they love.

    2. limits

      The article argues different theories regarding doing what is right, with the bigger question being is it wrong for people to strive to be able to do what they love. But, at the same time, what is the limit of people seeking what they love without overstepping the boundaries?

  38. Apr 2018
    1. It’s striking that we likely have author information provided as metadata on the majority of articles published today, but almost none of our reading tools expose this information in useful ways, or let us search or explore using the metadata.

      This is a really really important point.

  39. Mar 2018
  40. Feb 2018
  41. Dec 2017
    1. Still at a starting point, we have neither ‘advanced’ nor remained unmoved.

      The cover of Ken Macrorie's textbook, The I-Search Paper, has a moebius strip on it, reflecting this sentiment nicely.<br>

      But the conceit of the moebius strip extends from the back cover as well

      So, you start reading this book on the back cover, nice twist, just like in a moebius strip. You work through the text by doing. Then when you are done, you get to the end and out the back of the book you go onto the back cover, to enter again. Escher would approve. That is what I see happening here. In the end this process serves the product, the text that is produced for anyone concerned.

  42. Nov 2017
    1. having students do a basic Google image search for terms like “doctor” “teacher” “baby”

      It may sound obvious but it actually works. Just did it with each of these three words (on DuckDuckGo) and the results, though unsurprising, bring home the point. Tried switching on the Canadian filter, to check if their might be a difference, and it mostly reorders the results, for some reason. Also tried “student” and “musician” which provide an interesting contrast. Doing this exercise in class, would probably start by asking learners to write down what they expect to get. (Might even do it in my applied anthro class, tomorrow.)

    2. In this particular case, Google worked as a kind of amplifier of distortion.
    1. As an index, people have different expectations on search result neutrality. Some want Google Search to be entirely neutral, some demand immediate action to remove some results. The European Union has both demanded GOOG to comply with removal requests, and fined GOOG for not being neutral in shopping queries. It is not beneficial for GOOG to assume the role of an impartial arbiter of content, since it’s not supporting their business model. Quite the contrary, they are under public scrutiny from multiple governments, potentially risking their reputation.
  43. Oct 2017
    1. People tend to believe what the read on the web, whether it is true or not. Public opinion can easily be swayed with misinformation whether accidental or intentional. One must rely on trustworthy and reliable sources before coming to a conclusion on any subject.

      I agree with Nancy that information people find on the Internet may not be as useful and it can be falsified. An example would be Wikipedia as any person could modify the content of the information in it.

    2. Trends have been incorporated into the search engine rankings that determine what pages rise to the top of the list. The positive impact is search results that show the most likely needed pages at the top of the list. The negative impact is having the algorithm making assumptions of what is wanted.

      I agree with Nancy. These pages that are ranked higher are only based on assumptions from the search algorithm due to the fact that the website are from notable corporations/company and are linked to a lot of other other websites. However, the credibility of its contents may not come from an expert or whatsoever.

    3. Yes, I agree. People are depending too much on the ability to “Google” information instead of learning it. Although being able to search for any information you may need is convenient, knowledge that used to be considered basic need-to-know information is no longer considered so. Take for instance changing a flat tire. How many people under the age of 40 can tell you how to do it, without “Googling” it?

      I completely agree with you. After the popularity of Google rose, the reliance on convenient and fast information increased. Many millennials lack the skill of performing research without Google or an online search engine/

    1. No, it should not. Information is so easily changed, altered, and manipulated by other users that you can never trust a search engine completely. It is important that researchers should never trust only one search engine and get the facts from a few different places.

      I said the same thing. If researchers are getting there information for the internet then we dont know whats true.

    1. I personally delete my search history because I don’t like people knowing my business.

      I understand this and it makes a lot of sense. People think everything that is theres is always safe.

    1. The difference between Figure 4.10 and Figure 4.11 is that Figure 4.10 is uncensored while figure 4.11 is censored. There are differences even though they are both a Google search is because the Chinese government does not want certain information to be released to the public. While from the U.S point of view, nothing is censored because there are certain laws which prevent the government from censoring certain content.

      It is very interesting to have the opportunity to compare and contrast what online searches look like online in different parts of the world. It is unfortunate to see that places like China do not have the freedoms of accessing any information that they are curious about, however, it should inspire us to take advantage of what liberties we have on the Internet.

    1. I agree, people at this day in age are turning to the internet for information. We rather look something up on the internet than going to the library to find out more information about something. Searching things via the Internet is our future.

      I also agree with what Caroline had to say about the claim of search engines changing the way we think. With information becoming so easily accessible, people can be seen as getting lazier with accessing information.

    1. I agree. People now depend on the Internet for every little tidbit of information. They also assume that the information is always correct. This changes how we access information (through computers and phones instead of books and encyclopedias) and how we perceive it (less emphasis on source and more emphasis on it’s location on the web pages).

      I agree, the internet now a days is being used in so many different situations. No one is going to just go to the library in order to find out more information about something when they can just look it up on the internet in a quick second. The ability of searching a lot of information in a short period of time and having it in one place is very useful. People do not need to look for hours upon hours to find one concise answer, they just need to search it on the internet.

  44. Sep 2017
    1. You don’t go to Facebook, Reddit, Twitter, or Instagram because you’re looking for something, rather, you want to see what it has found for you.

      This is a critical distinction.

    1. These analyses offer a theory-motivated method of gauging collaborative search efficiency beyondthe analysis of mean RTs

      This is interesting!

  45. Jul 2017
  46. Jun 2017
    1. we’re at a pivotal point not just in the life of our democracy, but in how we think, read, and make choices. Selective information is being presented to us in a way that encourages selective reading and offers psychological and social rewards for, to put it bluntly, being stupid and submissive and spreading stupid to submit others.

      ...

      What’s different now is that this propaganda is being gamed by professionals in a massive, orchestrated data campaign at a volume, pace, and consistency that not only muddies the truth, but completely eclipses the truth. Destroys the very notion of truth.

      ...

      The truth about the truth is that we believe because we want to, because our ability to think independently is a point of pride for Americans. The people behind the curtain are telling us the same story we tell ourselves about ourselves. But this is also a vulnerability: Independence is in its purist form a kind of division. If you exploit it the right way, you can turn a democracy against itself.

  47. May 2017
  48. citeseerx.ist.psu.edu citeseerx.ist.psu.edu
    1. Faster Approximate String Matching Baeza-Yates and G. Navarro, R. Algorithmica (1999) 23: 127. doi:10.1007/PL00009253


  49. Apr 2017
    1. Google is countering fake news and extremism by updating their search rankings, and taking direct feedback from users.

    1. Google is adding a Fact Check feature to Google Search and Google News. When fact checks are available from one or more approved publishers, they will appear in the search results.

      One requirement for publishers to be cited is to use the schema.org ClaimReview markup, or the Share the Facts widget.

  50. Feb 2017
    1. Imagine searching for “401k matching” and instead of just receiving relevant messages or files, you also get a list of people in HR that can answer your question, or a list of channels for your query where you might be able to find the information you are looking for, or even a list of commonly asked questions relevant to that topic with links to the channel where each one was answered.

      This would be good.

    2. Relevant search relaxes the age constraint and takes into account the Lucene score of the document — how well it matches the query terms (Solr powers search at Slack).

      Relevant

    3. Recent search finds the messages that match all terms, and presents them in reverse chronological order.

      Recent

    4. On average, 20% of a knowledge worker’s day is spent looking for the information they need to get their work done.

      Wow!

  51. Jan 2017
    1. doesn’t mean that it’s impossible to design and create better versions that maximize the benefits while minimizing the problems

      Who are these mythical platform developers? And why wouldn't they follow in the footsteps of the very profitable publishers?

    1. Boolean satisfiability problem

      This is just one specific type of the classes of satisfiability problems (a.k.a. search problems).

      Other related problems include: Linear equation satisfiability, Linear inequality satisfiability, 0-1 integer linear equation satisfiability.

      Given the current context (of search problems), all the above are known as NP problems in general (with the observation, that the classic definition of NP limited the scope to only YES-NO problems).

      One can think of search problems as "one of many ways of characterizing the set of problems that form the basis of the study of intractability". Other ways include viewing such problems through the lenses of decision problems or optimization problems. In other words, problems in any of the aforementioned types can be translated between (or more formally, reduced to) each other with relative ease.

      Source(s): http://algs4.cs.princeton.edu/66intractability/

    2. efficiently solves

      a.k.a. guarantees a poly-time solution for any SAT problem instance.

  52. Dec 2016
    1. Google has adjusted their search results (or the ranking algorithm itself) for "did the holocaust happen". Previously, a page on a white supremacist site had been the top result. That page still appears, but now a little lower in the results.

      Sites that promote hate, violence, extremism, and lies should automatically be ranked very low. Google needs to start acting responsibly.

    1. “a-r-e”. And then “j-e-w-s”.

      If you flip the words around to "jews are..." you don't get predictive searches. Why? I guess it doesn't think of the words as a question. Check out the related searches at the bottom of my page with this query.

      How in the world can these results be so skewed? Is there an active community of antisemitic folks actually looking for self-justification or is this a gaming of the search system?

  53. Nov 2016
    1. Summary: Displaying faceted-search controls on mobile devices in a ‘tray’ overlay is a new effective solution to the challenge of showing both results and filters on small screens.
    1. During mobile e-commerce usability study Baymard Institute observed that more than 50% of users tried to “search within” their currently navigated category path, in an attempt to “filter the product list on my screen with a search query”. However, 94% of mobile e-commerce sites and apps do not support such behavior.
  54. Oct 2016
    1. Find the layer entitled WSDOT Traffic Sections

      The quickest way to find it is to type "WSDOT Traffic Sections" in the search box on the top right of the wsdot arcgis site.

  55. Aug 2016
    1. Page 6

      Borgman on the importance of scale in information retrieval. It's an interesting question for the humanities not only does large-scale introduce new methods for example just reading it also makes traditional methods more difficult EG challenges close reading. It is not enough to say (as color and others do) that they don't like distant reading. They also need to say how they propose doing the reading in a million book environment.

      data and information have always been both input and output of research. What is new is the scale of the data and information involved. Information management is notoriously subject to problems of scale [bibliography removed]. Retrieval methods designed for small databases declined rapidly ineffectiveness as collections grow in size. For example a typical searcher is willing to browse a set of matches consisting of one percent of a database of 1000 documents (10 documents), maybe willing to browse a 1% set of 10,000 documents (100), rarely is willing to browse 1% of 100,000 documents (1000), and almost never would browse 1% of 1 million or 10 million documents.

    2. Page 156

      Borgman discusses a couple of things that are useful for me. The first is how students discover what they miss from the library after they graduate and no longer have access to journals.

      The second is that this passage supplies some evidence for the claim that things that are not online no longer exist as far as such behavior is concerned.

      There's some bibliography at the end of the passage covering both of these points in the print book.

      Scholars seem to be even more dependent on library services for access to scholarly publications than in the past. Personal subscriptions to journals have declined substantially. Faculty and students have been known to panic when unable to access online library services, whether due to system failures or incorrect authentication settings. Students' dependence on these services becomes especially apparent when they graduate and no longer have access. Librarians learned early in the days of online catalogs that people rely on online sources, even if those sources are incomplete. Older material accessible only via the card catalog was quickly "widowed," which was a primary motivation for libraries to complete the retrospective conversion of card catalogs to digital form. The same phenomenon occurred with online access to journals. The more access that libraries provide, the greater the depth of coverage that users expect. The use of printed indexes in libraries has dropped to near zero, although printed finding aids remain popular in archives.