21 Matching Annotations
  1. Apr 2024
    1. by system we eliminate duplication, we concentratecontrol;

      Part of Luhmann's practice in looking up ideas to place in his zettelkasten first was a means of preventing duplication of ideas. If an idea is repeated, that can be noted on the extant card as evidence that others see the idea too or one can compare the potential subtle differences as a means of expanding the space.

      Eliminating duplication also assists in the ratchet effect of collecting information and connecting it.

  2. Jan 2024
    1. It's also common to want to compute the transitive closure of these relations, for instance, in listing all the issues that are, transitively, duped to the current one to hunt for information about how to reproduce them.
  3. Dec 2023
  4. Oct 2023
    1. I've been struggling with duplicate notes within my Zettelkasten. .t3_17ajd34._2FCtq-QzlfuN-SwVMUZMM3 { --postTitle-VisitedLinkColor: #9b9b9b; --postTitleLink-VisitedLinkColor: #9b9b9b; --postBodyLink-VisitedLinkColor: #989898; }

      reply to u/Flubber78769 at https://www.reddit.com/r/Zettelkasten/comments/17ajd34/ive_been_struggling_with_duplicate_notes_within/

      This is the value of actually indexing your content. You can do a quick search around the index entries which provides a natural check against duplication, but importantly it'll let you think about those ideas again and spend your time more profitably by expanding upon them instead.

      Occasionally I'll find duplication from one source to the next which provides some support about an idea's value or spread over time, especially when I'm tracking usage of a thing, so it's not always the case that duplication is automatically a bad thing.

  5. Jun 2022
    1. The slipbox and index cards on which Vladimir Nabokov wrote his novel Lolita.

      Vladimir Nabokov famously wrote most of his works including Lolita using index cards in a slip box. He ultimately died in 1977 leaving an unfinished manuscript in note card form for the novel The Original of Laura. Penguin later published the incomplete novel with in 2012 with the subtitle A Novel in Fragments. Unlike most manuscripts written or typewritten on larger paper, this one came in the form of 138 index cards. Penguin's published version recreated these cards in full-color reproductions including the smudges, scribbles, scrawlings, strikeouts, and annotations in English, French, and Russian. Perforated, one could tear the cards out of the book and reorganize in any way they saw fit or even potentially add their own cards to finish the novel that Nabokov couldn't.

      Index card on which Nabokov collated notes on ages, heights, and measurements for school aged girls as research for his title character Lolita.

      More details at: https://www.openculture.com/2014/02/the-notecards-on-which-vladimir-nabokov-wrote-lolita.html

  6. Sep 2021
    1. The question is similar but its in a Rails context. The solutions would answer my question, but I'm almost certain that he could probably leverage Arel to solve his problem. The question I posted was designed purely as a Ruby question so that it was easier to search for. You might want to suggest an edit of the title of his question because it didn't show up when I searched for a solution to my problem.
    2. Yes, unfortunately the other question has a misleading and completely irrelevant Rails context and might be harder to find for some people. IMHO, it's still a perfect content duplicate, although not a topic one. Answers are also equal. Anyways, still a good question of yours.
  7. Aug 2021
  8. Mar 2021
    1. Yes, but honestly, and no offense intended, but I don't see the harm in these type questions, nor why some people are offended when they are asked. If I owed a website, I wouldn't mind it because it just creates more pages that can be indexed. I see it as helping the website. But, I did look and didn't see a simple answer. Again, no offense is intended. I've just never understood the complaints.
  9. Feb 2021
  10. Nov 2020
  11. Jun 2020
    1. It would be better if you asked a new question. In the worst case scenario, it would be marked as a duplicate (which still means you would probably get some sort of resolution).
    1. If you've found a problem in Ruby on Rails which is not a security risk, do a search on GitHub under Issues in case it has already been reported. If you are unable to find any open GitHub issues addressing the problem you found, your next step will be to open a new one.
  12. Nov 2015
    1. Presentation summarizing an approach to duplicate web page detection that was developed by a researcher whilst at Google in the early 2000s

  13. Sep 2015
  14. arxiv.org arxiv.org
    1. Given an LSH familyH, the LSH scheme amplifiesthe gap between the high probabilityP1and the lowprobabilityP2by concatenating several functions

      Useful recap of LSH

    2. Recent survey paper for hashing-based approaches to similarity search

    1. This paper has a very useful overview of previous work that is worth reading under section 9.

    2. We used the following publicly available real datasets in the experiment

      Datasets used are DBPL, ENRON, UNIREF-4GRAM. All small (<1M records) in web terms and I would guess, all with small document sizes.

      Given a lengthy paper, could potentially divide into smaller documents (1 doc === 1 page) and do signature calculation on a per-page basis. This could have the benefit of bounding the search time by limiting the number of pages that need to be rendered to text in order to start the lookup process.