      Borgman on the importance of scale in information retrieval. It's an interesting question for the humanities: not only does large scale introduce new methods (for example, distant reading), it also makes traditional methods more difficult (e.g., it challenges close reading). It is not enough to say (as color and others do) that they don't like distant reading. They also need to say how they propose doing the reading in a million-book environment.

      Data and information have always been both input and output of research. What is new is the scale of the data and information involved. Information management is notoriously subject to problems of scale [bibliography removed]. Retrieval methods designed for small databases decline rapidly in effectiveness as collections grow in size. For example, a typical searcher is willing to browse a set of matches consisting of 1% of a database of 1,000 documents (10 documents), may be willing to browse a 1% set of 10,000 documents (100), rarely is willing to browse 1% of 100,000 documents (1,000), and almost never would browse 1% of 1 million or 10 million documents.
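      To make the arithmetic in the passage concrete, here is a small sketch that computes the size of a 1% match set at each collection size mentioned. The function name `matches_to_browse` and the `SIZES` list are my own illustrative choices, not anything from Borgman.

```python
def matches_to_browse(collection_size: int, percent: int = 1) -> int:
    """Size of a result set that matches `percent` percent of the collection.

    Integer arithmetic is used so the counts are exact.
    """
    return collection_size * percent // 100


# Collection sizes named in the passage (hypothetical variable name).
SIZES = [1_000, 10_000, 100_000, 1_000_000, 10_000_000]

result_sizes = {n: matches_to_browse(n) for n in SIZES}

for n, k in result_sizes.items():
    print(f"{n:>10,} documents -> {k:>7,} matches at 1%")
```

      The proportion stays constant, but the absolute number of matches grows tenfold at each step, from 10 up to 100,000, which is the point of the example: what a searcher can browse is bounded by absolute count, not by percentage.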

    1. Again, though, if maximum recall is required, it is impossible in ranked retrieval to know what is omitted by new queries, whereas Boolean queries allow the user to control and modify the search until a satisfactory result has been achieved, and they therefore also seem better suited to iterative searches.
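    The contrast in this note can be illustrated with a toy sketch, assuming a four-document corpus, whitespace tokenization, and a simple term-count score with a fixed cutoff `k`. All names here (`DOCS`, `boolean_and`, `ranked`) are hypothetical, and real ranked retrieval systems use far more elaborate scoring. The point is only that a Boolean query returns an exact set that can be compared across refinements, while a ranked list truncated at `k` gives no indication of what fell below the cutoff.

```python
from collections import Counter

# Toy corpus (invented for illustration).
DOCS = {
    1: "close reading of a single novel",
    2: "distant reading of a million books",
    3: "reading at scale in a million book library",
    4: "information retrieval in large collections",
}


def tokens(text: str) -> list[str]:
    """Lowercase whitespace tokenization."""
    return text.lower().split()


def boolean_and(terms: list[str], docs: dict[int, str]) -> set[int]:
    """Return the exact set of documents containing every query term."""
    return {
        doc_id
        for doc_id, text in docs.items()
        if all(term in tokens(text) for term in terms)
    }


def ranked(terms: list[str], docs: dict[int, str], k: int) -> list[int]:
    """Return the top-k documents by summed term count (ties broken by id)."""
    scores = {}
    for doc_id, text in docs.items():
        counts = Counter(tokens(text))
        score = sum(counts[term] for term in terms)
        if score > 0:
            scores[doc_id] = score
    ordered = sorted(scores, key=lambda d: (-scores[d], d))
    return ordered[:k]


# Boolean: each refinement yields a full set, so the user can see
# exactly which documents a narrower query dropped.
broad = boolean_and(["reading"], DOCS)
narrowed = boolean_and(["reading", "million"], DOCS)
dropped = broad - narrowed

# Ranked: only the top k come back; partially matching documents
# below the cutoff are silently absent from the result.
top = ranked(["reading", "million"], DOCS, k=2)

print("broad:", sorted(broad))
print("narrowed:", sorted(narrowed))
print("dropped by narrowing:", sorted(dropped))
print("ranked top 2:", top)
```

    In this sketch document 1 matches "reading" and would score above zero, but it never appears in the ranked output, and nothing in that output says so. With the Boolean sets, the difference between two queries is itself an inspectable result.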