388 Matching Annotations
  1. Aug 2023
    1. The presence of duplicate information also cannot be counted when a data stakeholder qualifies as ideal; thus, Algorithm 3 checks for identical datasets in an ODC, i.e., datasets with the same identifier. It uses a set to keep track of the identifiers of each dataset and prints a message with the percentage of duplicate datasets

      I don't understand the first sentence (up to the semicolon). What is meant by "presence cannot be counted", "a data stakeholder", and "a data stakeholder qualifies as ideal"?

      Are two datasets with the same identifier "identical"? Or are they the same dataset?

      For an RDF processor, two resources with the same identifier are the same resource. And DCAT is okay with that.
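
      As a sketch, the check described above amounts to something like the following Python (the names are mine, not the paper's):

      ```python
      def report_duplicates(datasets):
          """Report the share of datasets whose identifier was already seen."""
          seen = set()
          duplicates = 0
          for dataset in datasets:
              identifier = dataset["identifier"]
              if identifier in seen:
                  duplicates += 1
              else:
                  seen.add(identifier)
          if datasets:
              print(f"{100 * duplicates / len(datasets):.1f}% duplicate datasets")
      ```

      Note that this finds re-used identifiers, not "identical datasets": it never compares the datasets' contents.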

    2. References

      It feels like there are too many self-references for a single-author paper.

      DOI links should refer to URLs starting with https://doi.org/ instead of http://dx.doi.org/, and underscores should not have a \ in front. References to arXiv are inconsistent: some have a DOI and others don't.
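
      Rewriting the resolver prefix is mechanical; a minimal sketch (the regex is mine):

      ```python
      import re

      def fix_doi_url(url):
          """Rewrite legacy DOI resolver URLs to the preferred https://doi.org/ form."""
          return re.sub(r"^https?://dx\.doi\.org/", "https://doi.org/", url)

      assert fix_doi_url("http://dx.doi.org/10.1000/182") == "https://doi.org/10.1000/182"
      ```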

    3. While some common practices exist, individual approaches to ensuring data quality have historically demonstrated superior performance.

      Please refer to work that supports this vague claim.

    4. focus on the automatic quality assessment of ODCs,

      Looking at the references, I don't believe this work is the first to focus on automatic QA of ODCs.

    5. (pp. 29–126)

      I didn't know that W3C Recommendations had page numbers?

  2. Oct 2022
    1. Having used both Marva and Sinopia, I think that Marva supports “BIBFRAME cataloging without thinking about BIBFRAME” especially well.

      I still see a few Linked Data-y elements, like the templates and profiles being identified with "bf2:something".

  3. May 2022
    1. Currently often these repositories are giving some data representation enhanced with data base systems that provide a local layer of data and metadata indexing.

      This sounds speculative.

  4. Apr 2022
    1. development and expansion of a linked open data cloud of cultural heritage

      In general it would be good to keep in mind the rationale for such a LOD cloud and express this rationale.

    2. LIFT uses another Python library, lxml, in conjunction with RDFLib to parse TEI documents

      I think RDFLib is only used to create and output the RDF, not for parsing the TEI documents.

    3. so all elements and attributes must be mapped to classes and properties from arbitrary ontologies

      This is better for reuse, not really a drawback :) I do understand that there may not always be a perfect mapping.

    4. Python

      Note that the documentation mentions Python 2.7 very explicitly. That version of Python is no longer supported; it would be great if the code were updated to support Python 3.

    5. open-source application

      The GitHub repository does not contain a license, so the code does not appear to be open source.

    6. there is a lack of user-friendly tools for working with digital scholarly editions and LOD

      Why do we need such tools?

  5. Jan 2022
  6. Nov 2021
    1. For any of our DH work to be sustainable, it needs to be produced in full dialogue with the community of information professionals.

      That sounds about right.

    2. They tired of care-taking, even though this involved little more than continuing to host the project files on a server.

      I'm puzzled that the author uses these words, when the point of the article appears to be that maintenance is work.

      This quote was dissected by Andromeda Yelton on Twitter.

  7. Jul 2021
    1. Valid use of attributes in XML

      The quotes are smart quotes, which are not accepted as quotes.
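
      For comparison, a minimal example of both forms (my snippet): only straight single or double quotes are valid attribute delimiters in XML.

      ```xml
      <!-- well-formed: straight quotes -->
      <person name="Ada" role='author'/>
      <!-- not well-formed: smart quotes -->
      <person name=“Ada”/>
      ```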

    2. Element names cannot start with the string xml, XML, Xml, etc

      This is new to me. But it's true:

      This specification does not constrain the application semantics, use, or (beyond syntax) names of the element types and attributes, except that names beginning with a match to (('X'|'x')('M'|'m')('L'|'l')) are reserved for standardization in this or future versions of this specification.

    1. W3Schools identifies

      W3Schools is not affiliated with W3C, who manages the standards. Maybe it isn't the best source for rules?

    1. it is a requirement for every TEI project to provide a detailed ODD model.

      I assume it is not required for every project to create a new ODD model? Reusing ODDs improves interoperability.

    1. For instance, the following statement describes a black circle with a radius r of 50:

      Note that this is not a complete SVG document. You cannot save this snippet as a file and view it without adding elements around it.
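
      For instance, a minimal complete SVG document around such a circle (the dimensions are my assumption) would be:

      ```xml
      <?xml version="1.0" encoding="UTF-8"?>
      <svg xmlns="http://www.w3.org/2000/svg" width="120" height="120">
        <!-- a black circle with radius 50, centred in the canvas -->
        <circle cx="60" cy="60" r="50" fill="black"/>
      </svg>
      ```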

    1. DTD syntax as it is easier to learn and use than schema languages, while the general principles of modelling are similar. 

      I don't quite agree that the DTD syntax is easier than XML's syntax.
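
      To make the comparison concrete, here is a tiny DTD fragment (my example) declaring a paragraph element with one optional attribute; the declaration syntax is quite different from the XML it constrains:

      ```xml
      <!ELEMENT p (#PCDATA)>
      <!ATTLIST p rend CDATA #IMPLIED>
      ```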

    2. what attributes they can contain

      Attributes were not mentioned yet.

    1. discipline of text encoding

      Text encoding is a discipline?

    2. philology

      Philology is the study of language in oral and written historical sources; it is the intersection of textual criticism, literary criticism, history, and linguistics (with especially strong ties to etymology). Philology is more commonly defined as the study of literary texts as well as oral and written records, the establishment of their authenticity and their original form, and the determination of their meaning. A person who pursues this kind of study is known as a philologist.

      Wikipedia on Philology

    1. Federally, Immigration, Refugees and Citizenship Canada told CBC News it can only issue documents printed "in the Roman alphabet with some French characters" because of standards set by the International Civil Aviation Organization. 

      Hmm. I don't think you can put all the blame on commercial air travel.

  8. Nov 2020
  9. Apr 2020
    1. Looking at the number of self-citations, this article is from a niche field. I have seen a presentation from this author in 2017 and subsequently had some discussion about how to apply OntoUML, but at the time his ideas were too philosophical for me.

      This article may still be too philosophical for many users of RDFS and OWL, but it provides a good argument for looking at OntoUML.

  10. Mar 2020
    1. nanopublications now help us to publish this model and its instantiations in a FAIR manner.

      Nanopublications may be the most FAIR way to publish Linked Data.
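
      For reference, the shape of a nanopublication in TriG (my sketch; the IRIs are hypothetical):

      ```trig
      @prefix np:   <http://www.nanopub.org/nschema#> .
      @prefix prov: <http://www.w3.org/ns/prov#> .
      @prefix ex:   <http://example.org/np1/> .

      ex:head {
        ex:pub a np:Nanopublication ;
          np:hasAssertion ex:assertion ;
          np:hasProvenance ex:provenance ;
          np:hasPublicationInfo ex:pubinfo .
      }
      ex:assertion  { ex:model ex:describes ex:phenomenon . }
      ex:provenance { ex:assertion prov:wasDerivedFrom ex:experiment1 . }
      ex:pubinfo    { ex:pub prov:generatedAtTime "2020-03-01T00:00:00Z"^^<http://www.w3.org/2001/XMLSchema#dateTime> . }
      ```

      The assertion, its provenance, and the publication metadata each live in their own named graph, which is what makes the package citable as a unit.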

    2. Registered in FAIRsharing as https://fairsharing.org/bsg-s001394

      Doesn't FAIRsharing support DOIs for entries?

  11. Jun 2019
    1. Will we continue to use a microservices architecture for our digital repository in the future? We aren’t certain.

      In software engineering in general, it looks like people are taking a more nuanced view of microservices. As with many new ideas, it looked to be The Answer to some problems, but turned out to have its own problems.

    2. Usability testing also revealed biases in our design process that had gone unnoticed. A particularly salient example of this is the name ‘BC Digitized Collections’. As archivists and librarians, we took for granted that our users would be able to decipher this and understand what they would be able to find within the system. In fact, our test results suggested the opposite. A reevaluation of how we are naming and describing the system is forthcoming, with a greater sensitivity to our end users’ level of familiarity with library jargon and information architecture conventions.

      Interesting!

    3. Technical

      Perhaps you meant 'Image [API]'?

    4. Of the eight potential repository solutions that we investigated before undertaking this project, five were open-source and three were vended solutions.

      Open-source solutions can be provided (even hosted) by vendors, so you probably meant proprietary to contrast with open source. Is that correct?

    5. While Alma offers partial IIIF support with its implementation of Universal Viewer,

      Wait, what? It does?

    6. Instead, we worked with ITS to isolate the server from our campus network to minimize the impact of any potential attacks, then rapidly accelerated our migration timeline.

      It almost sounds like the security issues came in handy :)

    1. The most relevant to spot

      I keep thinking of Metabase as an open-source alternative that allows creating and sharing charts. Grafana is another dashboard application, geared towards timeseries data.

  12. Mar 2019
    1. contract adjustment mechanisms and alternative dispute resolution procedures

      Beautiful words...

    2. When rightholders do not provide the providers of online content-sharing services with the relevant and necessary information about their specific works or other subject matter

      Who is not a rightholder? How is everyone in the whole world going to provide all online content-sharing service providers with information about their works?

    3. It is therefore important to promote the development of the licensing market between rightholders and providers of online content-sharing services

      Do we really want licences for everything everyone does online?

    4. A table of contents would have come in very handy in this document...

    1. Open Archival Information System (OAIS) compliant preservation system

      I'm not convinced there is such a thing as an OAIS compliant preservation system, as I still see OAIS as a model to describe systems.

    2. OAIS is the ISO-standard for digital preservation

      I'm taking this a bit out of context, but I was never able to look at OAIS as a standard for digital preservation.

  13. Feb 2019
    1. We see it with a drive to static sites, conflating speed with the lack of a database, and ending up recreating the database in the filesystem or relying on a raft of third party services to plug the holes that would have been filled by a more traditional CMS.

      Guilty… :$

    1. Competing scholarly platforms, many of which are proprietary, appear to be growing in popularity yet demonstrate poor support for open standards or prevalent open science technical protocols, as well as low levels of integration with open scholarly infrastructure.

      It would be great to see numbers for this statement, although I wouldn't be surprised to see it confirmed.

    1. “back office”

      This should say "front office".

    2. Archivists need to verify metadata, documentation, and data integrity to ensure that data sets meet minimum standards for ingest.

      And this is where automation could help.

    3. Contributors who submit a data set once every year or two, or maybe once in a career, need assistance in structuring and documenting their files for submission.

      Everyone should be taught to structure and document their files even if they will not submit them for archiving.

    4. Automation is facilitating more archival procedures, such as batch ingest of files and APIs for submission and retrieval, but much of the labor associated with contributing data to archives remains craft work.

      This remains an issue for growth of data archiving.

    5. three‐ring binder

      At least two interviewees use three-ring binders, which I think are rare in the Netherlands. Interesting!

    6. Weblogs

      i.e. logs of web traffic

    7. DATAVERSE.nl

      Also known as DataverseNL

    1. Because retraining a neural network requires large annotated data sets and extensive computational power, we looked for different ways to use existing neural networks to identify visual similarity. We turned to the penultimate layer of the CNN to identify similar visual trends in advertisements.

      Excellent!
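
      The general technique looks roughly like this with torchvision (my sketch, not the authors' code; the model choice is an assumption):

      ```python
      import torch
      import torchvision.models as models
      from torchvision.models import ResNet50_Weights

      # Load a pretrained CNN and replace its classification head with the
      # identity, so a forward pass returns penultimate-layer features.
      model = models.resnet50(weights=ResNet50_Weights.DEFAULT)
      model.fc = torch.nn.Identity()
      model.eval()

      with torch.no_grad():
          features = model(torch.randn(2, 3, 224, 224))  # two dummy images

      # Cosine similarity between feature vectors as a visual-similarity score
      similarity = torch.nn.functional.cosine_similarity(features[0], features[1], dim=0)
      ```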

    2. F1 score—a harmonic mean of the precision and recall—of 0.9 over the entire period, meaning that it accurately predicted the type of 90% of the images.

      I think this is the definition of 'accuracy'. Precision and especially recall are important when the model also has to find instances, rather than classify all instances into one of two categories.
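
      A small worked example (the numbers are mine) of how the two metrics come apart:

      ```python
      # Binary classifier: 90 true positives, 0 false positives,
      # 10 false negatives, 0 true negatives.
      tp, fp, fn, tn = 90, 0, 10, 0

      accuracy = (tp + tn) / (tp + fp + fn + tn)          # 0.90
      precision = tp / (tp + fp)                          # 1.00
      recall = tp / (tp + fn)                             # 0.90
      f1 = 2 * precision * recall / (precision + recall)  # ~0.947

      print(accuracy, f1)  # an F1 of 0.9 need not mean 90% correct predictions
      ```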

    1. As with the universality of browsers, there is a major role for the W3C in creating the standards that will allow decentralized data pods and apps to interoperate.

      This point of standardisation has not had enough attention in the Solid presentations I have seen until now. The ecosystem of Solid pods and apps requires agreements on APIs – not so much for reading and writing data documents as for the structure and contents of those documents.

  14. Jan 2019
    1. In my experience, Linked Data excites developers who have never seen it before, because they suddenly have access to a whole Web of data instead of just one back-end. It opens up huge opportunities, since they no longer depend on harvesting data to build something nice.

      Hear, hear!

    2. "object": "https://you.example/likes/2018/12#rubens-post",

      The object of the Like activity should be the IRI of the thing liked.
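
      A corrected sketch following ActivityStreams 2.0 (the IRIs are hypothetical):

      ```json
      {
        "@context": "https://www.w3.org/ns/activitystreams",
        "type": "Like",
        "actor": "https://you.example/profile#me",
        "object": "https://rubens.example/posts/2018/12/better-web"
      }
      ```

      Here "object" is the IRI of the post being liked, not an entry in the liker's own feed.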

    1. Recommendations 9 and 10 are not directly mapped to any requirements as requirements were inferred from use cases from human users.

      This phrasing makes it sound like requirements from human users are very different, whereas in the end all use cases come from human users, don't they?

    2. new description format

      Everywhere this format is called a user story.

  15. Dec 2018
    1. A political cartoon can be better understood if we know the offices of state that the individuals held at the time of the cartoon, and here is where it helps to combine artwork data and political data on the same platform.
  16. Nov 2018
    1. Have your own research question in DH projects. If we want to be partners in research, we can also do research. Find the research aspect in the projects that are relevant for you and feed the answers you find during the projects back into your practice as a library. Make research projects mutually beneficial and become the partner you want to be.

      I was reminded of this recently and agree.

    2. Work more openly. Publish documents that you might think are only of interest to your direct colleagues, but can actually be very valuable to your wider network.

      Good reminder!

    1. As machine learning techniques like Optical Character Recognition (OCR) push new boundaries, IIIF Image provides a uniquely efficient way to build new training sets that can easily be shared as lists of urls.
    1. Composite annotations are a more complex case as they do not represent a single entity mentioned in the text. As such they do not contain a named entity reference but rather point to one or more existing annotations that are used to mark up the information that describes the thing represented by the composite annotation.

      You are creating entity references, but they are unnamed by default.

    2. using annotation classes for distinguishing between mentions of different types of objects such as Persons or Locations

      My intuition would be to distinguish mentions of different types of objects by their entity class, not annotation classes.

    3. Annotations have technical metadata. It is important to identify, who and when created the annotation, what is the visibility of the annotation, etc.

      This is usually called descriptive metadata (who created the annotation when) and administrative metadata (who should be able to see the annotation).

  17. Oct 2018
    1. It looks to me like "building bridges" is what other people would call "annotating the Web". Using the W3C Web Annotation standards one can do precisely what "building a bridge" is supposed to do: link pages, or specific parts of pages, to (parts of) other pages/commentary/... – except that getting virtual money for annotating is not part of the standards (or the thinking, as far as I know).
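
      For illustration, a minimal Web Annotation (my example; the URLs are hypothetical) that "bridges" a fragment of one page to another page:

      ```json
      {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "type": "Annotation",
        "target": {
          "source": "https://example.org/page.html",
          "selector": {
            "type": "TextQuoteSelector",
            "exact": "building bridges"
          }
        },
        "body": "https://example.com/commentary.html"
      }
      ```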

  18. Sep 2018
    1. there are no other costs related to the creation and attribution of the identifiers.

      No costs for end users, I presume?

    1. a policy must be established for the three collection levels “objects”, “metadata” and “metadata records” based on a pilot.

      It is still unclear why the authors chose these three 'collection levels' and why there must be a policy for them.

    2. The pilot will reveal

      A pilot project should reveal obstacles – whether it does depends on the setup and execution.

    3. a kind of scorecard containing all FAIR principles on which one can mark the score for each principle per collection

      How to measure FAIRness of data is still a topic of research. What would you suggest?

    4. ORCID (orcid.org),

      ORCID is only for researchers, and only living people can get an ORCID.

    5. “Persistent” entails the permanent availability of the identifier

      I think people in the field of persistent identifiers don't dare to claim PIDs must or can be permanent. Yes, permanent would be ideal.

    6. Digital objects have a date-timestamp

      A timestamp for creation, publication, copyright?

    7. The DEN DE BASIS set of best practices, although it is very broad and complete, is not compact enough for quick adoption.

      Do best practices have to be 'compact for quick adoption'?

    8. It seems that citation standards can be derived from provided standard metadata formats as facilitated by tools like Refworks, Mendeley and Zotero.

      Having a clear suggestion for a citation is also a visual reminder to actually cite the item. Plus, it can signal the most important metadata to use in the citation, including (potentially) rights holders.

    9. The list raises several questions, for instance how to link annotations to specific fragments of a text, what a retrieval protocol for annotations should look like, etc.

      I should read the article, but have they heard of Web Annotations?

    10. The phrasing of the FAIR principles is somewhat confusing, very probably because of the wish to be concise. Some principles (not all) refer to both “data” and “metadata”, which is formulated as “(meta)data”, for example in “F1. (meta)data are assigned a globally unique and persistent identifier”. One principle is self-referencing (“I2. (meta)data use vocabularies that follow FAIR principles“). Also the enumeration of the four sections (F, A, I, R), each containing 3 or 4 principles, is rather uncommon, for example “(F)air” has four individual principles F1, F2, F3, F4, but “(R)eusable” has one main principle R1 and three sub-principles R1.1, R1.2, R1.3. The logic of this subdivision is unclear, because all four Reusable principles are guidelines on the same level in their own right. In fact, the authors say so themselves: “The elements of the FAIR Principles are related, but independent and separable.” (Wilkinson et al. 2016, p.4).

      Interesting observations. I can see that they are confusing, but is it relevant in this context?

    1. It's the forgetting that will allow progress.

      But are blockchains a result of progress?

  19. Aug 2018
    1. The hardest part of looking back on this project is seeing how, in the last year of the project, we managed to do most of the heavy lifting, while prior to this year the project felt untethered.

      I understand this feeling. It can be a similar feeling for shorter projects too – "I've spent weeks on X before I found out that Y was an easier/quicker/better solution."

    2. It is challenging, if not impossible, to usefully quantify the return on investment of involvement in a community like Samvera, but the time spent by other community members providing technical support (solicited and unsolicited), conceptual support, and emotional support have provided us with benefits beyond what one could expect from contracts or subscriptions

      Well said.

    3. We hope our migration story will be helpful to developers and repository managers as a map of development hurdles and an aspiration of success.

      It is definitely laudable that you shared your story.

    1. Reproducibility advocates are converging around a tool set to minimize these problems. The list includes version control, scripting, computational notebooks and containerization — tools that allow researchers to document their data, the steps they follow to manipulate it, and the computing environment in which they work (see ‘Getting reproducible’).

      It would be great to have references to existing articles about 'getting reproducible'. I find the lack of references to papers with similar lists of practices amazing.

    1. Limitations

      It would be interesting to see a comparison of curation on various platforms. It is clear that GitHub is used a lot for curation of lists, but so is Wikipedia (supported by Wikidata). There used to be directories on the Web, to help people navigate, before search engines became the main means to navigate the Web.

    2. A curated list that provides centralized peer-reviewed resources about a specific topic provides a starting point where developers know that they can find high-quality resources and begin learning the subject.

      There are different platforms for this purpose as well, like https://learnxinyminutes.com. When you don't know about the existence of such lists, using online search would help beginners.

    3. Curation is a common practice in Archeology.

      Why only mention archaeology as a field that practices curation?

    1. Concepts for the future of scholarly publishing extend beyond collaborative writing [45,46]. Bookdown [47] and Pandoc Scholar [48] both extend traditional Markdown to better support publishing. Examples of continuous integration to automate manuscript generation include gh-publisher and Continuous Publishing [49], which was used to produce the book Opening Science [50]. Distill journal articles [51], Idyll [52], and Stencila [53] support manuscripts with interactive graphics and close integration with the underlying code. As an open source project, Manubot can be extended to adopt best practices from these other emerging platforms.

      Great list of interesting and useful tools.

    1. This resource is protected by copyright

      "Protected by copyright" is not saying what you can or cannot do; usually the phrase "All rights reserved" is used to indicate the possibilities for (re)use.

    2. To analyze the status and potential risks of copyright infringement for our digital collections we made use of the licenses granted by both Creative Commons and RightsStatements, established by Europeana and DPLA.

      How do the licences help you analyse the copyright status or risk of copyright infringement?

    3. We decided that all data are made as openly available as possible, in as many places as possible.

      If everything should be as open as possible, then why not allow downloads of TIFF files?

    4. Are they allowed to download and reuse our metadata?

      Are they?

    1. World Wide Web (w3c)

      C is for Consortium :)

    2. [I]nstitutional databases do not communicate with outside data stores such as publishers, vendors, non-profit organizations, and open source platforms.

      This is a very broad claim, which I think would be stronger if 'communication with outside data stores' is better defined and if you can support it with sources.

    3. The proposed solution is a shared information pipeline where all stakeholders/agents would be able to share and exchange data about entities in real time. Three W3C recommended protocols are considered as potential solutions: the Linked Data Notifications protocol, the ActivityPub protocol, and the WebSub protocol.

      It looks like you are equating the 'pipeline' with protocols. Is that what you are doing?

    1. introductory session on the Textual Encoding Initiative (TEI) offered by Huw Jones, head of the Digital Library Unit

      CUL provides introductions to TEI

  20. Jul 2018
    1. “Your primary collaborator is yourself six months from now, and your past self doesn’t answer e-mails,”

      Very true!

  21. Jun 2018
    1. The other cost barrier is the opportunity cost of requiring scholars to spend significant time learning how to install and deploy web applications before they even get the chance to see how those applications can support their digital scholarship goals.

      Yes!

    2. It was Brian Henebry, the Director for Architecture, Service and Operations for Miami's IT Service

      Was this the only collaboration with University IT? What kind of collaboration was there when you used a completely separate infrastructure?

    3. these are made more complicated with the addition of a third party (Amazon)

      How?

    4. plans for longer term sustainability; and succession planning when a client moves on and wants to take their work with them.

      Have you considered making the web apps and/or data in the VMs static to preserve them in a different way, like in a web archive or data repository? Sometimes all a researcher wants is to share their data online within the context of a website with context and documentation – you don't need a separate VM for eternity to do that.

    5. we recognize that users of the Scholars Dashboard may at some point want to move to a less experimental platform and move their work to a “production” environment.

      This means sometimes the 'work' has to be rebuilt from scratch (e.g. if an essential WordPress plugin is found to be a security vulnerability and needs to be replaced) and settings may need to be updated for different URLs etc. Who would be responsible for managing the production environment?

    6. Other services routinely require customization of their configuration to match the characteristics of the running instance

      Does this mean that even though scholars and students can create their own VMs, the CDS or IT have to configure it? Or is all configuration left to the end users? If the latter, how do they feel about that?

    7. That process is mostly manual and would benefit from more automation and integration with common billing mechanisms.

      Automation would be great, but I'm already impressed by the built-in usage tracking that allows splitting the bill in this way!

    8. (http://scholardashboard.miamioh.edu)

      I get a connection time-out when connecting from The Netherlands. Also, this should be secured with TLS, i.e. be HTTPS-only.

    9. As indicated in a 2014 EDUCAUSE article “Libraries have always been in the business of knowledge creation and transfer, and the digital scholarship incubator within the library can serve as a natural extension of this essential function” (Sinclair, 2014 Sinclair, B. (2014). The university library as incubator for digital scholarship. EDUCAUSE Review. Retrieved from http://er.educause.edu/articles/2014/6/the-university-library-as-incubator-for-digital-scholarship [Google Scholar]). Such incubators can create innovative virtual shared spaces that can support learning and discovery at different scales.

      This is sometimes called a 'digital scholarship laboratory', especially when there is a physical space.

    1. The goal is to create a toolkit for measuring the reuse of heritage data (i.e., all use that happens outside of viewing and downloading in the repository).

      This reminds me of https://peerj.com/preprints/26505, which is intended for measuring the use of research data.

    1. Mark Lizar [position statement] presents the idea of ‘consent receipts’. A working group of the Kantara Initiative developed a format to formally describe the purpose of data collection, the identity of the data controller, and more. Mark is working on making such receipts, and the policies they refer to, easier to understand.

      This is interesting and something I have been thinking about in 'personal contract management' terms.

  22. May 2018
    1. creating the architecture and scaffolding of the World Wide Web

      Of course the World Wide Web is not the same as the Internet – the Web was invented by Tim Berners-Lee.

  23. Apr 2018
    1. It would be interesting to see the impact in other search engines, but I understand this was not the focus of the article.

      Another result may be better interpretation of metadata in Zotero and other reference managers.

    1. As a profession, we reward productivity in the form of papers and grants, and sitting down to deeply read journal articles can feel like wasted time. Yet, if we aren’t regularly reading the literature, we risk that the work we are doing is out-of-date, duplicative, or derivative.

      Yes!

  24. Mar 2018
    1. The following five different NER systems have been used in our tests: Stanford NER, NER-Tagger, the Edinburgh Geoparser, spaCy, and Polyglot-NER.

      Short overview of NER software used in the research.
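
      For illustration, the spaCy API for this is pleasantly small (my snippet, not the paper's setup):

      ```python
      import spacy

      # Requires: python -m spacy download en_core_web_sm
      nlp = spacy.load("en_core_web_sm")
      doc = nlp("Mary Shelley was born in London in 1797.")
      for ent in doc.ents:
          print(ent.text, ent.label_)  # e.g. "Mary Shelley PERSON", "London GPE"
      ```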

    1. By assigning DOIs, these resources also become citable.

      You don't need a DOI to make something citable – any identifier will do.

    1. References

      Why are there no direct links to the articles in this same journal? Why don't you follow your own instructions for citing articles, and why did you leave out the DOIs?

    2. doi: 10.1038/sdata.2017.27 (2018).

      Is this the actual DOI? On the right of the page, under the Info tab the DOI contains '2018' instead of '2017'.

  25. Feb 2018
    1. For large-scale metadata harvesting the MetaStore is OAI complaint

      What do you mean by "X is OAI [compliant]"?

    2. content

      descriptive metadata, I presume?

    3. they either have a focus on generic institutional repositories without community specific adaptations (DSpace, Fedora)

      Fedora is not at all focused on being used in institutional repositories.

    4. KIT DM is more flexible

      Fedora and iRODS are designed to be very flexible, so this is a bold statement.

  26. Jan 2018
    1. Is DL-Learner related work?

    2. If we can incorporate such methods into end-to-end models, it becomes possible to let these models learn the most appropriate level of inference themselves.

      You could also make all the implicit knowledge explicit by using a reasoner before using ML.
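
      A sketch of that approach with RDFLib and the owlrl reasoner (the input file is hypothetical):

      ```python
      import owlrl
      from rdflib import Graph

      g = Graph()
      g.parse("data.ttl")  # hypothetical input

      # Materialise the OWL-RL entailments so that downstream ML sees
      # explicit triples instead of having to learn the inferences.
      owlrl.DeductiveClosure(owlrl.OWLRL_Semantics).expand(g)
      ```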

    3. 3.4. The default data model?

      This section considers the various ways of modeling information (knowledge graph, tree (XML) and table (relational model)) for use in machine learning.

    1. When designing such a system, scalability issues need also to be considered. These questions are not easily solved and will be investigated as part of our future research. We aim at addressing issues such as what is the best way to transfer all the data, how can large data sets be processed, and how can processing power be distributed.

      Many algorithms can be parallelised and distributed. Services that are not easily parallelised may be replicated on multiple machines, using a load balancing proxy for access via a central access point.
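
      A generic sketch of the first option (the work function is a placeholder):

      ```python
      from concurrent.futures import ProcessPoolExecutor

      def process_chunk(chunk):
          # Placeholder for the actual per-chunk computation
          return sum(chunk)

      if __name__ == "__main__":
          chunks = [range(i, i + 1000) for i in range(0, 10_000, 1000)]
          with ProcessPoolExecutor() as pool:
              results = list(pool.map(process_chunk, chunks))
      ```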

    2. With DivaServices all complicated installation and configuration steps to use a method are removed by providing a simple-to-use RESTful web service.

      Unfortunately the authors provide no information about service levels to be expected.

    1. RDFa does not have a standardized option to place data in named graphs

      I assumed that by default, a file containing RDF serves as the graph. Reading this, I probably shouldn't.

  27. Dec 2017
    1. Moreover, bad research can cause a domino effect: if another study does not replicate that effect, it has a smaller chance of being published. The first article creates a bias: if it is flawed, all good studies have a harder time.

      An important argument against rejecting articles in which an effect is not demonstrated.

    1. I have not seen this crucial second paradigm shift articulated explicitly elsewhere

      dokieli is (AFAIK) the best example of an app being a view – or rather providing a view. And feed readers. You're probably right that it hasn't been discussed a lot.

    2. We can essentially replace LinkedIn by an address book, where somebody is a connection if they also have you in their contact list.

      I've wondered why we've started exposing our contact lists in the first place – but that's a different discussion.

    3. Consider this social media post, where an author states his professional opinion on an online news article.

      Great visualisation

    4. Most Web applications today follow the adage “your data for my services”. They motivate this deal from both a technical perspective (how could we provide services without your data?) and a business perspective (how could we earn money without your data?).

      Great observation! The business perspective also includes 'why would anyone pay us for this service?', as a counter-counter question to the counter question 'why don't you charge money instead of data?'

  28. Nov 2017
    1. A practice is included in our list if large numbers of researchers use it and large numbers of people are still using it months after first trying it out. We include the second criterion because there is no point in recommending something that people won't actually adopt.

      I like the criteria for inclusion (and the whole article), but one could wonder if they really make something a "good enough" practice. How many is "large numbers of researchers"?

    2. Many of our recommendations are for the benefit of the collaborator every researcher cares about most: their future self (as the joke goes, yourself from 3 months ago doesn't answer email…).

      :)

  29. Oct 2017
    1. Desktop editors for Markdown writing may meet content editing needs in these cases but were not explored in depth for this article.

      Using a tool like pandoc may allow someone to write in Word, LibreOffice or (La)TeX and convert the file to Markdown or HTML, but I understand it always makes things more complex.

    1. Forming an individual relationship may be time-consuming but it can make a big difference in the quantity and quality of researcher contributions.

      I do wonder if larger group introductions can help too.

    2. Throughout the outreach process, it has become increasingly apparent that focused and personalized attention, demonstrated through individualized emails and one-on-one meetings, helps increase researcher participation.

      Hopefully they won't need to address every individual researcher as researchers get comfortable with the catalog and start helping each other get started with the system…

    3. People just assume that the work I do in my day job now is much harder than it actually is, so if I can lower that barrier we can have more people learning to do it and more people can be more efficient in their jobs.

      This is one of the reasons for the DH Clinics to exist.

    1. Very interesting summary and comparison between 'DP' and 'DH'!

    2. Do we not see likewise in dh that conferences focus on techniques, and papers focus more on describing technology than its application? 

      Yes, we do see this! Amen! ;)

    3. automatisation

      automation?

    1. The Harry Ransom Center at the University of Texas Austin, which is well-funded and ambitious, is said to have particularly driven up the price for the most sought-after collections

      What a coincidence that the name of the Center includes "Ransom"!

  30. Sep 2017
    1. The framework of the EM research is mainly BS added with the concept of spirituality, as described by religious sociologists

      Citations needed

    2. BS is the most important framework of the research.

      Why?

    3. Here, however, the discussion starts at the level of the organisation because this was logically seen the best starting point.

      What logic did you use?

    4. blood groups

      Did you mean blood types? Why did you choose blood groups as an analogy for model explanation or interpretation?

    5. The EM is an open model. This implies that no restrictions exist in publishing.

      Is the model in the public domain?

    6. Research to the EM provided relevant information that is profitable for every (potential) user

      What research provided what information and how is that information profitable? (Did you mean useful?)

    7. [T]he EM is far and foremost a coaching model.

      Please define coaching model.

    8. BS studies made clear that BS has positive effects for the organisation as a whole, and the welness of the employee.

      citations needed

    1. Because there still does not exist a proper Knowledge Management at the GEI as would be provided by using a suitable KOS these research results can not yet be linked to other research contexts and therefore are in danger of not being used afterwards. Their production within a highly specialized research community with complex but separate contexts and systems prevents them from being found easily, consequently followed by “death of data”, double work and waste of resources. Another researcher with related information interests has no knowledge about the already existing GEI data and is not able to satisfy their need of information easily.

      Making research results findable can be done using non-KOS, like a full-text search engine. I think 'not having a KOS will keep research results from view' is not a strong argument.

    1. scientific protocols

      i.e. biomedical / experimental protocols :)

    2. This post has been written from a life sciences point of view. Life sciences also seems to be the main target discipline of protocols.io, even though the principles of open science are applicable in all disciplines.

    1. Before approving a study, ethics committees should ask researchers to declare in writing their willingness to work with their institutional resources, such as librarians, to ensure they do not submit to any journals without reviewing evidence-based criteria for avoiding these titles.

      Interesting idea. Librarians will probably have insights on the trustworthiness of journals.

  31. Aug 2017
    1. SPRQL queries

      I believe the standard is called SPARQL. I can't believe the authors consistently use the wrong name.

    1. try and click on a very common sentence, e.g. “the experiments were performed as previously described”. In essentially every single case today, nothing happens, while in the demo in 1968, it would have taken the reader to a document describing the experiments.

      That is a great example. I guess one reason that this doesn't work is that the software people use for writing makes it unintuitive to (cross)reference. But of course the authors should be encouraged to want to do this in the first place and they aren't.

    2. nobody seems to care about the software we write to transform the bits and bytes of the raw data into the flat, pixel-based images.

      Some of us do…

    1. For example, the US Fair Credit Reporting Act requires that agencies disclose “all of the key factors that adversely affected the credit score of the consumer in the model used, the total number of which shall not exceed 4.” It’s difficult to satisfy this regulation if your credit model is a deep neural network.

      :/

    1. in the end all scientists should have an interest in improving scientific practice

      yes!

    2. Having recently had the displeasure of experiencing firsthand in my own life how the news media operate

      What is the author referring to here?

  32. Jul 2017
    1. I considered a number of annotation tools. After trying many and consulting with colleagues and the distance learning team at FSU, I chose Hypothes.is. It offered stability, good reviews, open source, creative commons, the option of private annotation, a clean and inviting interface, and flexible possibilities for the course and beyond.

      There are tools that can be installed on university servers and that should work in much the same way as Hypothes.is. I wonder if these were considered?

    2. With a large class, some responsibility must fall on the student to follow directions and ask for help.

      I'm not a teacher, but yes, I agree that students (they're usually adults) have responsibility to follow directions – also to be critical of them, and ask for help when the directions are unclear.

    3. Some websites do not integrate smoothly with the Hypothes.is shell.

      That is indeed problematic.

    4. I worked to establish trust and community so that we could respectfully disagree upon difficult matters.

      One would hope university students learn to debate respectfully and using arguments that go into the contents, not ad hominem 'arguments'.

    5. many students used their own names

      On the wider web, this does not appear to be a reason for refraining from making offensive remarks.

    6. The staff at Hypothes.is was extremely helpful in deleting these false pointers once students had re-posted their annotations in the correct location.

      I understand the students were having trouble already, but they should be able to edit their annotations and put them in the correct channel themselves, shouldn't they?

    7. Given my students’ lack of familiarity with annotation software, each of these roles was distinctly necessary. I offered instructions and advice via assignment guidelines, announcements, FAQs with screenshots, personal emails, video conferencing, and meeting with students personally in office hours. The staff at Hypothes.is emailed with students, and Jeremy Dean provided a Student Resource Guide and video tutorial.

      That is quite a bit of work!

    8. Using Hypothes.is on Victorian texts lends itself to focused annotation along a number of axes: (1) Historical: highlight a literary, medical, judicial, or historical reference and discuss its history; (2) Linguistic: highlight one word and discuss its history, usage, and connotations; (3) Literary: highlight a phrase or sentence to discuss using particular literary or analytic concepts discussed in class (metaphor, free indirect discourse, irony, etc); (4) Ethical: highlight a sentence that represents an ethical choice in this text and discuss the costs of that choice; (5) Multimedia: highlight a phrase or sentence and link to a visual, audio, or video clip with a brief explanation of the connection you see.

      Good – a guide for what aspects of texts to annotate.

    1. The authors provide a clear idea on how data that is not accessible in RDF per se can be made interoperable using RML and TPF. However, their suggestion that the presented solution uses the LDP specification beyond the shared use of the term "Container" seems inadequate.

    2. Within the LDP specification is the concept of an LDP Container. A basic implementation of LDP containers involves two “kinds” of resources, as diagrammed in Fig. 1. The first type of resource represents the container—a metadata document that describes the shared features of a collection of resources, and (optionally) the membership of that collection. This is analogous to, for example, a metadata document describing a data repository, where the repository itself has features (ownership, curation policy, etc.) that are independent from the individual data records within that repository (i.e., the members of the collection). The second type of resource describes a member of the contained collection and (optionally) provides ways to access the record itself.

      It is a bit confusing that the authors project their resource types (Container and MetaRecord) onto the LDP specification, which does not specify the MetaRecord type. It would have been clearer if the authors had just mentioned LDP as inspiration.

    3. if there is an algorithm capable of extracting it and exposing it via the TPF interface.

      I wonder how much new users should know about the open world assumption and handling conflicting statements. If this takes off, we might see many triples that 'do not compute'. Just a thought, not criticism towards the paper.

    4. without the need to define an API

      …a new API

    5. The FAIR Projector, in this case, is a script that dynamically transforms data from a query of UniProt into the appropriately formatted triples; however, this is opaque to the client. The Projector’s TPF interface, from the perspective of the client, would be identical if the Projector was serving pre-transformed data from a static document, or even generating novel data from an analytical service.

      Given the premise that TPF endpoints are more scalable than SPARQL endpoints, using (dynamic) TPF endpoints makes sense.

    6. Calling HTTP GET on the URL of the FAIR Projector produces RDF triples from the data source that match the format defined by that Projector’s Triple Descriptor. The originating data source behind a Projector may be a database, a data transformation script, an analytical web service, another FAIR Projector, or any other static or dynamic data-source.

      Cool idea!

    7. We propose, therefore, to combine three elements—data transformed into RDF, which is described by Triple Descriptors, and served via TPF-compliant URLs. We call this combination of technologies a ”FAIR Projector”.
    8. An RML map describes the triple structure (subject, predicate, object, abbreviated as [S,P,O]), the semantic types of the subject and object, and their constituent URI structures, that would result from a transformation of non-RDF data (of any kind) into RDF data.

      So this could be used instead of the JSON schema definition for tabular data that was recommended in the Tabular data on the Web efforts.
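
      A minimal RML mapping looks like this (my sketch; the file and column names are hypothetical):

      ```turtle
      @prefix rr:  <http://www.w3.org/ns/r2rml#> .
      @prefix rml: <http://semweb.mmlab.be/ns/rml#> .
      @prefix ql:  <http://semweb.mmlab.be/ns/ql#> .
      @prefix ex:  <http://example.org/> .

      <#ProteinMap>
        rml:logicalSource [
          rml:source "proteins.csv" ;
          rml:referenceFormulation ql:CSV
        ] ;
        rr:subjectMap [ rr:template "http://example.org/protein/{id}" ] ;
        rr:predicateObjectMap [
          rr:predicate ex:name ;
          rr:objectMap [ rml:reference "name" ]
        ] .
      ```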

    9. we only require read functionality

      You need to create the structure somehow, don't you? I think you're saying that creating FAIR Accessors is up to the reader?

    10. the LDP’s use of the Data Catalogue Vocabulary

      The LDP specification does not mention DCAT, so I'm not sure what is meant here. Do the authors imply there is a connection between the standards or merely that it is possible to combine the two?

    1. What would it be like if that was all there was—structures meant to bring people and students together for as long as a methodology remains useful or a question remains interesting? Such entities would be born like centers—born with all the excitement and possibility of not knowing what you’re doing—of having to learn from each other what the methodologies and questions are really about.

      I like this idea.

    2. we’d have to change the names of the degrees to something vague, like “Bachelor of Arts” or “Doctor of Philosophy.”

      :D

    3. The goal of any new theory of libraries must of course accommodate the increasing needs in research and scholarship for large quantities of information, but should not preface quantity of information over all else. As important as the information itself is, providing and supporting an environment that allows for the transformation of that information into new knowledge is essential.

      Yes!

    4. The Wrong Business for Libraries

      This essay got me thinking, first that being scholar-centered is indeed something to strive for. But later I wondered if the library should be the place for a scholar to do everything – if so, why do we have the other parts of universities (faculties with labs, offices)?

    1. Most popular U.S. mobile apps, by unique monthly visitors

      Is WhatsApp not popular in the U.S.? In the Netherlands, 'whatsapp' is a verb.

    2. There are no professional standards on such disclosures in the research papers, which are mostly published in law journals at the universities.

      That surprises me. Many, if not most, scholarly publications disclose funding, because it increases transparency and (ideally) trust in the outcomes. Not disclosing corporate/private funding feels like a hidden agenda; often when it is later revealed, researchers face scrutiny for having hidden the information – which is the point of this article.

    3. Google has paid professors whose papers, for instance, declared that the collection of consumer data was a fair exchange for its free services;

      [citation needed]

    1. best practices for assisting digital humanists defined

      Is it possible to define best practices based on a single case study?

    1. We encourage style guides to update their recommendations for DOIs to use the full URL form.

      Agreed, unlinked identifiers are soooo… well, hard to use.

    2. The short form of the DOI for https://doi.org/10.5285/1D4D70AD-DC38-4E5F-BC39-066BABCA2FB2 is https://doi:10/bcc7.

      The second link does not resolve, as "Firefox can’t find the server at doi." Are you sure this is correct? The intended short DOI is probably doi:10/bcc7, which would resolve as https://doi.org/bcc7.

    1. Called “Sneak Peek,” it’s not exactly a preprint server (because scientists can only post if Cell Press has accepted their manuscript), and it’s not exactly open access (because you need to register for free to see them). But it does allow scientists to share work ahead of peer review and publication and brag about their high-profile placement at the same time.

      How does Cell accept manuscripts that haven't been peer reviewed yet?

    2. The Proceedings of the National Academy of Sciences won’t take papers that appear as preprints if they have a Creative Commons License

      Even though CC-licences (without the Non-Commercial clause) explicitly permit redistribution! Crazy! :)

    3. You may be wondering why scientists would even bother to publish in journals after they’ve posted a preprint—a system intentionally built to subvert the bottleneck of peer-reviewed publication. But the system of academic publishing and all the rewards built into it haven’t disappeared. Which means for now at least, biology careers aren’t made on bioRxiv. Traditional journals still hold the key to postdoc positions, tenure lines, and lab funding.

      This has been a key discussion point for quite a while. A related interesting history of journal publishing is The Guardian's longread on the business of scientific publishing.