108 Matching Annotations
  1. Last 7 days
    1. Key to DataCite service is the concept of a long-term or persistent identifier. A persistent identifier is an association between a character string and a resource. Resources can be files, parts of files, persons, organisations, abstractions, etc. DataCite uses Digital Object Identifiers (DOIs)(2) at the present time and is considering the use of other identifier schemes in the future.
  2. May 2022
  3. Apr 2022
    1. Feature request (implement something that allows the following): 1. From any page containing a bookmarklet, invoke the user-stored bookmarklet בB 2. Click the bookmarklet on the page that you wish to be able to edit in the Bookmarklet Creator 3. From the window that opens up, navigate to a stored version of the Bookmarklet Creator 4. Invoke bookmarklet בB a second time from within the Bookmarklet Creator

      Expected results:

      The bookmarklet from step #2 is decoded and populates the Bookmarklet Creator's input.

      To discriminate between invocation type II (from step #2) and invocation type IV (from step #4), the Bookmarklet Creator can use an appropriate class (e.g. https://w3id.example.org/bookmarklets/protocol/#code-input) or a meta-based pragma or link relation.

  4. Mar 2022
  5. Feb 2022
    1. Metadaten bilden die Grundlage zur Beschreibung und Erschließung von Semantik.

      Rolle von Metadaten



    1. Zur Sicherstellung der Effektivität und Effizienz in der Entwicklung, dem Betrieb und der Nutzung von BI-Systemumwelten ist ein adäquates Management der Metada-ten erforderlich [4].


    2. alle Informationen, die für den Entwurf, die Konstruktion oder Benutzung eines BI-Systems benötigt werden [13].

      Metadaten im BI-System

    3. Im Zuge stetig steigender Komplexität in Business-Intelligence-Systemen nimmt auch die Bedeutung des Metadatenmanagements (MDM) kon-tinuierlich zu.

      Warum Metadatenmanagement?



  6. Jan 2022
    1. search can’t be fixed solely by focusing on the interface or engine, because it depends upon the foundation of content and metadata, which in turn are shaped by governance, incentives, and metrics.

      Metadata supports search

    1. The matter is not sim-ply, as in the case of libraries and archives, handling the usually rather tricky language of the indexer,

      Modern digital indices have the ability to easily create aliases so that similar or related headings might be concatenated. As an example, I might have four different variations of R. Llull's name in my system or English and Latin versions of names like "excerpting" and "ars excerptendi" which can be mapped to the same endpoints without worrying about the existence of synonyms.

  7. Dec 2021
    1. as of February 2021, Europeana comprises 59%images and 38% text objects, but only 1% sound objects and 2% video objects.3 DPLA iscomposed of 25% images and 54% text, with only 0.3% sound objects, and 0.6% videoobjects.4Another reason, beyond cost, that audiovisual recordings are not widely accessible is the lack ofsufficiently granular metadata to support identification, discovery, and use, or to supportinformed rights determination and access control and permissions decisions on the part ofcollections staff and users.

      Despite concerted efforts, there is a minimal amount of A/V material in Europeana and DPLA. This report details a pilot project to use a variety of machine-generated-metadata mechanisms to augment the human description efforts. Although this paragraph mentions rights determination, it isn't clear from the problem statement whether the machine-generated description includes anything that will help with rights. I would expect that unclear rights—especially for moving image content—would be a significant barrier to the open publication of A/V material.

    1. Most of the metadata about our document lives in the head of the document, and it makes perfect sense. Most information about our bodies also live in our head.

      Metadata placement.

  8. Nov 2021
    1. Inside each first-level directory of a MAFF archive, the second-level directory named ^metadata^ (case-sensitive) is reserved and should not contain actual content. A file or folder named ^metadata^ (case-insensitive) should not exist inside any first-level directory.
  9. Oct 2021
  10. Sep 2021
    1. Bibleref is a simple approach to automatically identifying Bible references that are embedded in blog posts and other web pages. This enables search engines, content aggregators, and other automated tools to correctly label the references so they're more easily searchable. Bibleref is part of a general movement toward markup that expresses more semantic, rather than presentational, element.
  11. Aug 2021
    1. Some thoughts about leaving space in new notebooks, especially for one's future self:

      • contact information in front in case of loss
      • space for a future table of contents to come
      • space for page numbers and dates
      • space in the back for house keeping, indices, etc.


  12. Jun 2021
    1. One thing that should be learned from the bitter lesson is the great power of general purpose methods, of methods that continue to scale with increased computation even as the available computation becomes very great. The two methods that seem to scale arbitrarily in this way are search and learning

      This is a big lesson. As a field, we still have not thoroughly learned it, as we are continuing to make the same kind of mistakes. To see this, and to effectively resist it, we have to understand the appeal of these mistakes. We have to learn the bitter lesson that building in how we think we think does not work in the long run. The bitter lesson is based on the historical observations that 1) AI researchers have often tried to build knowledge into their agents, 2) this always helps in the short term, and is personally satisfying to the researcher, but 3) in the long run it plateaus and even inhibits further progress, and 4) breakthrough progress eventually arrives by an opposing approach based on scaling computation by search and learning. The eventual success is tinged with bitterness, and often incompletely digested, because it is success over a favored, human-centric approach.

    1. To get closer to attaining coveted “rich” metadata and ultimately contribute to a “richer” scholarly communication ecosystem, journals first need to have machine-readable metadata that is clean, consistent, and as interoperable as possible.

      What WordPress plugins provide structured metadata functionality for OpenPhysio.

  13. May 2021
    1. Despite the surprising lack of digital editions, the commonplace book, more than any other genre of writing, seems well suited to a digital format, since, by its very structure, it is a linked web of fragments that have been “coded” and “marked up” with metadata. For this reason, we have put much thought and planning into which tools to use and how design this digital edition.
  14. Apr 2021
    1. A modified timestamp signifies the last time the contents of a file were modified. A program or process either edited or manipulated the file. “Modified” means something inside the file was amended or deleted, or new data was added. Changed timestamps aren’t referring to changes made to the contents of a file. Rather, it’s the time at which the metadata related to the file was changed. File permission changes, for example, will update the changed timestamp.

      They shouldn't use synonyms for this (modified = changed).

      It would be clearer if the word that differed between the terms indicated what changed:

      • "content modified" time (cctime)?
      • "meta modified" time (cmtime)?
  15. Mar 2021
  16. Feb 2021
  17. Oct 2020
    1. they co-operate with law enforcement by providinglawful access to encrypted communications and engage in consultation with governmentsand other stakeholders to facilitate legal access in a way that is substantive and genuinelyinfluences design decisions

      So DHA wants tech companies to provide the government access to the contents of encrypted communications. -- I'm not sure how this can be done without putting inadvertent vulnerabilities in.

      Maybe a solution is for tech companies just pass metadata to law enforcement, similar to the way telecommunications companies do. The tech companies could be put in the same bucket at telco companies and be subject to Australia's data retention obligations. https://www.homeaffairs.gov.au/about-us/our-portfolios/national-security/lawful-access-telecommunications/data-retention-obligations

    1. I’m iffy on the value of that metadata - whether schema.org-style annotations have any value. Semantic web has a real religious bent to it that I don’t feel. I want to see the implementations and the full working systems, and I think it’s been long enoough since the introduction of RDFa, Microdata, and JSON-LD that we should be seeing practical, real uses of them. And I’m just not seeing those uses.

      These are some valid points.

      I have been seeing some interesting use cases for microformats getting stronger recently.

    1. If everyone would subscribe to such a system and create good metadata for the purposes of describing their goods, services and information, it would be a trivial matter to search the Internet for highly qualified, context-sensitive results: a fan could find all the downloadable music in a given genre, a manufacturer could efficiently discover suppliers, travelers could easily choose a hotel room for an upcoming trip. A world of exhaustive, reliable metadata would be a utopia. It's also a pipe-dream, founded on self-delusion, nerd hubris and hysterically inflated market opportunities.

      Apparently this also now applies to politics and democracy too.

  18. Sep 2020
  19. Jun 2020
    1. There were also underlying security issues. Most of the messaging apps Tor Messenger supported are based on client-server architectures, and those can leak metadata (such as who's involved in a conversation and when) that might reveal who your friends are. There was no real way for the Tor crew to mitigate these issues.
    2. Tor suggests CoyIM, but it's prone to the same metadata issues as Messenger. You may have to accept that a small amount of chat data could find its way into the wrong hands, even if the actual conversations are locked down tight.
    1. Of course, with Facebook being Facebook, there is another, more commercial outlet for this type of metadata analysis. If the platform knows who you are, and knows what you do based on its multi-faceted internet tracking tools, then knowing who you talk to and when could be a commercial goldmine. Person A just purchased Object 1 and then chatted to Person B. Try to sell Object 1 to Person B. All of which can be done without any messaging content being accessed.
  20. May 2020
  21. Apr 2020
    1. api-version A query string parameter, indicating the API version for the IMDS endpoint. Please use API version 2018-02-01 or greater.

      Couldn't find where the exhaustive list of API versions are listed, but found this on the Azure Instance Metadata Service (aka IMDS) page:

      2017-04-02, 2017-08-01, 2017-12-01, 2018-02-01, 2018-04-02, 2018-10-01, 2019-02-01, 2019-03-11, 2019-04-30, 2019-06-01, 2019-06-04, 2019-08-01, 2019-08-15


      The version 2019-11-01 is currently getting deployed and may not be available in all regions.

  22. Mar 2020
  23. Dec 2019
    1. St John's Medical College and St John's Research Institute

      As far as I can tell, the two are (nowadays at least) separate entities, so why are they listed under the same affiliation?

    1. a brief but adequate description for the items in the SDR content management system and on PURL pages associated with the items

      are these the splash pages? What are these PURL pages?

  24. Nov 2019
    1. We’re excited to announce that The Francis Crick Institute have partnered with our portfolio companies Symplectic and Figshare to enable richer profiles for their researchers and to make it easier for them to publish their papers and data Open Access.

  25. Oct 2019
  26. Sep 2019
  27. Jul 2019
    1. Every time your child opens the email, that person knows generally where they are (or specifically, if they have other info to triangulate against).
  28. Jun 2019
    1. What are the risks in digitizing these materials outside the library? Are there possibilities of this collection not being well integrated with other holdings? Did you collaborate with metadata experts?

  29. Apr 2019
  30. Nov 2018
  31. Oct 2018
    1. Written while Owens was a digital archivist at the Library of Congress, this tutorial introduces the idea of glitching—intentionally corrupting a digital artifact—as a kind of playful deformance. Such intrusive digital interventions can serve a forensics purpose by exposing underlying metadata, but they can also defamiliarize the digital media in question. Owens walks through the glitching of music and image files. This process could be easily expanded to include other media types, including audiobooks, PDFs, and EPUBs, all very much of interest in a literature classroom.

      definitely deformative

  32. Sep 2018
    1. This [html-extensions] specification defines new HTML attributes to embed simple machine-readable data in HTML documents.
  33. Aug 2018
    1. Our results demonstrate the promise of spatio-temporalfiltering techniques for“tuning”measurement of hazard-related rumoring to enableobservation of rumoring at scales that have long been infeasible.

      Result: Claims spatio-temporal filtering technique to better capture signal from noise of large crisis data sets.

    2. Rumoring about disaster events is typically temporally and spatiallyconstrained to places where that event is salient. Accordingly, we use spatio and temporalsubsampling to increase the resolution of our detection techniques. Byfiltering out datafrom known sources of error (per rumor theories), we greatly enhance the signal ofdisaster-related rumoring activity. We use these spatio-temporalfiltering techniques todetect rumoring during a variety of disaster events, from high-casualty events in majorpopulation centers to minimally destructive events in remote areas.

      Method: Used spatio and temporal subsampling. How is that defined? Is the sample constrained to users' that geolocate tweets?

    3. Measuring features of informal communicationetiming, content, locationewith any degree of precision has historically been extremelychallenging in small studies and infeasible at large scales

      Acknowledges the difficulty of collecting sensemaking attributes, like temporality, in informal user-generated content during a disaster.

      Note: claims collection of precise spatial and temporal metadata. How is that defined?

  34. May 2018
  35. Apr 2018
    1. he odd result of this is that we have trending topics in networks like Twitter and Facebook, where the vast majority of updates are short and trivial, but we don’t have easily-explorable tags, hashtags or trending topics for articles and stories that are longer and more substantive.

      Another critical point.

    2. It’s striking that we likely have author information provided as metadata on the majority of articles published today, but almost none of our reading tools expose this information in useful ways, or let us search or explore using the metadata.

      This is a really really important point.

  36. Mar 2018
    1. process of annotation respects and renders visible the experiences of those people whose lives have been documented by state organizations and thus can build trust with Indigenous peoples.

      Existing collections such as North American Indian Thought and Culture could greatly benefit from this sort of retroactive description. For collections that are already owned, coordinated efforts by those with purchasing/institutional power would be needed. Also conversations "on the trade show floor" with companies such as Gale and Adam Matthews would be beneficial.

      Example: the work of Edward S. Curtis / and the digital repatriation model (referenced on p. 33)

  37. Nov 2017
  38. Jun 2017
    1. Hypothesis now uses DOIs to join variants of the same document in the same way it uses PDF fingerprints. Both pieces of metadata — the DOI, and the PDF URL — are typically included in HTML metadata.

      This is really cool! Thanks for the great explanation!

  39. Nov 2016
  40. Sep 2016
    1. Many have argued that minimalist metadata (such as DC Core), is not only easier to create than IMS/IEEE descriptions, but also more cost-effective. One question the iLumina project is addressing is whether this is true.

      I only think it's true if you don't think too hard about the Dublin Core definitions.

  41. Jun 2016
    1. Can you avoid metadata? No, not at all. Sorry, not at all, anymore. I think you can even generate more metadata from existing data. Just think about living near a sightseeing building or something like that, how many tourists took holiday pictures and published them on Facebook. How many of these pictures contains your face? Then, one day, somebody will create a facial recognition algorithm and gather even more metadata about you.
  42. Apr 2016
  43. Sep 2015
    1. git-annex provides file tagging, and tag-based views materialized as filesystem checkouts a git-based data store.

  44. Aug 2015
  45. Jul 2015
    1. I have used the bibliographies to conduct my own research in the area of cataloging assessment, and the social justice bibliography has helped me with a project I’m working on to examine video classification practices.

      A lot of my research involves digital library/digital repository assessment, and the assessment literature in that area also relies heavily on quantitative measurements of assessment. I'm very interested in seeing the cataloging + social justice bibliography and if it can help my digital library assessment research.

    2. I began to wonder if by merely assessing the mechanistic aspects of cataloging work we were missing out on an opportunity to include broader social concepts in our assessment and planning processes

      yes, this! I'm really interested in this

  46. May 2015
  47. Apr 2015
  48. Nov 2014
  49. Mar 2014
    1. mEDRA has been created as a solution for the need of the publishing industry to be supported in the technological innovation process by professionals with a deep understanding of the specific requirements and dynamics of the sector, that could provide specialised technological solutions and expertise.

      mEDRA brings toghther publishing and tecnology experise. This is an effective approach to digital disruption.

  50. Jan 2014
    1. In addition, the results imply that there is a lack of awareness about the importance of metadata among the scientific community –at least in practice– which is a serious problem as their involvement is quite crucial in dealing with problems regarding data management.

      Is there any reasonable agreement about what the term metadata means or includes? For example, how important is the unit of measure to scientists (feet vs meters) and is that information considered metadata or simply an implied part inherent in the data itself?

    2. Adding descriptive metadata to datasets helps makes the dataset more accessible by others and into the future. Respondents were asked to indicate all metadata standards they currently use to describe their data. More than half of the respondents (56%) reported that they did not use any metadata standard and about 22% of respondents indicated they used their own lab metadata standard. This could be interpreted that over 78% of survey respondents either use no metadata or a local home grown metadata approach.

      Not surprising that roughly 80% use no or ad hoc metadata.

  51. Dec 2013
    1. canonical URL

      In W3C parlance, this is probably the "shortname", like w3.org/TR/SVG/.

  52. Aug 2013
    1. 書協DBに登録したデータをJPO近刊情報センターに転送できます。ちなみに逆(JPO近刊情報センターに登録したデータを書協DBに転送)も可能だったりします。なんかこのあたりごっちゃになってる方が多いようなので9月に未参加の出版社向け説明会を開催しようと思ってます。 http://t.co/OFXTZG0hsE www.jbpa.or.jp