18 Matching Annotations
  1. Mar 2023
    1. Different people may have different perspectives on provenance, and as a result different types of information might be captured in provenance records.

      It would be nice to have three examples, on the different views

    1. The answer to the question, “when should I use RDF-Star and when shouldn’t I?” is actually right in the RDF-Star working group charter; it is for making statements about statements, and that’s all. It’s not for making statements about things in your domain, like flights, doctor visits, procedures, or costs.

      Example for statements about statements would be nice

    2. One of the issues that inhibits adoption of RDF is the the perception that it is lacking somehow in representational power.

      What is the source for that statement. RDF is build upon the second most simple data structure and can be used to represent anything. The complexity of expression might be a concern but not the representational power

      Key Value store are assumed to be the simplest strucutres

    3. One of the book club members made an interesting comment; he asked why Michael hadn’t modeled this in the ‘obvious’ and ‘simple’ way, by saying something like:

      This is in fact a simplified view that we want to get, but if we ever need to store more information this simplified version becomes hard to use

    4. With RDF-Star, we add in the ability to make statements about statements. What sorts of things do we have to say about statements? We could tell “who claims this statement is true?” or “at what point in time was this statement true?” or even “when did we learn about this statement?”.

      This is regarded meta data about statements and could also be linked to PROV. In fact all those examples can already be solved with reification

    5. Using RDF-Star simply encouraged poor modeling practice, of ignoring the everyday name for something (a “flight”) and using a statement in its place. And when I say “everyday name”, for those of you who do enterprise data modeling, you should read “business terminology”, since in an enterprise setting, these terms are the terms that the business uses (who do you think taught me a word like “offer”? A data modeler? Or an investment banker? Or William Shatner?).

      This is the most important message of this post.

      But sometime there is no way to name something or there is no agreed name for. In the end it is not important how the triple store handles the data, but how the users are able to query it.

    6. If at this point you have decided that modeling a visit isn’t such a bad thing to do, you might be wondering how to find notions like “visit”, that appear in our everyday discourse, and know when those are things you should be modeling. If you are wondering this, you’re in luck; Michael’s book (linked above) needs to go on your reading list.

      This is the major concern, especially in foreign domains. The domain "inhabitants" generally adjust their language to include and welcome foreigners but you as a data engineer will miss the nuances that you need for modelling

    7. One more triple doesn’t seem like a lot, but if you do this for every statement, that’s twice as many triples. And that means twice as much space, twice as much to think about, twice the overhead.

      Space should not be the concern of modelling but the concern of optimization and enclosed within the triple store

      If we start to introduce optimization on this general level we will create complex graphs that shift the cognitive load onto humans instead of shifting the load onto the machines

    8. So what is RDF-Star? According to the working group charter, it is “[an extension of] RDF and SPARQL related recommendations, with the ability to concisely represent and query statements about statements.” In short, it is a way to include triples whose subject or object are themselves references to RDF triples.

      RDF star is syntax sugar for reification

  2. Feb 2023
    1. exc:compose1 prov:qualifiedUsage [ a prov:Usage ; prov:entity exg:dataset1 ; prov:hadRole exc:dataToCompose ] .

      This creates a blank node and blank nodes make working without a reasoner quite hard. As different triple stores generate different IDs

    2. article

      article1

    3. regionList

      regionList1

    4. role

      Activity

    5. Provenance can be used for many purposes, such as understanding how data was collected so it can be meaningfully used, determining ownership and rights over an object, making judgements about information to determine whether to trust it, verifying that the process and steps used to obtain a result complies with given requirements, and reproducing how something was generated.

      Use of provenance

  3. Aug 2022
    1. There are things out there but only the academic who worked on it knows how to make it work. The documentation is usually non-extant and trying to figure things out can take a lot of precious time.

      This is definitely the pet peeve No.1. Almost all tools are only maintained as long there is some kind of funding. Funding often discourages the use of existing tools and forces you to develop new ones

  4. Jan 2022
    1. Inputs

      This whole section is quite confusing. Ans should be rewritten and all examples tested.