- May 2015
-
www.semantic-web-journal.net www.semantic-web-journal.net
-
The model uses the main clas- ses of Darwin Core.
Figure 1 doesn't make it evident which classes are from DwC and which ones are not. This should be made visually clear in the figure, and mentioned in the figure legend.
-
with an outline of the model suggest- ed by Richard Pyle
Acknowledgement of individual contributors and groups of contributors (such as mailing lists) whose work didn't rise to the level of co-authorship should all be moved to the Acknowledgements section. In the middle of the text they distract.
-
This approach differs significantly from that taken in the
How so? It it's important, it should be expanded.
-
DSW sees class instances as nodes that group related properties rather than as enti ties that are heavily constrained ontologically
I'm not sure I understand what is meant by this. I question though whether the authors have as much choice in "seeing" class instances as the sentence could be made out to say. If this is an OWL ontology (which I understand it is), and instance data are expressed in OWL, then class instances are owl:Individuals, which are entities, not "nodes". I.e., this statement makes me wonder whether the authors' view of their instance data might be in conflict with the very entailments of their OWL ontology.
-
is to facilitate the linking of real data
In contrast to unreal or theoretical data? Probably not. Express more directly what the motivation or driving goal is.
-
In many cases, URIs identify real resources in the wild, although example triples composed of those URIs are not necessarily asserted there.
This doesn't make much sense to me. What are the authors trying to say, and why is it important?
-
Those instances can serve as anchor points to which resources outside the biodi- versity community may be linked.
Need to explain why this can be a useful thing to do.
-
The Darwin - SW model
The readership of SWJ should not be assumed to have any biodiversity domain knowledge, and hence I suggest that as the first subsection the authors briefly introduce the salient aspects of a canonical biodiversity observation.
-
Instances of key classes within the biodiversity domain (such as organisms, specimens, and taxa) within or among institutions can be linked using DSW - defined object properties that have no ana- logues in D arwin Core.
This will likely be very confusing to anyone not intimately familiar with biodiversity domain knowledge. Also, it's not obvious why these properties are missing from DwC.
-
the choice was made
Assuming that it was the authors who made this choice, use active voice.
-
However, even with the RDF guide, Darwin Core does not provide object properties to link instances of its main classes
This needs expanding - obviously it's one of the main motivating use cases for DSW.
-
It was developed in 2010 - 11 to meet an immediate need in the Bioimages 22 and Xmalesia 23 projects
I suggest to expand this - it could for the motivating use-cases from which to hang and unroll the unmet challenges, and consequently the goals behind developing DSW.
-
Introduction
This section needs to be brought into a proper narrative. In its current form, especially following there is much to draw from the text is more like an enumerated sequence (which isn't even strictly chronological) and lacks a narrative thread that keeps a reader oriented.
-
seria l- ized as
expressed as?
-
and lack of funding made their maintenance difficult .
Doesn't this apply to any standard, or more broadly informatics product, including the presented here?
-
was known
became known?
-
A central part of this architecture was the creation of an ontol- ogy to facilitate the integration of standards .
What standards? Surely not any standard?
-
However, progress towards exposing biodiversity data as RDF has been ham- pered by the lack of a consen sus graph - based model for the biodiversity domain
Isn't the reason more the debate over consensus semantics?
Tags
Annotators
URL
-
- Mar 2014
-
lappland.io lappland.io
-
provide a URL interface for showing annotations on top of the online text that was annotated
Slightly reworded from initially posted version to be less confusing. Thanks @dwhly.
-
-
f1000research.com f1000research.com
-
To paraphrase William Gibson
I notice that it's not clear to what exact piece or event to source this. Perhaps still link to the William Gibson's Wikiquote page, which includes the quote and its provenance?
-
We do our best to cover these ideas in our instructor training program, but are less good about actually applying them in our workshops.
Is there some insight available into why instructors find it difficult to apply what they have been taught? Is it the imparting of these ideas that needs improvement, or are the ideas not as applicable in SwC as they were thought to be, or is there simply heterogeneity in that some ideas are much easier to apply than others? If the latter, which ones fall into which category?
-
though we hope that will diminish as we organize and recruit along disciplinary lines instead of geographically
Aren't you arguing above that diversity of backgrounds and starting skills is a constant challenge? It didn't seem from earlier arguments in the text that simply recruiting along a uniform discipline will address this problem.
-
We also don’t give them as diverse a range of exercises as we should, and those that we do give are often at the wrong level.
How do you know that this is the case? From feedback alone, or are there other kinds of observations or evidence?
-
However, we do need to be more systematic about varying our content on purpose rather than by accident.
As a reader, I feel left hanging by the section ending with this statement. Are there ideas about how this could be done, and to begin with, what were some of problems encountered with the less systematic approach being practiced now? (The preceding text seems to only cite advantages.)
-
xUnit-style framework
I'm embarrassed to ask what's an xUnit style framework. Spell out what that is, and/or add a reference or URL?
-
don’t as good a job
Insert "do" after don't.
-
The fact that this is also true of most high-profile MOOCs is little comfort.
If your goal is a high rate of retention and completion, that is. However, widening reach could also be a worthwhile goal. If a single MOOC reaches 10,000 students instead of 800 students reached by 20 physical SwC workshops, even a completion rate of only 10% will still have taught more students with the single MOOC than with the 20 physical workshops. MOOCs clearly aren't a panacea, and they may indeed be ill-suited to the learning objectives of SwC, but that and why this is so needs a little more depth to be convincing.
-
naive
Is this meant to be "native"?
-
no one knows how to measure the productivity of programmers, or the productivity of scientists
I think this assertion needs better qualification to be really justified. Obviously, several ways to assess programmer productivity, and also scientist productivity, exist. Hiring and tenure committees regularly assess productivity of scientists. Arguably, the ways this is usually done suffers from various problems such as failing to encompass the full spectrum of products resulting from a scientist's work. Perhaps the author means that it is some of these shortcomings of current productivity assessment methods that effectively prevent measuring the productivity impact of SwC's teachings, but that needs to be spelled out better.
-
Many of our instructors also teach regular university courses, and several of them are now using part or all of our material as the first few lectures in them.
Isn't this somewhat contradicting some lessons learned stated earlier, which seemed to say that for several reasons the SwC curriculum faces impossibly high barriers for integration into university curricula, at least in the current environment. If contrary to expectation this has now become possible, can something be learned from the cases where it has been successfully integrated?
-
DiRAC
Spell out. Also, how about a URL?
-
The (considerable) downside is that it can be quite difficult for newcomers to contribute material; we are therefore working to streamline that process.
This needs some qualification to fully make sense as following from the preceding sentence. If the tools and approaches SwC teaches are good ones that "work", and SwC uses those tools and approaches itself, how can this be a downside, presuming that those able to contribute material are in fact familiar with those tools and approaches. I can imagine some ways in which this can still be a downside, but for clarity this should be spelled out better.
-
As this is usually several times more than a small registration fee would bring in, we usually choose the higher no-show rate as the lesser evil.
The biggest problem of a significant rate of no-shows is probably the fact that due to the space limitations other students who would have and benefitted from the course had to be denied because of the no-shows taking the space away. Have other possibilities to deter no-shows been explored (and if so, how effective have they been found)?
If the no-show rate is somewhat predictable (and it sounds like it is), then wait-listed students could be told to show up anyway on the day of the course, because there would likely be enough no-shows to make room for them. Has this been tried, and to what extent does it work?
-
What does require permission is use of our name and logo, both of which are trademarked. We are happy to give such permission if we have certified the instructor and have a chance to double-check the content, but we do want a chance to check: we have had instances of people calling something “Software Carpentry” when it had nothing to do with what we usually teach. We’ve worked hard to create material that actually helps scientists, and to build some name recognition around it, and we’d like to make sure our name continues to mean something.
This whole paragraph doesn't mention the words "brand", "brand recognition", and "brand reputation"; yet it is essentially about those concepts, isn't it? Why not say it directly?
-
with the best
Insert "even" before with.
-
now that we have enough instructors to be able to specialize
It's probably not just a question of having instructors, but also of having demand for (and thus acceptance of) the SwC curriculum as useful in increasingly many disciplinary areas.
-
While some people feel that using R instead of Python is like using feet and pounds instead of the metric system
I have heard concerns and objections some people have with R's syntax and way of doing things. But every language (including Python) has its detractors, and I don't think the particular concerns with R are necessarily widely known let alone understood. So I would suggest to either delete this clause (is it really needed for the argument?), or if chosen to be left in place, to substantiate it, at least by giving a reference to a fuller discussion of R's problems.
-
80–90% of attendees typically report
What does typically mean? 80-90% of all SwC enrolled students, or on average 80-90% of those enrolled in a workshop? I.e., how much variance is there between workshops?
-
have grown steadily (Figure 1 and Figure 2).
The figures suggest a tapering off in the recent past. Is this more likely a fluke due to limited or censored data, or is there a trend showing?
-
Hopefully these two effects more or less cancel out and should should not detract from the overall trend displayed.
Hope is nice but not a good basis on which to base scientific conclusions. Do you have evidence that suggests that neither fraction of people is significant with respect to those enrolled and attending both days? Evidence that both fractions of people has stayed relatively constant over time, and not changed more recently?
-
Enrolment
Typo (one instead of two 'l')
-
Switching to a “host site covers costs” model was equally important: funding is still needed for the coordinator positions (the author and two part-time administrative assistants at Mozilla, and part of one staff member’s time at the Software Sustainability Institute in the UK), but our other costs now take care of themselves.
I'd find it really useful to spell this out a little more. What are our other costs? Instructor travel and expenses, room rental? What tasks do the coordinators perform, how does this scale? Or in other words, presumably there is a division between costs of operating that benefit from economies of scale, and those that do not. More insight into this division would be quite helpful as a lesson learned.
-
This time, the model was two-day intensive workshops
I'm curious as to why 2 days. The lessons learned stated earlier seem to say that attention drops after 3 days, not 2 days. Why was the decision made to shorten to 2 days, not 3 days?
-
The Hacker Within
Is there no link or other reference available?
-
and backing from the Mozilla Foundation
The difference in wording suggests that the Mozilla Foundation's backing didn't come in the form of a grant. Can it be spelled out (at least broadly) what that support consisted of?
-
The biggest take-away from this round was the need come up with a scalable, sustainable model. One instructor simply can’t reach enough people, and cobbling together funding from half a dozen different sources every twelve to eighteen months is a high-risk approach.
For readers who aren't already fully on board with this, It would help to better set up the argument. Why is scaling up the model desirable or necessary? What is enough people? Couldn't funding also come from a single or few sources? Many courses are sustained by student tuition; how would this likely not work for SwC?
-
Most importantly, the MOOC format didn’t work
I think it's worth to qualify this statement in respect to the goals. As the paragrapgh goes on, it could be said that In some definition the MOOC format has worked (for example, compared to retention and completion rates of other MOOCs); the failure that the author reports presumably means chiefly that the goals laid out for a SwC course weren't met by the MOOC format.
-
Open access publishing, crowd sourcing, and dozens of other innovations had convinced scientists that knowing how to program was now as important to doing science as knowing how to do statistics.
Is there evidence or references for the factors the author enumerates constituting the major driving causes? More specifically, the list is conspicuously missing the explosion of data that had swept, and has continued to sweep into almost every scientific discipline. Data richness is enormously powerful for science, yet wrestling insight from it at this scale invariably and pervasively requires computational processing. Maybe this is part of the dozens of other innovations, but I would still argue that the data deluge has constituted a primary rather than a marginal driver of this landscape change.
-
MOOC
Spell out at first use.
-
The author rebooted Software Carpentry in May 2010 with support from Indiana University, Michigan State University, Microsoft, MITACS, Queen Mary University of London, Scimatic, SciNet, SHARCNet, and the UK Met Office.
The backstory to what motivated (or necessitated?) the large consortium of funders is missing here. However, given the last paragraph in this section, it seems there would be interesting aspects of it that would help make setting up the argument. Does the large consortium reflect primarily wide buy-in to SwC's utility, or primarily the difficulty of obtaining enough funding from any one institution or partner? The last paragraph suggests it's the latter, but it's not clear.
-
The sweet spot for this kind of training is therefore the first two or three years of graduate school. At that point, students have time (at least, more time than they’ll have once they’re faculty) and real problems of their own that they want to solve.
Perhaps it's primarily the real problems of their own that provide the motivation for having the time (to learn about addressing them). I.e., percentage-wise, how many students does SwC get today who take the course primarily because they have time, and who do not yet have real problems of their own for which they hope to learn solutions?
More importantly perhaps, does this not also point out a path for justifying the inclusion of SwC-inspired teaching units into undergraduate CS curricula? While for some (or most?) academic research career paths the relevance of version control mastery is perhaps less obvious, it's a qualification nearly all of industry ask of CS graduates applying for a software engineer position.
-
This is partly because educators’ preferred file formats (Word, PowerPoint, and PDF) can’t be handled gracefully by existing version control systems, but more importantly, there simply isn’t a “culture of contribution” in education for projects like Software Carpentry to build on
I'm not convinced that one isn't mostly or entirely a consequence of the other. Open source and collaborative development also was far less widespread in scientific software development before many of the barriers to that were significantly reduced by distributed version control such as Git, and usability and social coding focused resources such as Github. If the tools and file formats that are most widely used are simply refractory to collaboration, it's not a surprise if then a culture of collaboration is rare.
-
As long as universities reward research first, and supply teaching last, it is simply not in most computer scientists own best interests to offer this kind of course.
If this is the main driver behind this kind of course not finding interest at university CS programs, is the situation then different at teaching-focused schools, such as small liberal arts colleges? There are small liberal arts colleges with strong CS programs; have they indeed been more welcoming to adopting SwC into their curricula?
-
It works too well to be interesting
Based on context, it would be the SwC workshop or material. I suggest to reword so it is clear that it actually refers to the practices and tools being taught by SwC.
-
In the absence of an institutional mechanism to offer credit courses at some inter-departmental level, this course, like many other interdisciplinary courses, fell between two stools.
Perhaps this would be beyond the scope of the paper as a commentary, but it would be interesting to see whether this is then different at decidedly interdisciplinary programs, for example programs interfacing computational biology / computer science / math.
-
(even though a significant minority of their students, particularly those coming from non-CS backgrounds, have no more experience of practical software development than the average physicist
Remove parentheses.
-
In response, John Reynders (then director of the Advanced Computing Laboratory at Los Alamos National Laboratory) invited the author and Brent Gorda (now at Intel) to teach a week-long course on these topics to LANL staff. The course ran for the first time in July 1998, and was repeated nine times over the next four years.
I suggest that the authors highlights the major ways in which these courses differ from the SwC courses run today. As written now, deducing that from the two lessons learned is left as an exercise to the reader, and only those already familiar with SwC will know that indeed today's SwC workshops do differ in these ways.
-
(then director of the Advanced Computing Laboratory at Los Alamos National Laboratory)
Change parentheses to comma. The parenthetical phrase is important to make sense of the sentence. (And if similar contextual information can be given about Brent Gorda, i.e., information that helps to understand why he was invited, I suggest that be added to, as his current affiliation fails to explain that.)
-
i.e., to parallelize complex programs
This seems more an example to me than a restatement of run before they could walk. Thus, this should be e.g. (or spelled out for example).
-
many researchers still find it hard to apply what we teach
Researchers at-large, or researchers who participated in a Software Carpentry workshop?
-
learning, and applying
Assuming that learning binds to at least some of what we taught as well, the comma is extraneous. Or add a comma after and applying.
-
rarely if ever shown how to design a maintainable program in a systematic way
Are they not in fact taught, even if only indirectly, that programs are typically not revisited again once passed on (to the course instructor, for example), and hence thinking about maintainability is wasted effort?
-
desktop majority
Do you mean the complement to those doing HPC? The phrase strikes me as needlessly cryptic. And are you sure that scientists developing HPC software are exempt from the trend you describe?
-
hardware and algorithms are only two sides of the iron triangle of programming
Is there a reference for this form of the Iron Triangle? Googling the phrase only turns up the well-known project management Iron Triangle, and its adaptation to software projects. The latter has Resources, Scope, and Time at its corners, not hardware, algorithms, and programming.
Tags
Annotators
URL
-
- Oct 2013
-
www.biodiversitylibrary.org www.biodiversitylibrary.org
-
A pineal foramen is absent
-