On 2015 Sep 22, Kenneth Witwer commented:
Dr. Dweep's response is appreciated but does not engage our suggestions or explain why the miRWalk validated target module was:
1) extensively erroneous (1.0, as we pointed out I believe in 2011),
2) greatly expanded and again erroneous (2.0 at the start of 2015, apparently due to a database error, and as we shared with the authors earlier this year),
3) and finally what appears to be a mirror of miRTarBase, at least according to our results.
Instability and a lack of clear information on versions and methods causes confusion, especially when scientists take these data at face value. I have reviewed submitted manuscripts that used miRWalk "validated" results as input for further experiments or analysis, even when these results were, unbeknownst to the authors, completely erroneous.
Dr. Dweep suggests that our analysis was deficient because of the terms we used for our 50 selected genes. This would indeed be important had we attempted to gauge the comprehensiveness of miRWalk or other databases (we expressly did not), or if we had used one set of terms for our miRWalk query, and another set for querying all the other databases (we did not, unless forced to do so by an interface). Which gene terms we used in our comparison of databases, then, is irrelevant.
Dr. Dweep's second point is that our analysis focused only on a small portion of the miRWalk database, the validated target module. Should we have ignored perceived problems with such modules, simply because the file sizes for these data are smaller than the sum of all predictive or correlative information on miRNAs in any given database, or on the internet, or whatever other set of information we might consider?
Finally, Dr. Dweep refers to supplemental methods that explain, albeit in vague terms that do not allow reproduction of the results, how validated targets are gathered using text searches and four databases. This does not explain why the results we downloaded from miRWalk2.0 in the period of April-August 2015 were exactly the same as found in another database, miRTarBase (last updated two years ago), down to the sorting of the hits, nor does it explain the drastic fluctuations in numbers of validated hits over the years, almost all of which we examined were erroneous. Thus, a miRWalk validated target module user in January, 2015, would have received a completely different set of results compared with the same query a year earlier or six months later. As we have suggested to Dr. Dweep, one might simply to link to miRTarBase in cases where miRWalk2.0 does not provide additional information, and provide more extensive versioning information or even an automated email update to users when mistakes are found or major changes implemented.
We agree that validated target databases have potential uses and hope that our findings are somehow helpful, as a cautionary note even if they are not used to improve databases.
This comment, imported by Hypothesis from PubMed Commons, is licensed under CC BY.