3,440 Matching Annotations
  1. May 2022
    1. Manton says owning your domain so you can move your content without breaking URLs is owning your content, whereas I believe if your content still lives on someone else's server, and requires them to run the server and run their code so you can access your content, it's not really yours at all, as they could remove your access at any time.

      This is a slippery slope problem, but people are certainly capable of taking positions along a broad spectrum here.

      The one thing I might worry about, particularly given micro.blog's size, is the relative bus factor of one represented by Manton himself. If something were to happen to him, what recourse has he built in to make sure that people could export their data easily and leave the service if the worst were to happen? Is that documented somewhere?

      Aside from this, the service has one of the most reasonable turn-key solutions for domain and data ownership I've seen that doesn't require running all of your own infrastructure.

    2. First, Manton's business model is for users to not own their content. You might be able to own your domain name, but if you have a hosted Micro.blog blog, the content itself is hosted on Micro.blog servers, not yours. You can export your data, or use an RSS feed to auto-post it to somewhere you control directly, but if you're not hosting the content yourself, how does having a custom domain equal self-hosting your content and truly owning it? Compared to hosting your own blog and auto-posting it to Micro.blog, which won't cost you and won't make Micro.blog any revenue, posting for a hosted blog seems to decrease your ownership.

      I'm not sure that this is the problem that micro.blog is trying to solve. It's trying to solve the problem of how to be online as simply and easily as possible without maintaining the overhead of hosting and managing your own website.

      As long as one can easily export their data at will and redirect their domain to another host, one should be fine. In some sense micro.blog makes it easier than changing phone carriers, which in most cases will abandon one's text messages unless one jumps through lots of hoops.

      One step that micro.blog could set up is providing a download dump of all content every six months to a year so that people have it backed up in an accessible fashion. Presently, to my knowledge, one could request this at any time and move when they wished.

    1. The ad lists various data that WhatsApp doesn’t collect or share. Allaying data collection concerns by listing data not collected is misleading. WhatsApp doesn’t collect hair samples or retinal scans either; not collecting that information doesn’t mean it respects privacy because it doesn’t change the information WhatsApp does collect.

      An important logical point. Listing what they don't keep isn't as good as saying what they actually do with one's data.

    1. The main thing Smith has learned over the past seven years is “the importance of ownership.” He admitted that Tumblr initially helped him “build a community around the idea of digital news.” However, it soon became clear that Tumblr was the only one reaping the rewards of its growing community. As he aptly put it, “Tumblr wasn’t seriously thinking about the importance of revenue or business opportunities for their creators.”
    1. Third, the post-LMS world should protect the pedagogical prerogatives and intellectual property rights of faculty members at all levels of employment. This means, for example, that contingent faculty should be free to take the online courses they develop wherever they happen to be teaching. Similarly, professors who choose to tape their own lectures should retain exclusive rights to those tapes. After all, it’s not as if you have to turn over your lecture notes to your old university whenever you change jobs.

      Own your pedagogy. Same as anything else out there...

    1. And yes, some add-ons exist, but I just wish the feature was native to the browser. And I do not want to rely on a third party service. My quotes are mine only and should not necessary be shared with a server on someone's else machine.

      Ownership of the data is important. One could certainly set up their own Hypothes.is server if they liked.

      I personally take the data from my own Hypothes.is account and dump it into my local Obsidian.md vault for saving, crosslinking, and further thought.

    1. With Alphabet Inc.’s Google, and Facebook Inc. and its WhatsApp messaging service used by hundreds of millions of Indians, India is examining methods China has used to protect domestic startups and take control of citizens’ data.

      Governments owning citizens' data directly?? Why not have the government empower citizens to own their own data?

    1. The highlights you made in FreeTime are preserved in My Clippings.txt, but you can’t see them on the Kindle unless you are in FreeTime mode. Progress between FreeTime and regular mode are tracked separately, too. I now pretty much only use my Kindle in FreeTime mode so that my reading statistics are tracked. If you are a data nerd and want to crunch the data on your own, it is stored in a SQLite file on your device under system > freetime > freetime.db.

      FreeTime mode on the Amazon Kindle will provide you with reading statistics. You can find the raw data as an SQLite file under system > freetime > freetime.db.
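
For anyone who wants to look inside that file, here is a minimal sketch using Python's standard sqlite3 module. The source does not describe the schema of freetime.db, so the sketch only lists whatever tables exist. It opens the file read-only so the device copy is not modified. The mount point in the usage comment is an assumption.

```python
import sqlite3
from contextlib import closing
from pathlib import Path


def list_tables(db_path):
    """Return the names of all tables in the SQLite file at db_path."""
    # Build a read-only URI so the query cannot alter (or create) the file.
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    with closing(sqlite3.connect(uri, uri=True)) as conn:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    return [name for (name,) in rows]


# Hypothetical usage; the mount point below is an assumption:
# print(list_tables("/Volumes/Kindle/system/freetime/freetime.db"))
```

Once the table names are known, the same connection pattern can be used to run ordinary SELECT queries against them.
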

    1. I tried very hard in that book, when it came to social media, to be platform agnostic, to emphasize that social media sites come and go, and to always invest first and foremost in your own media. (Website, blog, mailing list, etc.)
    1. Facebook provides some data portability, but makes an odd plea for regulation to make more functionality possible.

      Why do this when they could choose to do the right thing? They don't need to be forced and could certainly try to enforce security. It wouldn't be any worse than unveiling the tons of personal data they've managed not to protect in the past.

    1. Goodreads lost my entire account last week. Nine years as a user, some 600 books and 250 carefully written reviews all deleted and unrecoverable. Their support has not been helpful. In 35 years of being online I've never encountered a company with such callous disregard for their users' data.

      A clarion call for owning your own data.

    1. I like how Dr. Pacheco-Vega outlines some of his research process here.

      Sharing it on Twitter is great, and so is storing a copy on his website. I do worry that it looks like the tweets are embedded via a simple URL method and not done individually, which means that if Twitter goes down or disappears, so does all of his work. Better would be to do a full blockquote embed method, so that if Twitter disappears he's got the text at least. Images would also need to be saved separately.

    1. Common Pitfalls to Avoid When Choosing Your App

      What are the common pitfalls when choosing a note taking application or platform?

      Own your data

      Prefer note taking systems that don't rely on a company's long term existence. While Evernote or OneNote have been around for a while, there's nothing to say they'll be around forever or even your entire lifetime. That shiny new startup note taking company may not gain traction in the market or even exist in two years. If your notes are trapped inside a company's infrastructure and aren't exportable to another location, you're simply dead in the water. Make sure you have a method to export and own the raw data of your notes.

      Test driving many

      and not choosing or sticking with one (or even a few). Don't get stunned into inaction by the number of choices.

      Shiny object syndrome

      is the situation where people focus all attention on something that is new, current or trendy, yet drop this as soon as something new takes its place. There will always be new and perhaps interesting note taking applications. Some may look fun and you'll be tempted to try them out and fragment your notes. Don't waste your time unless the benefits are manifestly clear and the pathway to exporting your notes is simple and easy. Otherwise you'll spend all your time importing/exporting and managing your notes and not taking and using them. Paper and pencil have been around for centuries and they work, so at a minimum use them. True innovation in this space is exceedingly rare. Even small affordances like the ability to have [[wikilinks]] and/or bi-directional links may save only a few seconds here and there; in the long run these can still be done manually, and having a system at all is worth far more than having the best system.

      (Relate this to the same effect in the blogosphere of people switching CMSes and software and never actually writing content on their website. The purpose of the tool is using it and not collecting all the tools as a distraction for not using them. Remember which problem you're attempting to solve.)

      Future needs and whataboutisms

      Surely there will be future innovations in the note taking space, or you may find some niche need that your current system doesn't solve. Given the maturity of the space even in a pen and paper world, this will be rare. Don't worry inordinately about the future; imitate what has worked for large numbers of people in the past and move forward from there.

      Others? Probably...

    1. Even with data that’s less fraught than our genome, our decisions about what we expose to the world have externalities for the people around us.

      We need to think more about the externalities of our data decisions.

    1. It's the feedback that's motivating A-list bloggers like Digg founder Kevin Rose to shut down their blogs and redirect traffic to their Google+ profiles. I have found the same to be true.

      This didn't work out too well for them, did it?

    1. The European Commission has prepared to legislate to require interoperability, and it calls being able to use your data wherever and whenever you like “multi-homing”. (Not many other people like this term, but it describes something important – the ability for people to move easily between platforms

      an interesting neologism to describe something that many want

    1. the decentralised and open source nature of these systems, where anyone can host an instance, may protect their communities from the kinds of losses experienced by users of the many commercial platforms that have gone out of business over the last decades (e.g. Geocities, Wikispaces or Google + to name just a few).

      https://indieweb.org/site-deaths names a large number of others

    1. Subsidiarity, which uses “data cooperatives, collaboratives, and trusts with privacy-preserving and -enhancing techniques for data processing, such as federated learning and secure multiparty computation.”

      Another value of the data cooperative model might be that each individual might not have time to research and administer possible new data-sharing requests/opportunities, and it would be helpful to entrust that work to a cooperative entity that already has one's trust.

    1. A 20-year age difference (for example, from 20 to 40, or from 30 to 50 years old) will, on average, correspond to reading 30 WPM slower, meaning that a 50-year old user will need about 11% more time than a 30-year old user to read the same text.
    2. Users’ age had a strong impact on their reading speed, which dropped by 1.5 WPM for each year of age.
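
The two quoted figures fit together arithmetically: 1.5 WPM per year over 20 years gives the 30 WPM gap. The 11% figure depends on a baseline reading speed that the excerpt does not state. The sketch below assumes reading time is inversely proportional to speed and uses a hypothetical baseline of 300 WPM for the 30-year-old reader.

```python
WPM_DROP_PER_YEAR = 1.5  # figure from the quoted passage


def speed_drop(age_gap_years):
    """WPM lost across an age gap, assuming a linear decline."""
    return WPM_DROP_PER_YEAR * age_gap_years


def extra_time_fraction(baseline_wpm, age_gap_years):
    """Fractional extra time the older reader needs for the same text."""
    slower_wpm = baseline_wpm - speed_drop(age_gap_years)
    return baseline_wpm / slower_wpm - 1.0


print(speed_drop(20))                           # 30.0
# 300 WPM is an assumed baseline, not a number from the source.
print(round(extra_time_fraction(300, 20), 3))   # 0.111
```

With that assumed baseline the older reader needs about 11% more time, which matches the quoted claim.
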
    1. Overall, having spent a significant amount of time building this project, scaling it up to the size it’s at now, as well as analysing the data, the main conclusion is that it is not worth building your own solution, and investing this much time. When I first started building this project 3 years ago, I expected to learn way more surprising and interesting facts. There were some, and it’s super interesting to look through those graphs, however retrospectively, it did not justify the hundreds of hours I invested in this project. I’ll likely continue tracking my mood, as well as a few other key metrics, however will significantly reduce the amount of time I invest in it.

      Words of the author of https://krausefx.com//blog/how-i-put-my-whole-life-into-a-single-database

      It seems as if excessive personal data tracking is not worth it.

  2. Apr 2022
    1. ReconfigBehSci [@SciBeh]. (2021, October 1). @alexdefig against this survey data you might set actual uptake figures in France, various Canadian provinces, and Germany after the introduction of passports [Tweet]. Twitter. https://twitter.com/SciBeh/status/1443955929985159174

    1. ReconfigBehSci [@SciBeh]. (2021, October 1). @alexdefig and I didn’t say we should mandate them. I simply pointed out that when considering the impact of passports on uptake we should probably look at actual uptake in response to actual mandates in addition to survey data, which may or may not translate into action, no? [Tweet]. Twitter. https://twitter.com/SciBeh/status/1443958577173917699

    1. ReconfigBehSci [@SciBeh]. (2021, October 1). @alexdefig so, observational data has weaknesses- so does survey data, but it’s there and we should look at it. On your second point, yes, that is important, we should study that, if we have no data we can’t factor it into decision. Third is separate issue/factor to weigh. [Tweet]. Twitter. https://twitter.com/SciBeh/status/1443960096497627141

    1. The combined stuff is available to components using the page store as $page.stuff, providing a mechanism for pages to pass data 'upward' to layouts.

      bidirectional data flow ?! That's a game changer.

      analogue in Rails: content_for

      https://github.com/sveltejs/kit/pull/3252/files

    1. ReconfigBehSci. (2022, January 24). @STWorg @FraserNelson @GrahamMedley no worse- he took Medley’s comment that Sage model the scenarios the government asks them to consider to mean that they basically set out to find the justification for what the government already wanted to do. Complete failure to distinguish between inputs and outputs of a model [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1485625862645075970

    1. Jackie Parchem, MD [@jackie_parchem]. (2021, July 29). @MeadowGood @ACOGPregnancy Some of the docs who stepped up and got vaccinated early when we didn’t have the data we do now. What we all knew: Protecting moms protects babies! All have had their babies by now! @IlanaKrumm @anushkachelliah @gumbo_amando @emergjenncy @JuliaNEM33 https://t.co/h9UJo6h3fQ [Tweet]. Twitter. https://twitter.com/jackie_parchem/status/1420785474499645442

    1. For this reason, the Secretary of State set out a vision for health and care to have national open standards for data and interoperability that are mandated throughout the NHS and social care.
    1. Nick Sawyer, MD, MBA, FACEP [@NickSawyerMD]. (2022, January 3). The anti-vaccine community created a manipulated version of VARES that misrepresents the VAERS data. #disinformationdoctors use this data to falsely claim that vaccines CAUSE bad outcomes, when the relationship is only CORRELATED. Watch this explainer: Https://youtu.be/VMUQSMFGBDo https://t.co/ruRY6E6blB [Tweet]. Twitter. https://twitter.com/NickSawyerMD/status/1477806470192197633

    1. Carl T. Bergstrom. (2021, August 18). 1. There has been lots of talk about recent data from Israel that seem to suggest a decline in vaccine efficacy against severe disease due to Delta, waning protection, or both. This may have even been a motivation for Biden’s announcement that the US would be adopting boosters. [Tweet]. @CT_Bergstrom. https://twitter.com/CT_Bergstrom/status/1427767356600688646

    1. ReconfigBehSci. (2021, February 1). @islaut1 @richarddmorey I think diff. Is that your first response seemed to indicate the evidence was the search itself (contra Richard) so turning an inference from absence of something into a kind of positive evidence ('the search’). Let’s call absence of evidence “not E”. 1/2 [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1356215051238191104

    1. The Lancet. (2021, April 16). Quantity > quality? The magnitude of #COVID19 research of questionable methodological quality reveals an urgent need to optimise clinical trial research—But how? A new @LancetGH Series discusses challenges and solutions. Read https://t.co/z4SluR3yuh 1/5 https://t.co/94RRVT0qhF [Tweet]. @TheLancet. https://twitter.com/TheLancet/status/1383027527233515520

    1. Dr Nisreen Alwan 🌻. (2020, March 14). Our letter in the Times. ‘We request that the government urgently and openly share the scientific evidence, data and modelling it is using to inform its decision on the #Covid_19 public health interventions’ @richardhorton1 @miriamorcutt @devisridhar @drannewilson @PWGTennant https://t.co/YZamKCheXH [Tweet]. @Dr2NisreenAlwan. https://twitter.com/Dr2NisreenAlwan/status/1238726765469749248

    1. Youyang Gu. (2021, May 25). Is containing COVID-19 a requirement for preserving the economy? My analysis suggests: Probably not. In the US, there is no correlation between Covid deaths & changes in unemployment rates. However, blue states are much more likely to have higher increases in unemployment. 🧵 https://t.co/JrikBtawEb [Tweet]. @youyanggu. https://twitter.com/youyanggu/status/1397230156301930497

    1. Trevor Bedford. (2022, January 10). Given ~680k cases per day, this would in turn suggest 0.8% or 1% of the US being infected with SARS-CoV-2 every day. This would translate to perhaps 5% or 10% of individuals currently infected with SARS-CoV-2 in the US. 15/15 [Tweet]. @trvrb. https://twitter.com/trvrb/status/1480610448563060738

    1. Lewis, S. J., Dack, K., Relton, C. L., Munafo, M. R., & Smith, G. D. (2021). Was the risk of death among the population of teachers and other school workers in England and Wales due to COVID-19 and all causes higher than other occupations during the pandemic in 2020? An ecological study using routinely collected data on deaths from the Office for National Statistics. BMJ Open, 11(11), e050656. https://doi.org/10.1136/bmjopen-2021-050656

    1. Locally, Curry said troopers handled 275 distracted driving crashes in 2020, 322 in 2021 and 47 so far this year. He added local troopers issued 267 distracted driving citations in 2020, 299 in 2021 and 51 so far this year.

      This article was published on April 9th (day 99 of 2022), which is 27% of the way through the year. So based on the data provided, what can we expect?

      In terms of CRASHES, we've had 47 so far. At 27% of the way through 2020 and 2021 we would have had about 75 and 87 crashes respectively (assuming that distracted driving crashes are evenly distributed through the year). So we're on track for 173 distracted driving crashes this year; that's only a little over half (54%) of last year's number.

      As they said in the first paragraph:

      The Delaware Post of the Ohio State Highway Patrol is stepping up enforcement this month in an effort to curb distracted driving, which the agency reports is leading to increased traffic crashes and deaths statewide.

      ...so they're doing this on a PR schedule - not because the numbers are up - in fact, the numbers are down locally by a huge margin.

      With Troopers focusing on this, it means they're not focusing on safety problems that are increasing.
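
The projection in this note can be reproduced with a few lines of Python. This is a sketch under the note's own assumption that crashes are spread evenly across the year. The function names are illustrative.

```python
from datetime import date


def year_fraction(d):
    """Fraction of the calendar year elapsed through the end of day d."""
    start = date(d.year, 1, 1)
    days_in_year = (date(d.year + 1, 1, 1) - start).days
    day_of_year = (d - start).days + 1
    return day_of_year / days_in_year


def project_full_year(count_so_far, fraction_elapsed):
    """Scale a year-to-date count up to a full year, assuming a uniform rate."""
    return count_so_far / fraction_elapsed


frac = year_fraction(date(2022, 4, 9))       # day 99 of 365
projected = project_full_year(47, frac)

print(round(frac * 100))                     # 27
print(round(275 * frac), round(322 * frac))  # 75 87
print(round(projected))                      # 173
print(round(projected / 322 * 100))          # 54
```

The uniform-rate assumption matters: if distracted driving crashes cluster in summer or around holidays, a first-quarter count will understate the full-year total.
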

    1. Let’s look at a recent paper by Xia, Bao, Lo, Xing, Hassan, & Li entitled Measuring Program Comprehension: A Large-Scale Field Study with Professionals and published in IEEE Transactions on Software Engineering, 44, 951-976, 2018. This paper is quite interesting in that it describes in great details how the figures are obtained. And it says that Comprehension took on average ~58%.

      Developers spend most of their time figuring the system out

    1. A New York Times article uses the same temperature dataset you have been using to investigate the distribution of temperatures and temperature variability over time. Read through the article, paying close attention to the descriptions of the temperature distributions.

      Unfortunately, like most NYT content, this article is behind a paywall. I'm partly reading this as I plan to develop a set of open education resources myself and the problem of how to manage dead/unavailable links looks like a key stumbling block.

    1. Tyler Black, MD. (2021, December 10). Statistics Canada has been asking kids about mental health during the pandemic. Initially, after the first 5 months (with school shutdowns, summer break, lots of restrictions), more kids said they were better than worse, most reported no change. 86% “No change or better” [/1] https://t.co/3shKtrxEVU [Tweet]. @tylerblack32. https://twitter.com/tylerblack32/status/1469380405451100162

  3. Mar 2022
    1. ReconfigBehSci on Twitter: ‘@alexdefig are you really going to claim that responses to the introduction of passports on uptake across 4 other countries are evidentially entirely irrelevant to whether or not passports are justified or not?’ / Twitter. (n.d.). Retrieved 31 March 2022, from https://twitter.com/SciBeh/status/1444358068280565764

    1. Strategic, cost-efficient evidence-building relies on strong data governance that facilitates the access, protection, and use of program and other administrative data to enable and support secondary uses, including for
    2. The statute makes agency evidence-building plans, known as Learning Agendas, foundational to building a culture of evidence generation and use.
    1. Unwin, H. J. T., Hillis, S., Cluver, L., Flaxman, S., Goldman, P. S., Butchart, A., Bachman, G., Rawlings, L., Donnelly, C. A., Ratmann, O., Green, P., Nelson, C. A., Blenkinsop, A., Bhatt, S., Desmond, C., Villaveces, A., & Sherr, L. (2022). Global, regional, and national minimum estimates of children affected by COVID-19-associated orphanhood and caregiver death, by age and family circumstance up to Oct 31, 2021: An updated modelling study. The Lancet Child & Adolescent Health, 6(4), 249–259. https://doi.org/10.1016/S2352-4642(22)00005-0

    1. The audit found that the CIO has limited insight into each Sector’s entire data holdings given a decentralized model, and lack of centralized guidance, standard definitions, and corporate data management system. CMSS representatives acknowledged that the NRCan Data Inventory is not a complete listing of NRCan datasets; however, it was found that it serves as a good starting point in identifying datasets held within the Department. However, per TBS guidance, a complete departmental inventory should include a list of all datasets even if they are identified as not eligible for release.
    1. Kerr, P. J., Cattadori, I. M., Liu, J., Sim, D. G., Dodds, J. W., Brooks, J. W., Kennett, M. J., Holmes, E. C., & Read, A. F. (2017). Next step in the ongoing arms race between myxoma virus and wild rabbits in Australia is a novel disease phenotype. Proceedings of the National Academy of Sciences, 114(35), 9397–9402. https://doi.org/10.1073/pnas.1710336114

    1. Eran Segal. (2021, August 17). Israel data showing the decay of vaccine efficacy over time. Y-axis is cases per 1000 from July 7 to Aug 10, for unvaccinated, and for people vaccinated at different times Cases are higher in those vaxed earlier Despite world-data caveats, this seems quite compelling https://t.co/5aNz48AC8F [Tweet]. @segal_eran. https://twitter.com/segal_eran/status/1427696623988117505

    2. Natalie E. Dean, PhD. (2021, August 17). Real-world data from Israel show a growing gap between the earliest vaccinated (blue arrow) and the recently vaccinated (green arrow) within age groups. Confounding is always a concern (are these groups fundamentally different?) but the magnitude of the difference is notable. Https://t.co/s8pevRbax8 [Tweet]. @nataliexdean. https://twitter.com/nataliexdean/status/1427703094062706691

    1. Learn Data Science from IIT Madras faculty & Industry experts and earn a Data Science certification from India's best Engineering College. Become a Data Scientist through multiple data Science courses covered in this 7-month data science certification program with hands-on exercises & Project work.

      This Data Science Course is offered by Intellipaat in collaboration with IIT Madras (one of the renowned institutes in India) to help you master Data Science skills like Python, programming, Data Visualization, Statistical analysis and computing, Deep Learning, etc.

      Eager to step into the field of Data Science? Explore the Page now!

    1. Data integrity is a good thing. Constraining the values allowed by your application at the database-level, rather than at the application-level, is a more robust way of ensuring your data stays sane.
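
To illustrate the point about database-level constraints, here is a minimal sketch using SQLite through Python's standard sqlite3 module rather than the Rails tooling the quoted article discusses. The table, columns, and allowed values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE users (
        id     INTEGER PRIMARY KEY,
        nick   TEXT NOT NULL,
        -- The database itself rejects any status outside this list.
        status TEXT NOT NULL CHECK (status IN ('active', 'suspended'))
    )
    """
)


def try_insert(nick, status):
    """Return True if the row was accepted, False if the database rejected it."""
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "INSERT INTO users (nick, status) VALUES (?, ?)", (nick, status)
            )
        return True
    except sqlite3.IntegrityError:
        return False


print(try_insert("alice", "active"))  # True
print(try_insert("bob", "banana"))    # False
```

Because the check lives in the schema, it applies to every writer: application code, one-off scripts, and manual console sessions alike.
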
    1. This is particularly useful in cases where you want to separate your data migrations from your schema migrations or where you have multiple steps in your migration process that must have other steps invoked throughout.
    1. The code will work without exception but it doesn’t set correct association, because the defined classes are under namespace AddStatusToUser. This is what happens in reality:

          role = AddStatusToUser::Role.create!(name: 'admin')
          AddStatusToUser::User.create!(nick: '@ka8725', role: role)
    1. There are three keys to backfilling safely: batching, throttling, and running it outside a transaction. Use the Rails console or a separate migration with disable_ddl_transaction!.
    2. Active Record creates a transaction around each migration, and backfilling in the same transaction that alters a table keeps the table locked for the duration of the backfill.

          class AddSomeColumnToUsers < ActiveRecord::Migration[7.0]
            def change
              add_column :users, :some_column, :text
              User.update_all some_column: "default_value"
            end
          end
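
The three keys named above (batching, throttling, and keeping the backfill out of the schema-change transaction) can be sketched in a language-neutral way. The example below uses SQLite through Python's sqlite3 module, not Active Record. The table and column names mirror the quoted migration. The batch size and pause are arbitrary assumptions.

```python
import sqlite3
import time


def backfill_in_batches(conn, value, batch_size=1000, pause_seconds=0.0):
    """Fill users.some_column where it is NULL, one short transaction per batch."""
    total = 0
    while True:
        with conn:  # each batch commits on its own, so locks are held briefly
            cur = conn.execute(
                """
                UPDATE users SET some_column = ?
                WHERE id IN (
                    SELECT id FROM users WHERE some_column IS NULL LIMIT ?
                )
                """,
                (value, batch_size),
            )
        if cur.rowcount == 0:
            return total
        total += cur.rowcount
        time.sleep(pause_seconds)  # throttle between batches


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, some_column TEXT)")
conn.executemany("INSERT INTO users (some_column) VALUES (?)", [(None,)] * 2500)
conn.commit()

updated = backfill_in_batches(conn, "default_value", batch_size=1000)
print(updated)  # 2500
```

The schema change (adding the column) happens before this function is called and is committed separately, so the long-running data update never shares a transaction with it.
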
    1. No need to construct strings that then need to be deconstructed later.
    2. I believe we need to break free of these anachronistic designs and use event loggers, not message loggers
    3. µ/log's idea is to replace the "3 Pillars of Observability" with a more fundamental concept: "the event"

      bold goal

    4. Event-based data is easy to index, search, augment, aggregate and visualise therefore can easily replace traditional logs, metrics and traces.
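
A small sketch of the contrast these excerpts draw between message logs and event logs. This is not µ/log's API; it is a generic Python illustration with made-up event names and fields. Each event is recorded as structured data, so aggregation needs no string parsing.

```python
import json
import time


def format_event(event_type, **fields):
    """Serialize one event as a JSON line; the field names are illustrative."""
    event = {"event": event_type, "timestamp": time.time(), **fields}
    return json.dumps(event, sort_keys=True)


def total_by(lines, event_type, field):
    """Sum a numeric field across all events of one type."""
    total = 0
    for line in lines:
        event = json.loads(line)
        if event.get("event") == event_type:
            total += event.get(field, 0)
    return total


log = [
    format_event("order_placed", user_id=42, amount_cents=1999),
    format_event("order_placed", user_id=7, amount_cents=500),
    format_event("login", user_id=42),
]

print(total_by(log, "order_placed", "amount_cents"))  # 2499
```

A message logger would have written something like "User 42 placed an order for $19.99", and the aggregate would then require a regular expression to pull the number back out of the text.
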
    1. Linked data makes it possible to completely decouple computable information from the system that ordinarily houses it.
    1. 75% of people in the U.S. never tweet. On an average weeknight in January, just 1% of U.S. adults watched primetime Fox News (2.2 million). 0.5% tuned into MSNBC (1.15 million). Nearly three times more Americans (56%) donated to charities during the pandemic than typically give money to politicians and parties (21%).
    1. ReconfigBehSci. (2022, February 17). @thackerpd @STWorg “carping about anti-vaxxers”? You mean constant attempts to try and save lives and end pandemic by generating, curating and promoting research data on the benefits of vaccination and/or generating, curating and promoting data that undercuts the wilful disinformation on vaxx? [Tweet]. @SciBeh. https://twitter.com/SciBeh/status/1494201269724012546

    1. Working on a new data visceralization. I’m particularly interested in the tactile quality of this one. Covid deaths from 3/2020-6/2021

      Working on a new data visceralization. I’m particularly interested in the tactile quality of this one. Covid deaths from 3/2020-6/2021 pic.twitter.com/MjFZCqDP4x

      — Jacqueline Wernimont (@profwernimont) March 1, 2022
  4. Feb 2022
    1. let zeta = getProcessControl.bind(this); Object.setPrototypeOf(zeta, Object.getPrototypeOf(this)); return zeta;

      useful pattern

    1. Linked Data here refers to the technical preparation of the data so that the data can be linked. The data model used for this is RDF, which was originally developed for the Semantic Web.
    1. When the C.D.C. published the first significant data on the effectiveness of boosters in adults younger than 65 two weeks ago, it left out the numbers for a huge portion of that population: 18- to 49-year-olds, the group least likely to benefit from extra shots, because the first two doses already left them well-protected.

      The US is not only the worst country from a deaths/cases standpoint; its governmental health services are also not up to the task.

      The US is a failed state in many domains outside defense & security.

    1. Cornelius Roemer. (2022, February 12). Fantastic work by @UKHSA comparing serial intervals of BA.1, BA.2 and Delta as published in the most recent technical briefing. BA.2 seems to have even shorter serial interval than BA.1 This could help explain different relative growth rates of BA.2 vs BA.1 in different countries https://t.co/Gch94Ew8CX [Tweet]. @CorneliusRoemer. https://twitter.com/CorneliusRoemer/status/1492434232664375304

    1. Altarawneh, H. N., Chemaitelly, H., Hasan, M. R., Ayoub, H. H., Qassim, S., AlMukdad, S., Coyle, P., Yassine, H. M., Al-Khatib, H. A., Benslimane, F. M., Al-Kanaani, Z., Al-Kuwari, E., Jeremijenko, A., Kaleeckal, A. H., Latif, A. N., Shaik, R. M., Abdul-Rahim, H. F., Nasrallah, G. K., Al-Kuwari, M. G., … Abu-Raddad, L. J. (2022). Protection against the Omicron Variant from Previous SARS-CoV-2 Infection. New England Journal of Medicine, 0(0), null. https://doi.org/10.1056/NEJMc2200133

    1. To achieve nimbleness, we can simplify the data landscape by using a semantic fabric, popularly called data fabric, based on a strong Metadata Management operating model.

      data fabric

    1. Wordle's spread on social media was enabled in part by its low-tech approach for e.g. sharing scores.

      One low-tech approach that could've been used here for data persistence would be to generate and prompt the user to save their latest scorecard in PDF or Word format—only it's not a PDF or Word format, but instead "wordlescore.html" file, albeit one that they are able to save to disk and double click to open all the same. When they need to update their scorecard with today's data, you use window.open to show a page that prompts the user to open their most recent scorecard (using either Ctrl+/Cmd+O, or by navigating to the place where they saved it on disk via bookmark). What's not apparent on sight alone is that their wordlescore.html also contains a JS payload as an inline script. When wordlescore.html is opened, it's able to communicate with the Wordle tab via postMessage to window.opener, request the newest data from the app, and then update wordlescore.html itself as appropriate.

    1. Alastair Grant. (2022, February 16). Samples likely to be BA.2 (SGT positive in TaqPath data) now make up 34% of COVID cases in England. The proportion has roughly doubled in a week. That represents a growth in absolute numbers of BA.2, even if overall infections are falling at the same rate as reported cases https://t.co/LNr5baChby [Tweet]. @AlastairGrant4. https://twitter.com/AlastairGrant4/status/1493880986660225024

    1. Data mining and knowledge discovery in databases comprise methods for extracting information and knowledge from structured data sets [99].

      Data mining systems

    2. Decision-support and executive information systems are mostly based on subject-oriented, integrated, and time-variant data collections, so-called data warehouses.

      Data warehouse systems:

    1. Deepti Gurdasani. (2022, January 10). Lots of people dismissing links between COVID-19 and all-cause diabetes. An association that’s been shown in multiple studies- whether this increase is due to more diabetes or SARS2 precipitating diabetic keto-acidosis allowing these to be diagnosed is not known. A brief look👇 [Tweet]. @dgurdasani1. https://twitter.com/dgurdasani1/status/1480546865812840450

    1. Eric Feigl-Ding. (2022, January 17). Pandemic leadership matters. #COVID19 mortality per capita by state. 📍Public health is policy, policy is politics. 📍Human behavior is often driven by misinformation. 📍Misinformation is often driven by politics. 📍Politics can be changed by voting—Unless voters can’t. https://t.co/pFkndQZrfr [Tweet]. @DrEricDing. https://twitter.com/DrEricDing/status/1483181226815012867

    1. APPG on Coronavirus. (2022, January 18). 🗣Dr.Claire Steves continued: “Looking in the national core studies, from cohort studies across the UK we’ve looked at 10 different longitudinal studies. Our best estimates are that about 5% of middle aged people are experiencing long term.. 27/ #APPGCoronavirus #LongCovid [Tweet]. @AppgCoronavirus. https://twitter.com/AppgCoronavirus/status/1483453895061999618

    1. Adam Kucharski. (2022, January 18). Below analysis was two years ago (https://bbc.co.uk/news/health-51148303). As well as providing an early warning about the COVID threat, it’s a good illustration of what is often an under-appreciated point: If we want to make sense of epidemic data and dynamics in real-time, we need models… 1/ https://t.co/ZdpzOq3Bzp [Tweet]. @AdamJKucharski. https://twitter.com/AdamJKucharski/status/1483368504392880128

    1. F. Perry Wilson, MD MSCE. (2022, February 4). If you, like me, are “skipping ahead” during the ACIP meeting re: Moderna vaccine—This slide really drives home the benefit / risk paradigm among the group at highest risk of myocarditis (men 18-35). 2 million shots = 1903 avoided hospitalizations, and 68 myocarditis cases. https://t.co/3nzWXGXyD1 [Tweet]. @fperrywilson. https://twitter.com/fperrywilson/status/1489649379979972609

    1. established as a new type of data platform for storing, integrating, and analyzing all kinds of (raw) data

      The data lake as a new type of data platform

    2. Data lake management platforms are tool suites that, on the basis of a data catalog, integrate further functionality for data management in the data lake. Typically this means complementary functionality for ETL, self-service data preparation, and data federation that is tightly integrated with the data catalog.

      Data lake management platforms

    3. The core of metadata management in the data lake consists of original data lake metadata and result metadata

      Core of metadata management in the data lake

    4. Original data lake metadata are metadata about the data stored in the data lake, e.g., data in relational databases and message queues of the data lake.

      Original data lake metadata

    5. With the increasing use of data lakes, extended requirements arise in industrial practice, particularly with regard to ensuring the transparency, quality, and compliance of the data in the data lake and to supporting self-service scenarios for business users. These requirements make metadata management in the data lake a critical success factor [QHV16, HGQ16]. The aim is to prevent the data lake from turning into a data swamp [HGQ16], that is, a data platform whose data can no longer be used in a meaningful way.

      Metadata management for data lakes

    1. For example, they help to open up a company's heterogeneous data silos, link them intelligently, reinterpret them, and make them available in a targeted way on the company intranet.

      Potential of semantic technologies: breaking up heterogeneous data silos. Technology: Linked Data

    1. In addition, an important trend is establishing Linked Data in the enterprise environment in order to develop, establish, and successfully market a new generation of semantic, networked data applications based on the Linked Data paradigm. In the BMBF growth core project "Linked Enterprise Data Services", for example, a technology platform is being built for this purpose that is intended to enable companies to establish new services in Web 3.0.

      BMBF growth core project "Linked Enterprise Data Services"

    1. Because CENS was an academic research lab, faculty members held a large amount of power to decide which projects students pursued and what issues students faced during design, testing, and implem

      CENS seems like it takes its job seriously. Like I said in my other annotation for week 5. Just because data scientists are trying to root out bias in all forms doesn't mean it is always effective or that what is effective can't be improved.

    1. A third introduced me to an app that allowed us to upload bibliographic data to our personal databases just by scanning a book’s ISBN number with a phone camera.

      I'm pretty sure I can wire up something simple to do this to dovetail with Zotero.
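      One piece of such a workflow might be normalizing whatever the phone camera reads before handing it to Zotero, which can look up items by ISBN. The sketch below is only an assumption about how that step could look: the function names are hypothetical, and it covers just the standard ISBN-10 and ISBN-13 check-digit rules, not the scanning itself or any Zotero API.

```typescript
// Strip hyphens and spaces that a scanner or a user may include.
function cleanIsbn(raw: string): string {
  return raw.replace(/[\s-]/g, "").toUpperCase();
}

// ISBN-13 check: digits weighted 1,3,1,3,... must sum to a multiple of 10.
function isValidIsbn13(isbn: string): boolean {
  if (!/^\d{13}$/.test(isbn)) return false;
  let sum = 0;
  for (let i = 0; i < 13; i++) {
    sum += Number(isbn.charAt(i)) * (i % 2 === 0 ? 1 : 3);
  }
  return sum % 10 === 0;
}

// ISBN-10 check: digits weighted 10 down to 1 must sum to a multiple of 11.
// A final "X" stands for the value 10.
function isValidIsbn10(isbn: string): boolean {
  if (!/^\d{9}[\dX]$/.test(isbn)) return false;
  let sum = 0;
  for (let i = 0; i < 10; i++) {
    const ch = isbn.charAt(i);
    sum += (ch === "X" ? 10 : Number(ch)) * (10 - i);
  }
  return sum % 11 === 0;
}

// Convert an ISBN-10 to ISBN-13: prefix 978, drop the old check digit,
// then compute the ISBN-13 check digit over the first 12 digits.
function isbn10To13(isbn10: string): string {
  const body = "978" + isbn10.slice(0, 9);
  let sum = 0;
  for (let i = 0; i < 12; i++) {
    sum += Number(body.charAt(i)) * (i % 2 === 0 ? 1 : 3);
  }
  return body + String((10 - (sum % 10)) % 10);
}

// Return a bare 13-digit ISBN, or null if the input passes neither check.
function normalizeIsbn(raw: string): string | null {
  const isbn = cleanIsbn(raw);
  if (isValidIsbn13(isbn)) return isbn;
  if (isValidIsbn10(isbn)) return isbn10To13(isbn);
  return null;
}
```

      A scanned value that comes back as `null` was probably misread and is worth rescanning before it ends up in the library.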

    1. A central idea of Linked Data is that data and information of widely varying origin and structure can, on the basis of standards, be interpreted, (further) processed, linked, and finally presented to the user in a form that reduces the user's effort in gathering and preparing information

      Guiding idea of Linked Data

    2. Figure 2.8 shows an overview of the so-called "Linking Open Data Cloud"

      Figure

    1. Large enterprises and organizations use a vast variety of different information systems, databases, portals, wikis and Knowledge Bases [KBs] combining hundreds and thousands of data and information sources.

      Problem statement: Big Data

    2. The unified approach has the advantage that the enterprise has more control over the data and quality, and the data querying is significantly faster.
    3. The transitionary approach is advisable when data security plays a vital role.