34 Matching Annotations
  1. Dec 2023
    1. Technique #2: Sampling

      How do you load only a subset of the rows?

      When you load your data, you can specify a skiprows function that will randomly decide whether to load that row or not:

      ```

      from random import random

      def sample(row_number): ... if row_number == 0: ... # Never drop the row with column names: ... return False ... # random() returns uniform numbers between 0 and 1: ... return random() > 0.001 ... sampled = pd.read_csv("/tmp/voting.csv", skiprows=sample) len(sampled) 973 ```

  2. Aug 2022
  3. Mar 2022
  4. Jan 2022
    1. Generally, bootstrap involves the following steps:
      1. A sample from population with sample size n.
      2. Draw a sample from the original sample data with replacement with size n, and replicate B times, each re-sampled sample is called a Bootstrap Sample, and there will totally B Bootstrap Samples.
      3. Evaluate the statistic of θ for each Bootstrap Sample, and there will be totally B estimates of θ.
      4. Construct a sampling distribution with these B Bootstrap statistics and use it to make further statistical inference, such as:
        • Estimating the standard error of statistic for θ.
        • Obtaining a Confidence Interval for θ.
  5. Nov 2021
  6. Oct 2021
    1. ult to provide workable guidelines here.

      In Grounded Study - decide where to start and what tools will be used to collect information.

    Tags

    Annotators

  7. Apr 2021
  8. Mar 2021
  9. Jan 2021
  10. Oct 2020
    1. M.B can’t be reduced to stereotypes, of course. But there’s also a bar to entry into this social-media network, and it’s a distinctly technophilic, first-world, Western bar.

      You can only say this because I suspect you're comparing it to platforms that are massively larger by many orders of magnitude. You can't compare it to Twitter or Facebook yet. In fact, if you were to compare it to them, then it would be to their early versions. Twitter was very technophilic for almost all of it's first three years until it crossed over into the broader conscious in early 2009.

      Your argument is somewhat akin to doing a national level political poll and only sampling a dozen people in one small town.

  11. Sep 2020
  12. Aug 2020
    1. Vogels, C. B. F., Brackney, D., Wang, J., Kalinich, C. C., Ott, I., Kudo, E., Lu, P., Venkataraman, A., Tokuyama, M., Moore, A. J., Muenker, M. C., Casanovas-Massana, A., Fournier, J., Bermejo, S., Campbell, M., Datta, R., Nelson, A., Team, Y. I. R., Cruz, C. D., … Grubaugh, N. (2020). SalivaDirect: Simple and sensitive molecular diagnostic test for SARS-CoV-2 surveillance. MedRxiv, 2020.08.03.20167791. https://doi.org/10.1101/2020.08.03.20167791

  13. Jul 2020
  14. Jun 2020
    1. Akhvlediani, T., Ali, S. M., Angus, D. C., Arabi, Y. M., Ashraf, S., Baillie, J. K., Bakamutumaho, B., Beane, A., Bozza, F., Brett, S. J., Bruzzone, R., Carson, G., Castle, L., Christian, M., Cobb, J. P., Cummings, M. J., D’Ortenzio, E., Jong, M. D. de, Denis, E., … Webb, S. (2020). Global outbreak research: Harmony not hegemony. The Lancet Infectious Diseases, 0(0). https://doi.org/10.1016/S1473-3099(20)30440-0

  15. May 2020
  16. Sep 2019
    1. current microbiome profiling approaches such as metagenomic sequencing require homogenization of input material, which means that underlying spatial information is lost
  17. Jun 2015
    1. I don't claim to understand the thought processes that would drive someone to do this, but given the rarity and extremity of suicide, we can assume for every worker who goes ahead with suicide for work-related reasons, there are a hundred or a thousand who feel miserable but not quite suicidal.