86 Matching Annotations
  1. Last 7 days
    1. 2.5 Recursive Partitioning: Tree-Based Models

      I think the title of sec 2.5 should be more general, like "Predictive Modeling" and then make Recursive Partitioning the first subsection. Why? because with Recursive Partitioning the title of the entire section, it seems to hide the real meat of the section, witch I think is the New Directions ...

    1. are bootstrap estimates corrected for overfitting

      "... are bootstrap ... corrected .. " Is this the start of a question?

      or is the intended sentence ... "... boostrap estimates ARE corrected?

  2. May 2026
    1. For related information

      Looks like the de Mul citation is not linked, just the doi:

      De Mul N, de Jong V, Cremer O ... Practical guidance for validating the predictive performance in the presence of missing data: a guide for the clinical researcher Journal of Clinical Epidemiology, 2026; 192

  3. Feb 2026
  4. May 2025
    1. Using Cox survival estimates at 60 Months

      CalibratioN ... suggest adding note about when to worry!

      "We're going to worry about over-predicting survival when it's high and under-predicting when it's low. Frank Harrell 09:02:47 Uh, and this is the classic kind of regression to the mean. So to really… If you wanted to use this as a prognostic model. Frank Harrell 09:02:55 Predictive model, you probably go back and apply some shrinkage, or really talk to the experts to do more data reduction than I did."

    2. Choosing the Number of Parameters and Fitting the Model

      Note Frank says he would not analyze this ds this way now. and there might be an error in the data

      ?? maybe put a note here for the reader? or maybe link to the video 5-20-2025

  5. Jan 2025
  6. Dec 2024
  7. Jun 2024
    1. rmsb: Bayesian Regression Modeling Strategies Package, Focusing on Semiparametric Univariate and Longitudinal Models

      maybe subdivide 6.3 with 6.3.1 rms, 6.3.2 rmsb, and ? 6.3.3 the global options, etc. or maybe include that with 6.3.1?

      Main point is the toc for section 6 should show a subsection for rmsb explicitly!

  8. May 2024
    1. 42

      why is the number 42 shown separately from the lowest interval and the second lowest?

      also, wondering how to see the color associated with each interval? True, I can hover over various colors and figure out that yellow represents the highest interval ...

  9. Apr 2024
  10. Jun 2023
  11. May 2023
    1. Logistic regression

      Just to clarify, some ML 'packages' (e.g. AzureML, GoogleML) consider logistic regression to be one of the ML algorithms. FWIW, I've seen (high school) science fair projects use both of those and find that logistic is "the winner" ...

    2. over-simplified

      if the cutoff for dichotomizing one variable is unstable, shouldn't be surprised that a tree based on cutoffs by dichotomizing multiple variables is unstable ?

    1. that the difference between the most different treatments is badly biased

      trying to understand this: maybe "the difference between the treatments with the greatest difference is badly biased"

  12. Apr 2023
    1. Best way to make model fit data well is to discard much of the data

      Suggested edit to emphasize you don't really suggest this approach?!

      "Best way" to make model fit data well is to discard much of the data!!

  13. Mar 2023
  14. Aug 2022