4 Matching Annotations
  1. Jul 2019
    1. However, the gain ratio is the most important metric here; it ranges from 0 to 1, with higher values being better.
    2. en: entropy, measured in bits; mi: mutual information; ig: information gain; gr: gain ratio (sketched below)
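       To make the legend concrete, here is a minimal Python sketch of these quantities for discrete data. The toy `outlook`/`play` arrays are illustrative and not from the annotated source.

       ```python
       import numpy as np
       from collections import Counter

       def entropy(values):
           """Shannon entropy of a discrete sequence, in bits (en)."""
           counts = np.array(list(Counter(values).values()), dtype=float)
           p = counts / counts.sum()
           return float(-np.sum(p * np.log2(p)))

       def info_gain(feature, target):
           """Information gain (ig): H(target) - H(target | feature).
           For discrete variables this equals the mutual information (mi)."""
           n = len(target)
           h_cond = 0.0
           for v in set(feature):
               subset = [t for f, t in zip(feature, target) if f == v]
               h_cond += len(subset) / n * entropy(subset)
           return entropy(target) - h_cond

       def gain_ratio(feature, target):
           """Gain ratio (gr): ig normalised by H(feature), so it lies in [0, 1]."""
           h_feat = entropy(feature)
           return info_gain(feature, target) / h_feat if h_feat > 0 else 0.0

       # toy data: a categorical feature vs. a binary label
       outlook = ["sunny", "sunny", "overcast", "rain", "rain", "overcast", "sunny", "rain"]
       play    = ["no",    "no",    "yes",      "yes",  "no",   "yes",      "yes",   "yes"]
       print(f"en={entropy(play):.3f}  ig={info_gain(outlook, play):.3f}  "
             f"gr={gain_ratio(outlook, play):.3f}")
       ```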
    1. Feature predictive power is calculated for every feature in a dataset relative to the outcome feature. It works for binary classification, multi-class classification, and regression problems, and can also be used when exploring a feature of interest to determine how strongly the independent features correlate with the outcome feature. When the outcome feature is continuous in nature (a regression problem), correlation calculations are performed. When the outcome feature is categorical in nature (a classification problem), the Kolmogorov-Smirnov distance measure is used to determine predictive power. For multi-class classification outcomes, a one-vs-all approach is taken and the results are averaged to arrive at the mean KS distance measure. The predictive power is sensitive to how the data has been prepared and will differ if the data preparation changes.
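       A minimal sketch of the scheme described above, assuming pandas Series inputs. The function name `predictive_power` and the `nunique() > 10` continuous-outcome heuristic are illustrative assumptions, not the tool's actual API.

       ```python
       import numpy as np
       import pandas as pd
       from scipy.stats import ks_2samp

       def predictive_power(feature, outcome):
           """Correlation for continuous outcomes; mean one-vs-all
           Kolmogorov-Smirnov distance for categorical outcomes."""
           if pd.api.types.is_numeric_dtype(outcome) and outcome.nunique() > 10:
               # regression outcome: absolute Pearson correlation (assumption)
               return abs(feature.corr(outcome))
           # classification outcome: KS distance between the feature's values
           # inside each class and outside it, averaged over classes (one vs. all)
           distances = [
               ks_2samp(feature[outcome == cls], feature[outcome != cls]).statistic
               for cls in outcome.unique()
           ]
           return float(np.mean(distances))

       # toy usage on synthetic data
       rng = np.random.default_rng(0)
       x = pd.Series(rng.normal(size=500))
       y = (x + rng.normal(size=500) > 0).astype(int)  # binary outcome
       print(predictive_power(x, y))
       ```

       For binary outcomes the two one-vs-all comparisons coincide, so the mean reduces to the single KS distance; for multi-class outcomes it matches the averaged one-vs-all measure described above.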
    1. Mutual information is one of many quantities that measure how much one random variable tells us about another. It is a dimensionless quantity (generally expressed in bits), and can be thought of as the reduction in uncertainty about one random variable given knowledge of another. High mutual information indicates a large reduction in uncertainty; low mutual information indicates a small reduction; and zero mutual information between two random variables means the variables are independent.
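       As a quick check of the "reduction in uncertainty" reading, here is a small sketch (illustrative, not from the source) computing I(X;Y) = H(X) + H(Y) - H(X,Y) from a joint probability table.

       ```python
       import numpy as np

       def entropy_bits(p):
           """Shannon entropy in bits of a probability table."""
           p = np.asarray(p, dtype=float).ravel()
           p = p[p > 0]
           return float(-np.sum(p * np.log2(p)))

       def mutual_information(joint):
           """I(X;Y) = H(X) + H(Y) - H(X,Y) for a joint probability table."""
           px = joint.sum(axis=1)  # marginal of X (rows)
           py = joint.sum(axis=0)  # marginal of Y (columns)
           return entropy_bits(px) + entropy_bits(py) - entropy_bits(joint)

       # perfectly dependent: knowing X removes all uncertainty about Y
       dependent = np.array([[0.5, 0.0],
                             [0.0, 0.5]])
       # independent: the joint is the outer product of its marginals
       independent = np.outer([0.5, 0.5], [0.5, 0.5])

       print(mutual_information(dependent))    # 1.0 bit
       print(mutual_information(independent))  # 0.0 bits
       ```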