latent variable
latent variables are variables that cannot be observed
One way to detect overfitting in practice is to observe that the model has low training risk but high test risk during cross-validation
overfitting = high acc during training and low acc during testing
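A minimal sketch of spotting this gap in practice, assuming scikit-learn is available; the synthetic dataset and the deliberately deep decision tree are placeholders of my own, not from the book:

```python
# Sketch: detect overfitting by comparing training vs. validation accuracy.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=20, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

model = DecisionTreeClassifier(max_depth=None, random_state=0).fit(X_train, y_train)
train_acc = model.score(X_train, y_train)  # typically close to 1.0 (low training risk)
val_acc = model.score(X_val, y_val)        # noticeably lower -> overfitting

print(f"train accuracy={train_acc:.2f}, validation accuracy={val_acc:.2f}")
```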
Model Fitting
how well a model is learning
cross-validation
technique used to evaluate how well your model generalizes, by repeatedly training on one part of the data and testing on the held-out part
validation set
I have always been confused by the validation set. It is a set used to provide a glimpse of how your model will react to the data. Usually you take a portion of the training set to create the validation set
Regularization
technique used to reduce overfitting
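A common example (not the only form of regularization) is the L2/ridge penalty, which adds a term penalizing large parameters to the training objective; \(\lambda\) controls the regularization strength:

$$\min_{\boldsymbol\theta}\ \sum_{i=1}^{N}\big(y_i - f(\boldsymbol x_i; \boldsymbol\theta)\big)^2 + \lambda\,\|\boldsymbol\theta\|_2^2$$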
overfitting
overfitting = during training the error is small whereas during testing it is large
Another phrase commonly used for expected risk is "population risk"
From what I know, population risk is the number of individuals at risk. Is it the same thing as expected risk?
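Following up: in this context "population risk" is a statistical-learning term, not an epidemiological one. Per the highlight above, it is just another name for the expected risk, i.e., the expected loss over the true (unknown) data-generating distribution:

$$R_{\text{pop}}(f) = \mathbb{E}_{(\boldsymbol x, y)\sim p(\boldsymbol x, y)}\big[\ell\big(y, f(\boldsymbol x)\big)\big]$$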
independent and identically distributed
what is a set of examples here? I am thinking of it as features rather than anything else. But features are dependent upon one another, so I am not sure what this means
Affine functions are often referred to as linear functions in machine learning
affine function = linear function
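To make the distinction concrete (strictly speaking, an affine function has an extra offset term):

$$f(\boldsymbol x) = \boldsymbol A\boldsymbol x + \boldsymbol b \ \ \text{(affine)}, \qquad f(\boldsymbol x) = \boldsymbol A\boldsymbol x \ \ \text{(linear, i.e., } \boldsymbol b = \boldsymbol 0\text{)}$$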
Training or parameter estimation
adjust predictive model based on training data.
In order to find good predictors, do one of two things: 1) find the best predictor based on some measure of quality (known as finding a point estimate), or 2) use Bayesian inference
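As shorthand for the two approaches (\(\boldsymbol\theta\) denotes the parameters, \(\mathcal{X}\) the training data; the maximum-likelihood form is just one example of a point estimate):

$$\hat{\boldsymbol\theta} = \arg\max_{\boldsymbol\theta}\, p(\mathcal{X}\mid\boldsymbol\theta) \ \ \text{(point estimate)}, \qquad p(\boldsymbol\theta\mid\mathcal{X}) = \frac{p(\mathcal{X}\mid\boldsymbol\theta)\,p(\boldsymbol\theta)}{p(\mathcal{X})} \ \ \text{(Bayesian posterior)}$$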
Prediction or inference
predict on unseen test data. 'inference' can mean prediction for non-probabilistic models or parameter estimation
goal of learning is to find a model and its corresponding parameters such that the resulting predictor will perform well on unseen data
important
noisy observation
real-life data is always noisy
example or data point
I thought rows were observations or instances?
we do not expect the identifier (the Name) to be informative for a machine learning task
This is a good reminder to only query the columns or data that are relevant to the exercise
features, attributes, or covariates
What do we mean by good models?
This is a great question. I usually think of models as algorithms
good models should perform well on unseen data
This is the main idea when implementing a machine learning model
This derivation is easiest to understand by drawing the reasoning as it progresses.
The reasoning of the derivative?
Example of a convex set
an easy example to identify convex sets. One way to determine whether a set is convex is to check that for any two points in the set, the line segment connecting them lies entirely within the set
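The formal version of this check, for a set \(\mathcal{C}\):

$$\forall\, \boldsymbol x, \boldsymbol y \in \mathcal{C},\ \forall\, \theta \in [0,1]:\quad \theta\boldsymbol x + (1-\theta)\boldsymbol y \in \mathcal{C}$$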
Lagrange multiplier
The method of Lagrange multipliers aims to find the local minima and maxima of a function subject to constraints
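More concretely, for minimizing \(f(\boldsymbol x)\) subject to an equality constraint \(g(\boldsymbol x) = 0\), one forms the Lagrangian and looks for its stationary points:

$$\mathfrak{L}(\boldsymbol x, \lambda) = f(\boldsymbol x) + \lambda\, g(\boldsymbol x), \qquad \nabla_{\boldsymbol x}\mathfrak{L} = \boldsymbol 0, \quad g(\boldsymbol x) = 0$$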
The step-size is also called the learning rate.
when implementing a neural net, the learning rate is a hyperparameter that controls how much to adjust the weights w.r.t. the gradient
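A toy sketch of my own (not from the book) of how the step-size \(\gamma\) enters the gradient-descent update \(x_{t+1} = x_t - \gamma\,\nabla f(x_t)\), using \(f(x) = x^2\):

```python
# Gradient descent on f(x) = x^2, whose gradient is f'(x) = 2x.
# The learning rate gamma scales how far each step moves against the gradient.
def gradient_descent(x0, gamma, steps):
    x = x0
    for _ in range(steps):
        grad = 2 * x          # gradient of f at the current point
        x = x - gamma * grad  # update rule: x_{t+1} = x_t - gamma * grad
    return x

print(gradient_descent(x0=10.0, gamma=0.1, steps=50))  # converges toward 0
print(gradient_descent(x0=10.0, gamma=1.1, steps=50))  # step too large: diverges
```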
We use the convention of row vectors for gradients
so a matrix? or just rows like this : [a b c ]?
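My reading of the convention: for a scalar-valued \(f : \mathbb{R}^n \to \mathbb{R}\) the gradient is a \(1 \times n\) row vector (so yes, like \([a\ b\ c]\)); it only becomes a matrix (the Jacobian) when \(f\) itself is vector-valued:

$$\frac{\mathrm{d}f}{\mathrm{d}\boldsymbol x} = \begin{bmatrix}\frac{\partial f}{\partial x_1} & \cdots & \frac{\partial f}{\partial x_n}\end{bmatrix} \in \mathbb{R}^{1\times n}$$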
\(\min_{x} f(x)\)
This is important. By convention, optimization problems are stated as minimization; maximizing f is equivalent to minimizing −f
Linear Program
interesting example. Seeing how the linear programs can be plotted.
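For reference, the general form of a linear program (minimize a linear objective subject to linear inequality constraints):

$$\min_{\boldsymbol x \in \mathbb{R}^d}\ \boldsymbol c^\top \boldsymbol x \quad \text{subject to} \quad \boldsymbol A\boldsymbol x \leq \boldsymbol b$$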
relative frequencies of events of interest to the total number of events that occurred
isn't this the definition of mean?
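They are related: the relative frequency is the sample mean of an event's indicator variable, and it serves as an estimate of the event's probability. For an event \(A\) occurring \(n_A\) times in \(N\) trials:

$$P(A) \approx \frac{n_A}{N} = \frac{1}{N}\sum_{i=1}^{N}\mathbb{1}\{A \text{ occurred in trial } i\}, \qquad \text{e.g., 53 heads in 100 flips} \Rightarrow P(\text{heads}) \approx 0.53$$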
abducted by aliens
lol
Theorem 4.3. A square matrix \(A \in \mathbb{R}^{n\times n}\) has \(\det(A) \neq 0\) if and only if \(\mathrm{rk}(A) = n\). In other words, A is invertible if and only if it is full rank
refer to section 2.6.2 for rank definition
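A quick 2×2 illustration: the second row is a multiple of the first, so the rank is 1 (not full) and the determinant is 0, i.e., the matrix is not invertible:

$$A = \begin{bmatrix}1 & 2\\ 2 & 4\end{bmatrix}, \qquad \det(A) = 1\cdot 4 - 2\cdot 2 = 0, \qquad \mathrm{rk}(A) = 1$$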
\(p_A(\lambda) := \det(A - \lambda I)\)
\((-1)^{k+j}\det(A_{k,j})\) is called a cofactor
\(\det(A_{k,j})\) is called a minor
$$\det(A) = \sum_{k=1}^{n}(-1)^{k+j}\,a_{kj}\,\det(A_{k,j})$$
Adding a multiple of a column/row to another one does not changedet(A)
Swapping two rows/columns changes the sign of det(A)
\(\det(\lambda A) = \lambda^{n}\det(A)\)
If A is regular (invertible), then \(\det(A^{-1}) = \frac{1}{\det(A)}\)
(4.7)
determinant of 3x3 matrix
(4.6)
determinant of 2x2 matrix
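For reference (I believe (4.6) is the 2×2 formula and (4.7) the 3×3 Sarrus rule):

$$\det\begin{bmatrix}a_{11} & a_{12}\\ a_{21} & a_{22}\end{bmatrix} = a_{11}a_{22} - a_{12}a_{21}$$

$$\det\begin{bmatrix}a_{11} & a_{12} & a_{13}\\ a_{21} & a_{22} & a_{23}\\ a_{31} & a_{32} & a_{33}\end{bmatrix} = a_{11}a_{22}a_{33} + a_{21}a_{32}a_{13} + a_{31}a_{12}a_{23} - a_{31}a_{22}a_{13} - a_{11}a_{32}a_{23} - a_{21}a_{12}a_{33}$$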
is invertible if and only if \(\det(A) \neq 0\)
Invertible: det(A) \(\neq 0\)
the determinant of a square matrix A ∈ R^{n×n} is a function that maps A onto a real number
determinant
rotation matrix
coordinates of the rotation expressed in terms of the basis vectors
rotation
linear mapping that rotates a plane by angle \(\theta\) about the origin
if the angle \(\theta > 0\), the rotation is counterclockwise
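The corresponding matrix in 2D; its columns are where the standard basis vectors land after the rotation, e.g., a 90° rotation sends \((1,0)\) to \((0,1)\):

$$R(\theta) = \begin{bmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{bmatrix}, \qquad R(90°)\begin{bmatrix}1\\0\end{bmatrix} = \begin{bmatrix}0\\1\end{bmatrix}$$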
orthogonal basis
$$<b_{i}, b_{j}> = 0, i \neq j$$
orthogonal complement
Let W be a subspace of a vector space V. Then the orthogonal complement of W is also a subspace of V. Furthermore, the intersection of W and its orthogonal complement is just the zero vector.
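The definition behind this, for completeness:

$$W^{\perp} = \{\,\boldsymbol v \in V \;:\; \langle \boldsymbol v, \boldsymbol w\rangle = 0 \ \text{for all } \boldsymbol w \in W\,\}$$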
normal vector
vector with magnitude 1, \(||w|| = 1\) and is perpendicular to the surface
Gram-Schmidt process
concatenate the (non-orthogonal, unnormalized) basis vectors into a matrix, apply Gaussian elimination, and obtain an orthonormal basis
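A quick NumPy sketch of the iterative, projection-based form of Gram-Schmidt (this may differ from the book's exact presentation; the function and example matrix here are my own):

```python
import numpy as np

def gram_schmidt(B):
    """Orthonormalize the columns of B (assumed linearly independent)."""
    Q = np.zeros_like(B, dtype=float)
    for j in range(B.shape[1]):
        v = B[:, j].astype(float)
        # subtract projections onto all previously built orthonormal vectors
        for i in range(j):
            v -= (Q[:, i] @ B[:, j]) * Q[:, i]
        Q[:, j] = v / np.linalg.norm(v)  # normalize to unit length
    return Q

B = np.array([[2.0, 1.0], [0.0, 1.0]])
Q = gram_schmidt(B)
print(np.round(Q.T @ Q, 6))  # identity matrix -> columns are orthonormal
```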
Orthonormal Basis
basis vectors = a linearly independent subset of vectors; an orthonormal basis is in particular an orthogonal basis
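Stated as conditions on the basis vectors (an orthogonal basis needs only the first condition; orthonormal requires both):

$$\langle \boldsymbol b_i, \boldsymbol b_j\rangle = 0 \ \text{ for } i \neq j, \qquad \langle \boldsymbol b_i, \boldsymbol b_i\rangle = 1$$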
3.32
a transformation by an orthogonal matrix preserves distances (vector lengths)
$$\|Ax\|^2 = (Ax)^\top(Ax) = x^\top A^\top A x = x^\top I x = x^\top x = \|x\|^2$$
this is an important proof that multiplying by an orthogonal matrix preserves the length (norm) of a vector
Orthogonal Matrix
$$AA^T = I = A^TA \Rightarrow A^{-1} = A^T$$ orthonormal columns
〈x, y〉
this is equal to 1, which does not meet the requirement of orthogonality
Orthogonality
if \(<x,y> = 0\) the vectors are orthogonal; if additionally \(||x|| = ||y|| = 1\) they are orthonormal<br /> any two lines that are perpendicular - 90 degree angle
cos ω
used to find the angle between two vectors
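The formula plus a quick check (using the dot product; \(\omega\) is the angle between x and y):

$$\cos\omega = \frac{\langle \boldsymbol x, \boldsymbol y\rangle}{\|\boldsymbol x\|\,\|\boldsymbol y\|}, \qquad \boldsymbol x = \begin{bmatrix}1\\0\end{bmatrix},\ \boldsymbol y = \begin{bmatrix}1\\1\end{bmatrix} \Rightarrow \cos\omega = \frac{1}{\sqrt{2}},\ \omega = 45°$$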
\((x, y) \mapsto d(x, y)\)
if x and y are two points in a vector space, then you can find the distance between them
$$d(x, y) := \|x - y\| = \sqrt{\langle x - y,\, x - y\rangle}$$
so a Euclidean distance is a distance from point x to point y, so the shortest path (a straight line). I don't understand the difference between distance and Euclidean distance. Isn't distance also a dot product? how would you do the calculation?
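A worked example for my own reference: the distance induced by the dot product is the Euclidean distance, while a different inner product induces a different (non-Euclidean) distance. With the dot product and x = (1, 2), y = (4, 6):

$$d(\boldsymbol x, \boldsymbol y) = \|\boldsymbol x - \boldsymbol y\| = \sqrt{(1-4)^2 + (2-6)^2} = \sqrt{9 + 16} = 5$$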
inner product returns smaller values than the dot product if x1 and x2 have the same sign
this is interesting
satisfies (3.11) is called symmetric, positive definite
symmetric positive definite
(3.9)
The inner product must be positive definite, symmetric and bilinear. test for inner product: let v = (1,2) -> <v,v> = (1)(1) - (1)(2) - (2)(1) + 2(2)(2) = 1 - 2 -2 + 8 = -3 + 8 = 5 (symmetric, bilinear and positive)
test for dot product: as per (3.5) the right side does not equal the left side
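For context, the inner product being tested above appears to be the book's example, which is what the expansion plugs v = (1, 2) into:

$$\langle \boldsymbol x, \boldsymbol y\rangle := x_1 y_1 - (x_1 y_2 + x_2 y_1) + 2\,x_2 y_2$$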
If we use the dot product defined in (3.5), we call \((V, \langle\cdot,\cdot\rangle)\) a Euclidean vector space
euclidean vector space
The pair (V, 〈·, ·〉) is called an inner product space
inner product space
A positive definite, symmetric bilinear mapping \(\Omega : V \times V \to \mathbb{R}\) is called an inner product on V
positive definite if \(\forall x \in V \setminus \{0\}: \Omega(x, x) > 0,\ \Omega(0, 0) = 0\)
symmetric if Ω(x, y) = Ω(y, x)
a symmetric matrix satisfies \(A = A^T\); for an invertible matrix, \((A^{-1})^T = (A^T)^{-1}\)
$$x^\top y = \sum_{i=1}^{n} x_i y_i$$
inner product and dot product interchangeable here
3.4
the length of a vector, i.e., its distance from the origin
Positive definite: \(\|x\| \geq 0\) and \(\|x\| = 0 \iff x = 0\)
Triangle inequality: \(\|x + y\| \leq \|x\| + \|y\|\)
Absolutely homogeneous: ‖λx‖ = |λ|‖x‖
A norm on a vector space V is a function \(\|\cdot\| : V \to \mathbb{R},\ x \mapsto \|x\|\), which assigns each vector x its length \(\|x\|\)
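Two standard examples satisfying the three properties above:

$$\|\boldsymbol x\|_1 = \sum_{i=1}^{n}|x_i| \ \text{(Manhattan norm)}, \qquad \|\boldsymbol x\|_2 = \sqrt{\sum_{i=1}^{n} x_i^2} \ \text{(Euclidean norm)}$$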