6 Matching Annotations
  1. Dec 2022

    Annotators

  2. Nov 2022
    1. There isno back-door path through Q, as you can see. But there is a non-causal path from Q to Wthrough U: Q → E ← U → W.

      We don't know what the right side is of a Basketball game, it could be the underdog, it could be the favorite, it could be any team - anything can happend

    Annotators

  3. Sep 2022
    1. = Xa⇡(a|s) Xs 0 ,rp(s 0 , r |s, a)hr + v ⇡ (s 0 )i, for all s 2 S,

      I have an issue understanding this formula, and how it easily can be read as an expected value. Why do we merge the two sums, one over all the values of s' and the other over all the values of r. What are we trying to accomplish here?