Hypothesis

2 Matching Annotations

Oct 2023
arxiv.org arxiv.org

2106.01345.pdf

1
1. mark.crowley 25 Oct 2023
  
  in Public
  
  (Chen, NeurIPS, 2021) Che1, Lu, Rajeswaran, Lee, Grover, Laskin, Abbeel, Srinivas, and Mordatch. "Decision Transformer: Reinforcement Learning via Sequence Modeling". Arxiv preprint rXiv:2106.01345v2, June, 2021.
  
  Quickly a very influential paper with a new idea of how to learn generative models of action prediction using SARSA training from demonstration trajectories. No optimization of actions or rewards, but target reward is an input.
  
  reinforcement-learning transformers generative-models minecraft minerl rdgrp-f23 reading_group_crowley
Visit annotations in context

Tags

minerl

generative-models

minecraft

transformers

rdgrp-f23

reinforcement-learning

reading_group_crowley

Annotators

mark.crowley

URL

arxiv.org/pdf/2106.01345
Nov 2022
www.exponentialview.co www.exponentialview.co

🔮 Azeem's commentary: On the generative wave (Part 1)

1
1. ravenscroftj 21 Nov 2022
  
  in Public
  
  “The metaphor is that the machine understands what I’m saying and so I’m going to interpret the machine’s responses in that context.”
  
  Interesting metaphor for why humans are happy to trust outputs from generative models
  
  generative models machine learning ml explainability
Visit annotations in context

Tags

ml explainability

machine learning

generative models

Annotators

ravenscroftj

URL

exponentialview.co/p/azeems-commentary-on-the-generative