1 Matching Annotations

Nov 2023
serpdotai.gitbook.io serpdotai.gitbook.io

Actor-critic - The Hitchhiker's Guide to Machine Learning Algorit

1
1. devinschumacher 05 Nov 2023
  
  in Public
  
  Actor-critic is a temporal difference algorithm used in reinforcement learning. It consists of two networks: the actor, which decides which action to take, and the critic, which evaluates the action produced by the actor by computing the value function and informs the actor how good the action was and how it should adjust. In simple terms, the actor-critic is a temporal difference version of policy gradient. The learning of the actor is based on a policy gradient approach.
  
  Actor-critic
  
  actor-critic machine learning algorithms
Visit annotations in context

Tags

actor-critic

machine learning algorithms

Annotators

devinschumacher

URL

serpdotai.gitbook.io/the-hitchhikers-guide-to-machine-learning-algorithms/chapters/actor-critic

Devin Schumacher

Founder @ SERP

Annotations: 1

Joined: October 30, 2023

Link: devinschumacher.com

ORCID: 0000-0001-8620-2583

Top tags 2