Hypothesis

4 Matching Annotations

Nov 2023
serpdotai.gitbook.io serpdotai.gitbook.io

Actor-critic - The Hitchhiker's Guide to Machine Learning Algorit

1
1. devinschumacher 05 Nov 2023
  
  in Public
  
  Actor-critic is a temporal difference algorithm used in reinforcement learning. It consists of two networks: the actor, which decides which action to take, and the critic, which evaluates the action produced by the actor by computing the value function and informs the actor how good the action was and how it should adjust. In simple terms, the actor-critic is a temporal difference version of policy gradient. The learning of the actor is based on a policy gradient approach.
  
  Actor-critic
  
  actor-critic machine learning algorithms
Visit annotations in context

Tags

actor-critic

machine learning algorithms

Annotators

devinschumacher

URL

serpdotai.gitbook.io/the-hitchhikers-guide-to-machine-learning-algorithms/chapters/actor-critic
Mar 2021
academic.oup.com academic.oup.com

Reflection on modern methods: when worlds collide—prediction, machine learning and causal inference

1
1. n.parfitt 15 Mar 2021
  
  in BehSci
  
  Blakely, Tony, John Lynch, Koen Simons, Rebecca Bentley, and Sherri Rose. ‘Reflection on Modern Methods: When Worlds Collide—Prediction, Machine Learning and Causal Inference’. International Journal of Epidemiology 49, no. 6 (1 December 2020): 2058–64. https://doi.org/10.1093/ije/dyz132.
  
  is:article lang:en prediction machine learning causal inference modelling method best prediction propensity scores IPTWs G computation TMLE potential outcomes epidemiology covariate algorithms
Visit annotations in context

Tags

causal inference

prediction

method

algorithms

machine learning

potential outcomes

epidemiology

best prediction

lang:en

G computation

propensity scores

modelling

TMLE

covariate

IPTWs

is:article

Annotators

n.parfitt

URL

academic.oup.com/ije/article/49/6/2058/5531243
Sep 2020
psyarxiv.com psyarxiv.com

Unifying recommendation and active learning for information filtering and recommender systems

1
1. katietaylor_99 07 Sep 2020
  
  in BehSci
  
  Yang, Scott Cheng-Hsin, Chirag Rank, Jake Alden Whritner, Olfa Nasraoui, and Patrick Shafto. ‘Unifying Recommendation and Active Learning for Information Filtering and Recommender Systems’. Preprint. PsyArXiv, 25 August 2020. https://doi.org/10.31234/osf.io/jqa83.
  
  is:preprint lang:en active learning information filtering recommender system algorithms Internet AI artificial intelligence machine learning predictive accuracy recommendation accuracy exploration-exploitation tradeoff parameterized model cognitive science computer science experimental approach
Visit annotations in context

Tags

computer science

parameterized model

cognitive science

AI

information filtering

recommender system

machine learning

algorithms

Internet

recommendation accuracy

lang:en

predictive accuracy

active learning

experimental approach

exploration-exploitation tradeoff

artificial intelligence

is:preprint

Annotators

katietaylor_99

URL

psyarxiv.com/jqa83/
Jul 2019
www.oreilly.com www.oreilly.com

Evaluating Machine Learning Models

1
1. intelligence.refinery 02 Jul 2019
  
  in Public
  
  Machine learning models are basically mathematical functions that represent the relationship between different aspects of data.
  
  Machine learning Algorithms
Visit annotations in context

Tags

Machine learning

Algorithms

Annotators

intelligence.refinery

URL

oreilly.com/ideas/evaluating-machine-learning-models/page/5/hyperparameter-tuning