Hypothesis

4 Matching Annotations

Nov 2023
serpdotai.gitbook.io

Actor-critic - The Hitchhiker's Guide to Machine Learning Algorit

1
1. devinschumacher 05 Nov 2023
  
  in Public
  
  Actor-critic is a temporal difference algorithm used in reinforcement learning. It consists of two networks: the actor, which decides which action to take, and the critic, which evaluates the action produced by the actor by computing the value function and informs the actor how good the action was and how it should adjust. In simple terms, the actor-critic is a temporal difference version of policy gradient. The learning of the actor is based on a policy gradient approach.
  
  Actor-critic
  
  actor-critic machine learning algorithms
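
  The actor/critic split described in the quoted passage can be sketched on a toy problem. The following is a minimal illustration only, not the implementation from the annotated guide: the four-state corridor, the reward of 1 at the right end, the tabular (rather than neural network) actor and critic, and every name and hyperparameter below are assumptions made for the example. The critic keeps a state-value estimate and computes the temporal difference (TD) error; the actor is a softmax policy whose preferences are nudged by a policy-gradient step scaled by that same TD error.

```python
import math
import random

# Hypothetical toy environment: a corridor with states 0..3, state 3 is terminal.
N_STATES = 4
ACTIONS = (0, 1)  # 0 = move left, 1 = move right
GAMMA = 0.9       # discount factor (assumed value)


def softmax(prefs):
    """Turn action preferences into probabilities (numerically stable)."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def step(state, action):
    """Move one cell; reward 1.0 only on reaching the terminal state."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def train(episodes=2000, alpha_actor=0.1, alpha_critic=0.1, seed=0):
    rng = random.Random(seed)
    theta = [[0.0, 0.0] for _ in range(N_STATES)]  # actor: action preferences
    values = [0.0] * N_STATES                      # critic: state-value estimates
    for _ in range(episodes):
        state = 0
        for _ in range(100):  # step cap so an episode always ends
            probs = softmax(theta[state])
            action = rng.choices(ACTIONS, weights=probs)[0]
            nxt, reward, done = step(state, action)
            # Critic: one-step TD error, with the terminal state valued at 0.
            target = reward + (0.0 if done else GAMMA * values[nxt])
            td_error = target - values[state]
            values[state] += alpha_critic * td_error
            # Actor: gradient of log softmax is (indicator - probability),
            # scaled by the critic's TD error.
            for a in ACTIONS:
                grad = (1.0 if a == action else 0.0) - probs[a]
                theta[state][a] += alpha_actor * td_error * grad
            state = nxt
            if done:
                break
    return theta, values


theta, values = train()
```

  In this sketch, `softmax(theta[s])` gives the learned action probabilities for state `s`, and `values[s]` is the critic's estimate for that state. A positive TD error raises the preference for the action just taken and a negative one lowers it, which is the "how good the action was and how it should adjust" signal from the quoted description.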

Tags

actor-critic

machine learning algorithms

Annotators

devinschumacher

URL

serpdotai.gitbook.io/the-hitchhikers-guide-to-machine-learning-algorithms/chapters/actor-critic
Mar 2021
academic.oup.com

Reflection on modern methods: when worlds collide—prediction, machine learning and causal inference

1
1. n.parfitt 15 Mar 2021
  
  in BehSci
  
  Blakely, Tony, John Lynch, Koen Simons, Rebecca Bentley, and Sherri Rose. ‘Reflection on Modern Methods: When Worlds Collide—Prediction, Machine Learning and Causal Inference’. International Journal of Epidemiology 49, no. 6 (1 December 2020): 2058–64. https://doi.org/10.1093/ije/dyz132.
  
  is:article lang:en prediction machine learning causal inference modelling method best prediction propensity scores IPTWs G computation TMLE potential outcomes epidemiology covariate algorithms

Tags

IPTWs

best prediction

prediction

modelling

TMLE

potential outcomes

machine learning

method

propensity scores

covariate

causal inference

algorithms

lang:en

is:article

G computation

epidemiology

Annotators

n.parfitt

URL

academic.oup.com/ije/article/49/6/2058/5531243
Sep 2020
psyarxiv.com

Unifying recommendation and active learning for information filtering and recommender systems

1
1. katietaylor_99 07 Sep 2020
  
  in BehSci
  
  Yang, Scott Cheng-Hsin, Chirag Rank, Jake Alden Whritner, Olfa Nasraoui, and Patrick Shafto. ‘Unifying Recommendation and Active Learning for Information Filtering and Recommender Systems’. Preprint. PsyArXiv, 25 August 2020. https://doi.org/10.31234/osf.io/jqa83.
  
  is:preprint lang:en active learning information filtering recommender system algorithms Internet AI artificial intelligence machine learning predictive accuracy recommendation accuracy exploration-exploitation tradeoff parameterized model cognitive science computer science experimental approach

Tags

artificial intelligence

is:preprint

machine learning

computer science

cognitive science

recommendation accuracy

algorithms

parameterized model

Internet

experimental approach

lang:en

active learning

AI

recommender system

predictive accuracy

exploration-exploitation tradeoff

information filtering

Annotators

katietaylor_99

URL

psyarxiv.com/jqa83/
Jul 2019
www.oreilly.com

Evaluating Machine Learning Models

1
1. intelligence.refinery 02 Jul 2019
  
  in Public
  
  Machine learning models are basically mathematical functions that represent the relationship between different aspects of data.
  
  Machine learning Algorithms

Tags

Algorithms

Machine learning

Annotators

intelligence.refinery

URL

oreilly.com/ideas/evaluating-machine-learning-models/page/5/hyperparameter-tuning