Information about Test

  1. Reinforcement learning

    reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning

  2. Deep reinforcement learning

    Deep reinforcement learning (DRL) uses deep learning and reinforcement learning principles in order to create efficient algorithms that can be applied

  3. Model-free (reinforcement learning)

    In reinforcement learning (RL), a model-free algorithm (as opposed to a model-based one) is an algorithm which does not use the transition probability

  4. Q-learning

    Q-learning is a model-free reinforcement learning algorithm. The goal of Q-learning is to learn a policy, which tells an agent what action to take under

  5. Reinforcement

    In behavioral psychology, reinforcement is a consequence applied that will strengthen an organism's future behavior whenever that behavior is preceded

  6. Machine learning

    human user for labeling. Reinforcement learning algorithms are given feedback in the form of positive or negative reinforcement in a dynamic environment

  7. Neural architecture search

    hyperparameter optimization and is a subfield of automated machine learning (AutoML). Reinforcement learning (RL) can underpin a NAS search strategy. Zoph et al. applied

  8. Multi-objective reinforcement learning

    Multi-objective reinforcement learning (MORL) is a form of reinforcement learning concerned with conflicting alternatives. It is distinct from multi-objective

  9. Temporal difference learning

    Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate

  10. Softmax function

    model which uses the softmax activation function. In the field of reinforcement learning, a softmax function can be used to convert values into action probabilities