Learning from delayed rewards

doi:10.1016/0921-8890(95)00026-C

Journal ArticleDOI

Learning from delayed rewards

Ben Kröse

- 01 Oct 1995 -

Robotics and Autonomous Systems

- Vol. 15, Iss: 4, pp 233-235

TLDR

The invention relates to a circuit for use in a receiver which can receive two-tone/stereo signals which is intended to make a choice between mono or stereo reproduction of signal A or of signal B and vice versa.

About:

This article is published in Robotics and Autonomous Systems.The article was published on 1995-10-01. It has received 2861 citations till now. The article focuses on the topics: Autonomous system (mathematics) & Robotics.

Citations

PDF

Open Access

More filters

Book ChapterDOI

Teaching Machine Learning to Design Students

Bram van der Vlist, +7 more

TL;DR: This work successfully used the Embodied Intelligence method to teach machine learning to students, embodying the learning system into the Lego Mindstorm NXT platform to provide the student with a tangible tool to understand and interact with a learning system.

...read moreread less

Proceedings ArticleDOI

Simulation Studies of Multi-armed Bandits with Covariates (Invited Paper)

Nicos G. Pavlidis, +2 more

TL;DR: In this paper, the authors evaluate the performance of a number of action selection methods on the multi-armed bandit problem with covariates and show that there is a trade-off between the satisfaction of the different performance measures.

...read moreread less

Book ChapterDOI

A Gentle Introduction to Reinforcement Learning

Ann Nowé, +1 more

TL;DR: This paper provides a gentle introduction to some of the basics of reinforcement learning, as well as pointers to more advanced topics within the field.

...read moreread less

Journal ArticleDOI

Reinforced contrast adaptation

Hamid R. Tizhoosh, +1 more

- 01 Jul 2006 -

International Journal of Image and Graph...

TL;DR: It is demonstrated that Reinforcement Learning is a potential method for solving the problem of subjective evaluation of human operators and an agent is developed that uses the Q-learning algorithm.

...read moreread less

Proceedings ArticleDOI

NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration

Zhengang Li, +15 more

TL;DR: A general category of fine-grained structured pruning applicable to various DNN layers is proposed, and a comprehensive, compiler automatic code generation framework supporting different DNNs and different pruning schemes are proposed, which bridge the gap of model compression and NAS.

...read moreread less