MAI4CAREU - Machine Learning: Model-free Reinforcement Learning Created byLaia Güell Paule|Updated27 July 2024The University of Cyprus's MSc Artificial Intelligence is part of the Master programmes in Artificial Intelligence 4 Careers in Europe (MAI4CAREU). One of Master's programme's courses, MAI612 - Machine Learning is split up into several lectures. Taught by Vassilis Vassiliades, PhD, the eighteenth lecture of the MAI612 - Machine Learning course focuses on Model-free Reinforcement Learning.Learning outcomesThe lesson is divided in five parts: Multi-armed bandits, Model-free Prediction, Between MC and TD: Multi-Step TD, Temporal-Difference Learning for Control, and Optimistic Initialization. In this lesson you will learn about:The simpler framework of multi-armed bandits and the exploration-exploitation tradeoffModel-free prediction to estimate values in an unknown MDP: Monte Carlo, Temporal-Difference (TD) Learning, and Multi-step TD learningModel-free control to optimise values in an unknown MDP: SARSA and Q-learning algorithmsOptimistic initialization of the value function to help explorationLearning contentWebsite linkMAI4CAREU - Lecture 18 - Model-free Reinforcement LearningTarget audienceDigital skills for ICT professionals and other digital experts.Digital skill levelIntermediateAdvancedGeographic scope - CountryAustriaBelgiumBulgariaCyprusRomaniaSloveniaCroatiaCzech republicDenmarkEstoniaFinlandFranceGermanyGreeceHungaryItalyIrelandMaltaLatviaLithuaniaLuxembourgNetherlandsPortugalPolandSwedenSpainSlovakiaShow moreShow less Share this page Log in to comment