SOW-BKI258
Reinforcement Learning
Course infoSchedule
Course moduleSOW-BKI258
Credits (ECTS)3
CategoryB2 (Second year bachelor)
Language of instructionEnglish
Offered byRadboud University; Faculty of Social Sciences; Artificial Intelligence;
Lecturer(s)
Lecturer
M. Aslanimoghanloo
Other course modules lecturer
Examiner
prof. dr. ir. J.H.P. Kwisthout
Other course modules lecturer
Lecturer
dr. Y. Qin
Other course modules lecturer
Coordinator
dr. Y. Qin
Other course modules lecturer
Contactperson for the course
dr. Y. Qin
Other course modules lecturer
Academic year2023
Period
PER3  (29/01/2024 to 05/04/2024)
Starting block
PER3
Course mode
full-time
Remarks-
Registration using OSIRISYes
Course open to students from other facultiesYes
Pre-registrationNo
Waiting listNo
Placement procedure-
Aims
The text of this course will be revised later.

Upon successful completion of the course, you will:
- Understand the foundations and core ideas of reinforcement learning algorithms and apply them to small-scale problems
- Formalize a learning problem as a Markov Decision Process (MDP) and understand the use and relevance of value functions in decision-making
- Find exact solutions (optimal value functions and optimal policies) to simple RL problems and apply these ideas to real-world scenarios
- Understand the core classes of methods to solve MDPs, their similarities and differences, strengths and weaknesses and understand their application to different decision problems
- Be prepared to take on more advanced topics in the domains of AI and deep learning.
Content
Cognitive agents learn by interaction with the environments they are embedded in. Exercising the connection between sensors and actuators (sensorimotor loop) allows the agent to modify its inputs, explore the consequences of its actions and decisions in order to understand how to optimize its behavior to achieve a desired goal. Such interactions are foundational to mammalian cognition and behavior and underlie many theories of learning, decision making and natural intelligence.

Beyond its relevance in the domains of psychology and neuroscience, as a core learning mechanism in biological intelligence, Reinforcement Learning (RL) has broad practical applicability in domains ranging from machine learning, operations research or control engineering.
This course will introduce the core ideas, methods and techniques used in RL algorithms.
 
Level
AI-B2
Presumed foreknowledge
Knowledge of the following fields is strongly advised: Linear Algebra, Probability Theory and Programming.
Test information
Exam: 60% (one retake possible)
Individual Assignment: 40%

Average grade: >5.5

Grading for the different parts will be made public in Brightspace. Only the final grade will be published in Osiris at the end of the course
Specifics
Please sign up for any course at (https://portal.ru.nl/home), it is obligatory.

Students who are enrolled for a course are also provisionally registered for the exam. 

Resit: Manual register at (https://portal.ru.nl/home) until five working days prior to the date of the exam. No delayed registration is possible. 

We urge you to always read the course information on Brightspace. 
 
Required materials
Book
Richard S. Sutton and Andrew G. Barto (2018). [Reinforcement Learning: An Introduction](http://incompleteideas.net/book/the-book.html). MIT Press, 2nd Edition. (Part I) Additional literature and resources will be provided for each topic
Title:Reinforcement Learning: An Introduction
Author:Richard S. Sutton and Andrew G. Barto
Publisher:MIT Press
Edition:2

Recommended materials
Book
Mathematically inclined students may complement with: Csaba Szepesvari (2010). [Algorithms for Reinforcement Learning](https://sites.ualberta.ca/~szepesva/rlbook.html). Morgan and Claypool Publishers.
Title:Algorithms for Reinforcement Learning
Author:Csaba Szepesvari
Publisher:Morgan and Claypool Publishers

Instructional modes
Lab sessions

Remark
Recommended attendance

Lecture

Remark
Recommended attendance

Tests
Final grade
Test weight1
Test typeExam
OpportunitiesBlock PER3, Block PER4