ECE411: Adaptive Control and Reinforcement Learning (Last updated: March 5, 2025)

Course Description

An introduction to adaptive control and reinforcement learning for discrete-time deterministic linear systems. Topics include: discrete-time state space models; stability of discrete time systems; parameter adaptation laws; error models in adaptive control; persistent excitation; controllability and pole placement; observability and observers; classical regulation in discrete-time; adaptive regulation; dynamic programming; value and policy iteration; Q-learning. Labs involve control design using MATLAB.


Teaching Staff

Prof. M.E. Broucke GB342 LEC 01 broucke at control.utoronto.ca
Fatima Ghadieh GB348 TUT01 fatima dot ghadieh at mail.utoronto.ca
Parvin Malekzadeh GB348 PRA02 p.malekzadeh at mail.utoronto.ca
Gautham Raj Palanivel Rajan GB348 PRA03 gauthamraj.palanivelrajan at mail.utoronto.ca
Trent Suzuki GB348 PRA01 t.suzuki at mail.utoronto.ca


Lecture and Tutorial Schedule

Section Day and Time Dates
LEC 01 Mon 12-1pm Starts January 6
  Wed 12-1pm Starts January 8
  Fri 2-3pm Starts January 10
TUT 01 Wed 2-3pm Jan 15, Jan 22, Jan 29, Feb 5, Feb 12, Feb 26, Mar 5, Mar 12, Mar 19, Mar 26, Apr 2


Course Outline

The following table shows the lecture topics. The events column shows suggested reading from the course notes (distributed on Quercus) as well as quiz and exam dates. This schedule may be updated as the semester progresses, so it's a good idea to check this webpage periodically.

Week Date Lecture Topics Weekly Events
1 Jan 6 1       Introduction Chapter 2
    2 Difference equations, z-transforms  
    3 Solving difference equations using z-transforms, transfer functions  
2 Jan 13 4 State space models, SS --> TF, controllable and observable canonical forms  
    5 Time response Quiz 1
    6 Solution of SS models, computing A^k  
3 Jan 20 7 Transient response and pole locations  
    8 Stability for discrete-time systems Chapter 3
    9 Lyapunov's method  
4 Jan 27 10 Stability for LTI systems  
    11 Controllability, Pole placement Quiz 2
    12 Deadbeat control, PBH test Chapter 4
5 Feb 3 13 Adaptive control: static error model  
    14 Adaptive control: dynamic error model  
    15 Adaptive control: static error model theory Chapter 5
6 Feb 10 16 Adaptive control: dynamic error model theory  
    17 Adaptive control: dynamic error model theory Quiz 3
    18 Observability, observers, separation principle  
  Feb 17   Reading Week  
7 Feb 24 19 Lab 2 preparation Chapter 6
    20 Regulator problem  
    21 Internal model principle, regulator design  
8 Mar 3 22 Adaptive regulator problem Chapter 7
    23 Adaptive regulator design  
    24 Adaptive regulator design  
9 Mar 10 25 Midterm, March 10, 5-7pm  
    26 Lab 3 preparation Chapter 8
    27 Dynamic programing: finite horizon  
10 Mar 17 28 Dynamic programming: infinite horizon  
    29 Dynamic programming: value and policy iterations  
    30 Offline value and policy iterations using Q functions  
11 Mar 24 31 Lab 4 preparation  
    32 Reinforcement learning: temporal difference error Quiz 4
    33 Reinforcement learning: value function approximation Chapter 9
12 Mar 31 34 Reinforcement learning: value function approximation  
    35 Reinforcement learning: Q functions  
    36 Reinforcement learning: online policy and value iterations  


Laboratories

Labs are Matlab-based and performed in groups of two or three in BA3114. You may select your own lab partners, or your assigned practical TA can help you form a group. Each team submits a preparation on Quercus at the start of the lab session. Each team submits their exported Matlab Livescript as a pdf or html by 5pm on the due date listed below. The preparation + lab are worth 2 + 8 = 10 marks.

Instructions for graduate students only: you may work on your own and you do not need to attend a lab session. Submit both the preparation and the report as one submission on Quercus by the due date listed below.

Lab PRA 01 PRA 02 PRA 03 Lab Report Deadline
Lab 1 Feb 6, 12-15 Feb 13, 12-15 Feb 6, 9-12 Feb 28, 5pm
Lab 2 Feb 27, 12-15 Mar 6, 12-15 Feb 27, 9-12 Mar 14, 5pm
Lab 3 Mar 13, 12-15 Mar 20, 12-15 Mar 13, 9-12 Mar 28, 5pm
Lab 4 Mar 27, 12-15 Apr 3, 12-15 Mar 27, 9-12 Apr 11, 5pm


Grading

Labs 20% Includes preparation, lab work, and report
Quizzes 10% Jan 15, Jan 29, Feb 12, Mar 26
Midterm 30% March 10, 5-7pm
Final Exam 40% TBD
Final Projects (Graduate Students) 40% April 17