ECE411: Adaptive Control and Reinforcement Learning (Last updated: March 5, 2025)

Course Description

An introduction to adaptive control and reinforcement learning for discrete-time deterministic linear systems. Topics include: discrete-time state space models; stability of discrete time systems; parameter adaptation laws; error models in adaptive control; persistent excitation; controllability and pole placement; observability and observers; classical regulation in discrete-time; adaptive regulation; dynamic programming; value and policy iteration; Q-learning. Labs involve control design using MATLAB.

Teaching Staff

Prof. M.E. Broucke	GB342	LEC 01	broucke at control.utoronto.ca
Fatima Ghadieh	GB348	TUT01	fatima dot ghadieh at mail.utoronto.ca
Parvin Malekzadeh	GB348	PRA02	p.malekzadeh at mail.utoronto.ca
Gautham Raj Palanivel Rajan	GB348	PRA03	gauthamraj.palanivelrajan at mail.utoronto.ca
Trent Suzuki	GB348	PRA01	t.suzuki at mail.utoronto.ca

Lecture and Tutorial Schedule

Section	Day and Time	Dates
LEC 01	Mon 12-1pm	Starts January 6
	Wed 12-1pm	Starts January 8
	Fri 2-3pm	Starts January 10
TUT 01	Wed 2-3pm	Jan 15, Jan 22, Jan 29, Feb 5, Feb 12, Feb 26, Mar 5, Mar 12, Mar 19, Mar 26, Apr 2

Course Outline

The following table shows the lecture topics. The events column shows suggested reading from the course notes (distributed on Quercus) as well as quiz and exam dates. This schedule may be updated as the semester progresses, so it's a good idea to check this webpage periodically.

Week	Date	Lecture	Topics	Weekly Events
1	Jan 6	1	Introduction	Chapter 2
		2	Difference equations, z-transforms
		3	Solving difference equations using z-transforms, transfer functions
2	Jan 13	4	State space models, SS --> TF, controllable and observable canonical forms
		5	Time response	Quiz 1
		6	Solution of SS models, computing A^k
3	Jan 20	7	Transient response and pole locations
		8	Stability for discrete-time systems	Chapter 3
		9	Lyapunov's method
4	Jan 27	10	Stability for LTI systems
		11	Controllability, Pole placement	Quiz 2
		12	Deadbeat control, PBH test	Chapter 4
5	Feb 3	13	Adaptive control: static error model
		14	Adaptive control: dynamic error model
		15	Adaptive control: static error model theory	Chapter 5
6	Feb 10	16	Adaptive control: dynamic error model theory
		17	Adaptive control: dynamic error model theory	Quiz 3
		18	Observability, observers, separation principle
	Feb 17		Reading Week
7	Feb 24	19	Lab 2 preparation	Chapter 6
		20	Regulator problem
		21	Internal model principle, regulator design
8	Mar 3	22	Adaptive regulator problem	Chapter 7
		23	Adaptive regulator design
		24	Adaptive regulator design
9	Mar 10	25	Midterm, March 10, 5-7pm
		26	Lab 3 preparation	Chapter 8
		27	Dynamic programing: finite horizon
10	Mar 17	28	Dynamic programming: infinite horizon
		29	Dynamic programming: value and policy iterations
		30	Offline value and policy iterations using Q functions
11	Mar 24	31	Lab 4 preparation
		32	Reinforcement learning: temporal difference error	Quiz 4
		33	Reinforcement learning: value function approximation	Chapter 9
12	Mar 31	34	Reinforcement learning: value function approximation
		35	Reinforcement learning: Q functions
		36	Reinforcement learning: online policy and value iterations

Laboratories

Labs are Matlab-based and performed in groups of two or three in BA3114. You may select your own lab partners, or your assigned practical TA can help you form a group. Each team submits a preparation on Quercus at the start of the lab session. Each team submits their exported Matlab Livescript as a pdf or html by 5pm on the due date listed below. The preparation + lab are worth 2 + 8 = 10 marks.

Instructions for graduate students only: you may work on your own and you do not need to attend a lab session. Submit both the preparation and the report as one submission on Quercus by the due date listed below.

Lab	PRA 01	PRA 02	PRA 03	Lab Report Deadline
Lab 1	Feb 6, 12-15	Feb 13, 12-15	Feb 6, 9-12	Feb 28, 5pm
Lab 2	Feb 27, 12-15	Mar 6, 12-15	Feb 27, 9-12	Mar 14, 5pm
Lab 3	Mar 13, 12-15	Mar 20, 12-15	Mar 13, 9-12	Mar 28, 5pm
Lab 4	Mar 27, 12-15	Apr 3, 12-15	Mar 27, 9-12	Apr 11, 5pm

Grading

Labs	20%	Includes preparation, lab work, and report
Quizzes	10%	Jan 15, Jan 29, Feb 12, Mar 26
Midterm	30%	March 10, 5-7pm
Final Exam	40%	TBD
Final Projects (Graduate Students)	40%	April 17