Covers algorithms for optimal sequential decision making in the presence of uncertainty. Mathematical formalisms include the Markov decision process (MDP), partially observable Markov decision process (POMDP), and Games. Solution techniques include exact dynamic programming, Monte Carlo tree search, deep reinforcement learning, and alpha vector value approximation for POMDPs. Assignments require programming in a high level language (Julia as of 2023). Previously offered as a special topics course. Recommended prerequisite: fluency in a high-level programming language, willingness to learn another language if required for homework assignments and basic understanding of probability. Same as CSCI 5264.