Chloe (Soo hwa) Hong

Reinforcement Learning as Probabilistic Inference - Part 3

29 November 2024

From the optimal action conditionals, we recover the optimal policy through backward messages, relate it to value functions in RL, and connect probabilistic inference to maximum entropy reinforcement learning.

Reinforcement Learning as Probabilistic Inference - Part 2

29 November 2024

Building on part 1, we establish decision-making as a probabilistic graphical model, connect RL to a trajectory prediction problem, and formulate optimal trajectory prediction as probabilistic inference.

Reinforcement Learning as Probabilistic Inference - Part 1

29 November 2024

This series of posts explores the intersection of reinforcement learning and probabilistic graphical models, delving into the optimization of policies through inference, bridging the gap between planning and decision-making under uncertainty.