Reinforcement Learning as Probabilistic Inference - Part 3
29 November 2024
From the optimal action conditionals, we recover the optimal policy through backward messages, relate it to value functions in RL, and connect probabilistic inference to maximum entropy reinforcement learning.
1Reinforcement Learning as Probabilistic Inference - Part 2
29 November 2024
Building on part 1, we establish decision-making as a probabilistic graphical model, connect RL to a trajectory prediction problem, and formulate optimal trajectory prediction as probabilistic inference.
2Reinforcement Learning as Probabilistic Inference - Part 1
29 November 2024
This series of posts explores the intersection of reinforcement learning and probabilistic graphical models, delving into the optimization of policies through inference, bridging the gap between planning and decision-making under uncertainty.
3