New blog: Three ways to estimate the KL penalty — K1 / K2 / K3 derivations and which one to actually use.