Exponential-Min and Gumbel-Max
Exponential-min and Gumbel-max tricks for reformulating sampling from a discrete distribution as argmin and argmax, making the sampling operation differentiable.
Posts and notes on Machine Learning.