Advances in Learning to Generalize to Out-of-Distribution Data