Mark Saroufim    @marksaroufim    11/25/2021      

This README was the best visual explanation of gradient checkpointing I’ve seen. Made it click for me. Hopefully someone else finds it useful too.
Stephan Hoyer    @shoyer    11/24/2021      

Gradient checkpointing (aka rematerialization) is an easy trick that can save massive amounts of memory for calculating gradients. If you differentiate through computation involving long iterative processes (like ODE solving), learn it and make it part of your toolkit! 👇🧵
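
To make the idea in the tweet above concrete, here is a minimal pure-Python sketch, not taken from the thread or the README. It assumes a toy iterative process (one explicit Euler step of dx/dt = sin(x) with a step size of 0.01); the names `step`, `step_grad`, `grad_full`, and `grad_checkpointed` are hypothetical and chosen only for illustration. Plain reverse mode keeps every intermediate state, while the checkpointed version keeps one state per segment and recomputes the rest during the backward pass.

```python
import math

def step(x):
    # One explicit Euler step of dx/dt = sin(x), step size 0.01 (toy assumption).
    return x + 0.01 * math.sin(x)

def step_grad(x):
    # Derivative of `step` with respect to its input.
    return 1.0 + 0.01 * math.cos(x)

def grad_full(x0, n):
    # Plain reverse mode: store all n + 1 states, then apply the chain rule backwards.
    states = [x0]
    for _ in range(n):
        states.append(step(states[-1]))
    g = 1.0
    for x in reversed(states[:-1]):
        g *= step_grad(x)
    return g, len(states)

def grad_checkpointed(x0, n, segment):
    # Forward pass: keep only the state at the start of each segment.
    checkpoints = []
    x = x0
    for k in range(n):
        if k % segment == 0:
            checkpoints.append(x)
        x = step(x)
    g = 1.0
    peak = len(checkpoints)  # largest number of states held at once
    # Backward pass: rebuild each segment from its checkpoint, last segment first.
    for i in reversed(range(len(checkpoints))):
        start = i * segment
        length = min(segment, n - start)
        seg = [checkpoints[i]]
        for _ in range(length - 1):
            seg.append(step(seg[-1]))
        peak = max(peak, len(checkpoints) + len(seg))
        for x in reversed(seg):
            g *= step_grad(x)
    return g, peak

g_full, stored_full = grad_full(1.0, 100)
g_ckpt, stored_ckpt = grad_checkpointed(1.0, 100, 10)
print(g_full, stored_full)
print(g_ckpt, stored_ckpt)
```

Both functions compute the same derivative of the final state with respect to the initial one. The checkpointed version pays for its smaller memory footprint with roughly one extra forward pass, and a segment length near the square root of the step count is the usual starting point for balancing the two.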
Christian Wolf    @chriswolfvision    11/25/2021      

Sam Altman    @sama    12/4/2021      

AK    @ak92501    11/25/2021      

NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion abs: presents a unified multimodal pretrained model that can generate new or manipulate existing visual data (i.e., images and videos) for various visual synthesis tasks
