Marie-Hélène Burle
Prevent recompilation for the last batch that is smaller (different shape).
Parallel runs on multiple GPUs/TPUs