Deep learning with Megatron