Large-scale AI pretraining