COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training • Paper • 2604.26687 • Published 23 days ago