COPUS: Co-adaptive Parallelism and Batch Size Selection in Large Language Model Training • Paper • 2604.26687 • Published 27 days ago