GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism
1 minute read ∼ Filed in: A paper note
Introduction
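GPipe uses a novel batch-splitting pipelining algorithm: each mini-batch is split into smaller micro-batches, which are pipelined through the model's partitions so that different accelerators work on different micro-batches at the same time. This gives almost linear speedup when a model is partitioned across multiple accelerators.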
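
To make the batch-splitting idea concrete, here is a minimal sketch of the forward-pass schedule only. It is an illustration, not the GPipe implementation: it assumes every partition takes one equal time step per micro-batch and ignores communication cost and the backward pass. The function names (`gpipe_forward_schedule`, `bubble_fraction`, `ideal_speedup`) are hypothetical and chosen for this note.

```python
def gpipe_forward_schedule(num_stages, num_micro_batches):
    """Return, for each clock tick, the (stage, micro_batch) pairs that run.

    Assumption: every stage takes exactly one tick per micro-batch.
    """
    schedule = []
    total_ticks = num_stages + num_micro_batches - 1
    for tick in range(total_ticks):
        active = []
        for stage in range(num_stages):
            # Stage k can start micro-batch m only after stage k-1 has
            # finished it, so micro-batch m reaches stage k at tick m + k.
            micro_batch = tick - stage
            if 0 <= micro_batch < num_micro_batches:
                active.append((stage, micro_batch))
        schedule.append(active)
    return schedule


def bubble_fraction(num_stages, num_micro_batches):
    """Fraction of stage-time slots left idle while the pipeline fills and drains."""
    return (num_stages - 1) / (num_micro_batches + num_stages - 1)


def ideal_speedup(num_stages, num_micro_batches):
    """Speedup over running all stages one after another on a single device,
    under the equal-stage-time, zero-communication assumption."""
    sequential_ticks = num_stages * num_micro_batches
    pipelined_ticks = num_stages + num_micro_batches - 1
    return sequential_ticks / pipelined_ticks


if __name__ == "__main__":
    for tick, active in enumerate(gpipe_forward_schedule(4, 8)):
        print(tick, active)
    print(bubble_fraction(4, 32), ideal_speedup(4, 32))
```

Under these assumptions, the idle fraction shrinks as the number of micro-batches grows relative to the number of partitions, which is why the speedup approaches the number of accelerators without quite reaching it.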