SAPP Search Studio

SAPP Unified Standalone

Unified Parallel Strategy Search

A research system for unified ND and PPB strategy search in large-scale LLM training systems.

This project investigates parallel strategy design for large-model training on large-scale clusters. It supports end-to-end experimental studies, from ND exploration to PPB analysis, while keeping the leading configurations easy to compare and inspect.

Author: Ruiwen WANG, Thibaut Tachon, Philips Fang, Chong Li, Pierre Leca Email: wangrw0124@gmail.com

Research system Large-scale training studies

Start a study to populate the workspace

Begin with a YAML configuration, choose the framework and device, and launch a unified ND + PPB study. The resulting workspace will summarize the leading strategies and expose detailed PPB artifacts for further analysis.

Best end time

Feasible candidates

Best PPB solver peak

PPB cache hits

Search time

ND strategies

PPB-refined candidates

Overview

Search overview

Leading strategies after ND generation, performance estimation, and PPB evaluation.

The comparison figure will appear after a completed study.

Summary

Leading configuration

The summary figure will appear after a completed study.

Evaluation note

ND vs PPB counts

ND counts the full feasible search space. PPB counts the leading strategies advanced to the pipeline balance stage.

Compare

Candidate ranking

Rank	Strategy	End time	ND peak mem (MB)	PPB solver peak (MB)	Perf score	Cached	Feasible

Deep dive

Strategy explorer

Inspect one of the leading strategies in the current study

Simulation

Pipeline timeline

Generate the solved pipeline schedule for one candidate to inspect stage timing and memory evolution over time.

PPB YAML

Candidate YAML view

Layer	offset	recompute	select_recompute	select_comm_recompute

Evaluating candidate strategies