ci: parallelize merge-tree mocha tests (skip in perf mode) by frankmueller-msft · Pull Request #26657 · microsoft/FluidFramework

frankmueller-msft · 2026-03-05T20:47:13Z

Summary

Re-applies the parallelization from #26624 (reverted in #26653) with a fix for the Performance Benchmarks pipeline failure.

Enables parallel: true with jobs: 4 for merge-tree mocha tests, except when running in perf mode (--perfMode)
Perf mode is excluded because Mocha's parallel worker serialization strips methods from Hook objects (like hook.error()), which the benchmark reporter depends on. Sequential execution is also needed for measurement quality.
Bumps timeouts in applyStashedOpFarm and reconnectFarm to match other farm tests
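
The failure mode described in the summary can be illustrated with a minimal sketch. It does not use Mocha's actual serializer, which is more involved. It assumes only that an object crossing a process boundary is reduced to plain data, and uses a JSON round-trip as a stand-in. The `FakeHook` class and `roundTrip` helper are hypothetical.

```javascript
// Hypothetical stand-in for a Mocha Hook; this is not the real class.
class FakeHook {
	constructor(title) {
		this.title = title;
		this._error = undefined;
	}

	// Getter/setter-style method, similar in spirit to hook.error().
	error(err) {
		if (err === undefined) {
			return this._error;
		}
		this._error = err;
	}
}

// Stand-in for cross-process serialization (an assumption, not Mocha's code):
// own data properties survive, prototype methods do not.
function roundTrip(value) {
	return JSON.parse(JSON.stringify(value));
}

const original = new FakeHook('"before each" hook');
const copy = roundTrip(original);

console.log(typeof original.error); // "function"
console.log(typeof copy.error); // "undefined"

try {
	copy.error();
} catch (e) {
	// Calling the stripped method fails the same way the benchmark reporter did.
	console.log(e instanceof TypeError); // true
}
```

Any reporter that calls a method on an object received from a worker would hit this kind of `TypeError`, which is consistent with the `hook.error is not a function` message reported under Verification.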

Changes

packages/dds/merge-tree/.mocharc.cjs — conditional parallel config
packages/dds/merge-tree/src/test/client.applyStashedOpFarm.spec.ts — timeout bump
packages/dds/merge-tree/src/test/client.reconnectFarm.spec.ts — timeout bump

Verification

Without fix (parallel enabled unconditionally) — perf tests fail with the reported error:

TypeError: hook.error is not a function
Test Uncaught error outside test suite failed with error:  TypeError: hook.error is not a function

With fix (--perfMode guard) — both pass:

npm run perf — all benchmarks pass (sequential mode, benchmark reporter works)
npm run test:mocha — 1555 passing, 2 pending (parallel mode, ~2 min)

Test plan

Reproduce hook.error is not a function failure without the fix
npm run perf passes with the fix (sequential mode)
npm run test:mocha passes with the fix (parallel mode)
Performance Benchmarks pipeline passes in CI

🤖 Generated with Claude Code

Re-applies the parallelization from microsoft#26624, which was reverted in microsoft#26653 because it broke the Performance Benchmarks pipeline.

The fix: make parallel execution conditional on not running in perf mode. Mocha serializes Hook objects across worker processes in parallel mode, which strips methods like hook.error() that the benchmark reporter relies on. Perf tests also need sequential execution for measurement quality.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot

Pull request overview

Re-introduces Mocha parallelization for merge-tree tests in CI while disabling it for perf mode to avoid benchmark reporter issues, and adjusts farm test timeouts for parallel execution stability.

Changes:

Conditionally enables Mocha parallel mode with jobs: 4 when not running with --perfMode.
Increases timeouts for applyStashedOpFarm and reconnectFarm test suites.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
packages/dds/merge-tree/.mocharc.cjs	Enables conditional Mocha parallelization and sets a fixed job count for CI.
packages/dds/merge-tree/src/test/client.applyStashedOpFarm.spec.ts	Increases suite timeout to accommodate parallel execution overhead.
packages/dds/merge-tree/src/test/client.reconnectFarm.spec.ts	Increases suite timeout to accommodate parallel execution overhead.

Copilot · 2026-03-05T20:48:01Z

packages/dds/merge-tree/src/test/client.reconnectFarm.spec.ts

+			// Matches the timeout convention used by other farm tests (conflictFarm, obliterateFarm,
+			// rollbackFarm). Tests complete in ~15-20s normally, but need headroom under parallel execution.
+		}).timeout(30 * 10000);


30 * 10000 sets the timeout to 300,000ms (5 minutes), which doesn’t align with the comment’s stated need for modest headroom (tests ~15–20s). This looks like a multiplier typo (likely meant 30 * 1000, or another explicit ms value). Consider setting an intentional timeout (e.g., 60 * 1000) and/or extracting a named constant to avoid unit mistakes.

Copilot · 2026-03-05T20:48:02Z

packages/dds/merge-tree/src/test/client.applyStashedOpFarm.spec.ts

+			// Matches the timeout convention used by other farm tests (conflictFarm, obliterateFarm,
+			// rollbackFarm). Tests complete in ~15-20s normally, but need headroom under parallel execution.
+		}).timeout(30 * 10000);


30 * 10000 sets a 5 minute timeout, which is much larger than the described headroom and likely unintentional. Use an explicit intended value in milliseconds (and consider a shared constant with other farm tests) to prevent silent CI slowdowns and masking hung tests.
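
For reference, the arithmetic under discussion and the named-constant suggestion can be sketched as follows. The constant name `farmTestTimeoutMs` is hypothetical and does not come from the repository; its value is the `30 * 10000` expression from the diff.

```javascript
// Hypothetical shared constant; the name is illustrative only.
const farmTestTimeoutMs = 30 * 10000;

console.log(farmTestTimeoutMs); // 300000 (milliseconds)
console.log(farmTestTimeoutMs / 1000); // 300 (seconds)
console.log(farmTestTimeoutMs / (60 * 1000)); // 5 (minutes)

// For comparison, the value the review comment guessed might have been intended:
console.log(30 * 1000); // 30000 (30 seconds)
```

With such a constant, a spec could call `.timeout(farmTestTimeoutMs)` instead of repeating the inline expression, which would make the unit explicit at each call site.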

Copilot · 2026-03-05T20:48:02Z

packages/dds/merge-tree/.mocharc.cjs

+// Parallelize tests to speed up CI, but not in perf mode where parallel execution
+// breaks the benchmark reporter and degrades measurement quality.
+if (!process.argv.includes("--perfMode")) {
+	config.parallel = true;
+	// Keep job count limited to avoid excessive memory use and thread overhead in CI.
+	// 4 jobs was measured to give most of the speed up while not slowing down CI.
+	config.jobs = 4;
+}


process.argv.includes("--perfMode") only matches the exact token; it won’t detect common variants like --perfMode=true / --perfMode=1. If those forms can occur in your pipelines/scripts, parallel mode would be enabled unexpectedly. Consider checking for any arg that equals --perfMode or starts with --perfMode= (or using a small arg parser) to make the gating robust.
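
The check described in this comment could look roughly like the sketch below. The helper name `isPerfMode` is hypothetical, and whether the extra matching is needed depends on how the flag is actually passed in the repo's scripts.

```javascript
// Hypothetical helper: true when --perfMode appears either as a bare flag
// or in the --perfMode=<value> form.
function isPerfMode(argv) {
	return argv.some((arg) => arg === "--perfMode" || arg.startsWith("--perfMode="));
}

console.log(isPerfMode(["node", "mocha", "--perfMode"])); // true
console.log(isPerfMode(["node", "mocha", "--perfMode=true"])); // true
console.log(isPerfMode(["node", "mocha", "--perfModeExtra"])); // false
console.log(isPerfMode(["node", "mocha"])); // false
```

Note that this sketch also treats `--perfMode=false` as perf mode; a real argument parser would have to decide how to interpret the value.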

frankmueller-msft · 2026-03-05T21:05:10Z

Addressing Copilot's review:

Timeout 30 * 10000 (300s) — Not a typo. This matches the existing convention used by all other farm tests in this directory (conflictFarm, obliterateFarm, rollbackFarm, replay). These are long-running fuzz tests that need generous timeouts.
--perfMode variants — --perfMode is a custom mocha flag used consistently as a bare flag across the entire repo (20+ scripts). It's never passed as --perfMode=true or --perfMode=1. No additional parsing needed.

CraigMacomber · 2026-03-05T21:57:07Z

packages/dds/merge-tree/.mocharc.cjs

+	config.parallel = true;
+	// Keep job count limited to avoid excessive memory use and thread overhead in CI.
+	// 4 jobs was measured to give most of the speed up while not slowing down CI.
+	config.jobs = 4;


On the CI test run, there was a timeout of an unrelated test: https://dev.azure.com/fluidframework/public/_build/results?buildId=382483&view=ms.vss-test-web.build-test-results-tab&runId=7473433&resultId=100000&paneView=debug

It's possible you just got unlucky, but this suggests we might be increasing the risk of a timeout by overloading the CPU.

Since with this change the unit tests on CI are no longer the long pole, we are getting more speedup than we need at the moment, so it might be safer to go with a smaller thread count here and still get all or most of the benefit.

Maybe try 2 for a conservative starting point, then run the CI test pipeline a few times and see if it runs without any issues.

Failing tests indicate this likely makes test runs flakier.
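
The suggestion in this comment amounts to changing one value in the config. A minimal sketch, assuming the same conditional shape as the `.mocharc.cjs` snippet in the diff; the `applyParallel` function, its parameters, and the placeholder base config are hypothetical, introduced only so the job count can be varied:

```javascript
// Hypothetical helper mirroring the conditional in the diff,
// with the job count passed in rather than hard-coded.
function applyParallel(config, argv, jobs) {
	if (!argv.includes("--perfMode")) {
		return { ...config, parallel: true, jobs };
	}
	// Perf mode: leave the config sequential.
	return { ...config };
}

// Placeholder base config, not the package's real settings.
const base = { recursive: true };

console.log(applyParallel(base, ["--perfMode"], 2)); // { recursive: true }
console.log(applyParallel(base, [], 2)); // { recursive: true, parallel: true, jobs: 2 }
```

Trying a lower count such as 2 and re-running the CI pipeline several times, as proposed above, would show whether the timeouts persist.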

Copilot AI review requested due to automatic review settings March 5, 2026 20:47

frankmueller-msft requested a review from a team as a code owner March 5, 2026 20:47

Copilot AI reviewed Mar 5, 2026


frankmueller-msft requested review from CraigMacomber and anthony-murphy March 5, 2026 21:05

CraigMacomber previously approved these changes Mar 5, 2026


CraigMacomber reviewed Mar 5, 2026


Merge branch 'main' into ci/parallel-merge-tree-tests-v2

3814e61

frankmueller-msft wants to merge 2 commits into microsoft:main from frankmueller-msft:ci/parallel-merge-tree-tests-v2
