Improvements to Single Evaluation SIDT Generation by mjohnson541 · Pull Request #41 · zadorlab/PySIDT

mjohnson541 · 2026-03-29T05:08:42Z

This adds a number of improvements to single evaluation SIDT generation:

Adds intelligent node selection (similar to the multi-evaluation SIDTs)
Enables parallel SIDT generation
Adds a new pruning algorithm that is compatible with parallel operation
Adds checkpointing

In particular, this framework allows a significant scale-up in the size of the datasets on which we can efficiently train single evaluation SIDTs.
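
As a rough illustration of the parallel generation idea (the commits below mention launching tree generation processes at subnodes), here is a minimal sketch. It is not PySIDT code: the dict-based nodes, the median split, and the names `make_node`, `split_samples`, `grow_subtree`, and `generate_parallel` are all hypothetical stand-ins, and a thread pool is used in place of worker processes purely to keep the example self-contained.

```python
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


def make_node(name, samples):
    """Create a node and fit its value immediately (mean of the targets)."""
    return {"name": name, "value": mean(y for _, y in samples), "children": []}


def split_samples(samples):
    """Split (x, y) samples at the median x; a toy stand-in for a tree extension."""
    xs = sorted(x for x, _ in samples)
    pivot = xs[len(xs) // 2]
    left = [s for s in samples if s[0] < pivot]
    right = [s for s in samples if s[0] >= pivot]
    return left, right


def grow_subtree(name, samples, depth=0, max_depth=2):
    """Serially grow the subtree below one subnode."""
    node = make_node(name, samples)
    if depth < max_depth and len(samples) >= 2:
        left, right = split_samples(samples)
        if left and right:
            node["children"] = [
                grow_subtree(name + "L", left, depth + 1, max_depth),
                grow_subtree(name + "R", right, depth + 1, max_depth),
            ]
    return node


def generate_parallel(samples, n_workers=2):
    """Split once at the root, then grow each subnode's subtree in its own worker."""
    root = make_node("root", samples)
    left, right = split_samples(samples)
    parts = [(n, s) for n, s in (("rootL", left), ("rootR", right)) if s]
    # Assumption: threads stand in for the separate processes a real run would use.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        futures = [pool.submit(grow_subtree, name, subset) for name, subset in parts]
        root["children"] = [f.result() for f in futures]
    return root


def count_leaves(node):
    """Count terminal nodes of the toy tree."""
    if not node["children"]:
        return 1
    return sum(count_leaves(c) for c in node["children"])


samples = [(x, float(x * x)) for x in range(8)]
tree = generate_parallel(samples)
```

Because each worker only sees the data that reaches its own subnode, the subtrees can be grown independently and grafted back onto the root once all workers finish.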

The new pruning algorithm prunes the tree based on an interpolated sequence of uncertainty cutoffs, selecting the cutoff with the lowest validation error. It can be used together with continuous pruning during generation, but is particularly important for parallel generation, where continuous pruning is not possible.
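
The cutoff scan described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the PySIDT implementation: the `Node` class, `predict`, `select_cutoff`, and `prune` are hypothetical names, the cutoffs are assumed to be linearly interpolated between the smallest and largest node uncertainty, and a node is assumed to be pruned (with its descendants) when its uncertainty exceeds the cutoff.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Toy tree node (hypothetical; not PySIDT's node class)."""
    name: str
    value: float        # prediction made when evaluation stops at this node
    uncertainty: float  # uncertainty estimate for that prediction
    children: list = field(default_factory=list)


def predict(root, path, cutoff):
    """Descend along `path`, stopping at the first child that is missing or
    whose uncertainty exceeds `cutoff` (i.e. would be pruned)."""
    node = root
    for name in path:
        child = next((c for c in node.children if c.name == name), None)
        if child is None or child.uncertainty > cutoff:
            break
        node = child
    return node.value


def all_nodes(node):
    """Yield the node and all of its descendants."""
    yield node
    for child in node.children:
        yield from all_nodes(child)


def select_cutoff(root, validation, n_cutoffs=4):
    """Scan linearly interpolated cutoffs and return the one with the lowest
    mean absolute validation error (ties go to the smaller cutoff)."""
    uncs = [n.uncertainty for n in all_nodes(root) if n is not root]
    lo, hi = min(uncs), max(uncs)
    cutoffs = [lo + (hi - lo) * i / (n_cutoffs - 1) for i in range(n_cutoffs)]

    def error(cutoff):
        total = sum(abs(predict(root, p, cutoff) - y) for p, y in validation)
        return total / len(validation)

    return min(cutoffs, key=error)


def prune(node, cutoff):
    """Remove every child (and its subtree) whose uncertainty exceeds `cutoff`."""
    node.children = [c for c in node.children if c.uncertainty <= cutoff]
    for child in node.children:
        prune(child, cutoff)


# Toy tree: node A1 is an overfit, high-uncertainty rule.
root = Node("root", 0.0, 0.0, [
    Node("A", 1.0, 0.25, [Node("A1", 5.0, 1.0)]),
    Node("B", -1.0, 0.5),
])
validation = [(("A", "A1"), 1.2), (("A",), 0.9), (("B",), -1.1)]

best = select_cutoff(root, validation)
prune(root, best)
```

Since the scan only needs the finished tree and a validation set, it can run after generation, which fits the case where subtrees are built in parallel.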

mjohnson541 added 12 commits March 28, 2026 21:39

Properly input coordination number list to regularization

2ab6d55

make weigh_node_selection_by_occurrance a property as SubgraphIsomorphicDecisionTree level

c1460e4

use intelligent node selection for single evaluation tree training

a2820d3

add fit_node function to SubgraphIsomorphicDecisionTree

62647c1

Previously we fit rules for the whole tree at the end, but this precluded more intelligent node selection algorithms.

fit nodes when adding them to the single evaluation SIDT

13755a2

remove fit_tree function in SubgraphIsomorphicDecisionTree (single evaluation)

bffc89f

add function for launching tree generation processes at subnodes

190ec89

implement parallel single evaluation SIDT generation

5360b6c

add adsorption energies example to CI tests

f0fd580

added multiprocess to environment and build files

a95d7f1

update single eval surface diffusion example

69b654d

mjohnson541 force-pushed the single_eval_improvements_parallel branch from e8413ad to b161527 Compare March 29, 2026 21:40

mjohnson541 added 2 commits March 29, 2026 14:45

add checkpointing for SubgraphIsomorphicDecisionTree

8a3d7df

add RMG Habstraction low rank datasets

4e317fd

mjohnson541 force-pushed the single_eval_improvements_parallel branch 2 times, most recently from b9c98d0 to 2b627f5 Compare March 29, 2026 22:00

mjohnson541 added 2 commits March 29, 2026 15:05

add Hydrogen abstraction example for parallelization, checkpointing, and post generation pruning

5b8e6be

skip initial node fitting for nodes with already fit rules

96d8443

mjohnson541 force-pushed the single_eval_improvements_parallel branch from 2b627f5 to 96d8443 Compare March 29, 2026 22:05

mjohnson541 added 5 commits March 31, 2026 13:43

document r_ncoord option in SubgraphIsomorphicDecisionTree

d650539

update generate_tree documentation for SubgraphIsomorphicDecisionTree

b022ccc

add docstring for prune function in SubgraphIsomorphicDecisionTree

4527e3b

add extension generation for atom lone pairs

bb6f8cf

handle lone pairs in tree generation/regularization

42f045e

also fix some handling for coordination number extensions

mjohnson541 merged commit 6d06117 into main Mar 31, 2026
1 check passed

mjohnson541 deleted the single_eval_improvements_parallel branch March 31, 2026 20:50

mjohnson541 merged 21 commits into main from single_eval_improvements_parallel
