feat!: SPRAS revision by tristan-f-r · Pull Request #320 · Reed-CompBio/spras

tristan-f-r · 2025-07-09T20:51:39Z

This change means that output files will not be reused whenever SPRAS is updated, furthering the immutability goal necessary to get OSDF integration working for SPRAS benchmarking. ('updated' depends on the git commit hash or the actual SPRAS release version)

This adds the unique spras_revision to every single paramater combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic, non-seeded algorithms when datasets are immutable.

This has the added benefit of allowing SPRAS users to simply upgrade their SPRAS version without needing to clear output, which complements #380. The refactored test also partially covers #165 and #45. (This is also where the majority of the code comes from: The actual feature patch here is a 50 line change.)

See #321 implemented by #335 for handling nondeterministic algorithms / seeded algorithms.

To make this change, a significant test refactor in test/analysis was needed to remove hardcoded paths (which contained the hashes being modified per-commit in this PR.) It turns out that whenever we make any change to the hash, this [original: the patch here fixes this] test breaks! That's why this PR is depended on by so many other PRs.

This adds the unique spras_revision to every single paramater combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic algorithms.

agitter

I finished another partial revision. I still haven't thought about the testing implications carefully.

spras/config/config.py

agitter · 2026-01-17T04:25:52Z

spras/config/config.py

+        return f"v{importlib.metadata.version('spras').replace('.', '_')}"
+
+def attach_spras_revision(label: str) -> str:
+    return f"{label}_{spras_revision()}"


I'm thinking through whether there are other ways to get this same behavior without making filenames longer. The subdirectory names that already follow the --params- pattern are long already, and now we're extending them. The only other idea is to use subdirectories instead, which isn't necessarily an improvement.

This is a little concerning, though thanks to #434, I'm not too worried about files being the primary interface for organizing SPRAS output. We should still document this file directory naming once we have actual SPRAS workflow documentation.

Unresolving this for now so we can get broader feedback from @annaritz and @ntalluri. I can be a meeting agenda item if needed.

Snakefile

test/analysis/input/egfr.yaml

whoops! accidentally feature-regressed

agitter

A few more comments. I still haven't looked through all the test code.

test/analysis/input/egfr.yaml

spras/config/config.py

tristan-f-r · 2026-01-31T05:08:43Z

Since both past approaches do not scale well, I've decided to only focus on the RECORD file.

This fails specifically in the case where SPRAS is somehow ran without being installed as a python module, and I can't think of a plausible scenario where this happens.

feat: spras_revision

b0327a2

This adds the unique spras_revision to every single paramater combination (before hashing) and the dataset label, to provide OSDF support on the level of deterministic algorithms.

tristan-f-r marked this pull request as ready for review July 9, 2025 20:51

tristan-f-r added enhancement New feature or request needed for benchmarking Priority PRs needed for the benchmarking paper labels Jul 9, 2025

style: fmt

8cec738

tristan-f-r changed the title ~~feat: spras_revision~~ feat: SPRAS revision Jul 9, 2025

This comment was marked as outdated.

Sign in to view

tristan-f-r marked this pull request as draft July 9, 2025 21:37

tristan-f-r mentioned this pull request Jul 10, 2025

fix: custom installation of DOMINO #235

Open

1 task

tristan-f-r added 2 commits July 10, 2025 19:32

test: summary

5683392

docs(test_summary): mention preprocessing motivation

af90ce0

tristan-f-r marked this pull request as ready for review July 10, 2025 19:34

tristan-f-r changed the title ~~feat: SPRAS revision~~ feat!: SPRAS revision Jul 10, 2025

tristan-f-r added 7 commits July 10, 2025 12:44

test(analysis/summary): use input from /input instead

6141874

docs(test/analysis): mention dual integration testing

440a2d4

test(analysis/summary): use test/analysis provided gold standard

d9e852b

style: fmt

abb0eb9

chore: don't repeat docs inside analysis configs

60185fc

feat: get working with cytoscape

e6bd6a0

style: fmt

f9a3081

tristan-f-r mentioned this pull request Jul 11, 2025

Guaranteed immutable output #323

Open

test: remove nondet from analysis

77fc3b4

This comment was marked as outdated.

Sign in to view

fix: get input pathways at runtime

0592850

This was referenced Jul 11, 2025

Update summary.py to include parameter combinations #194

Merged

feat!: typed PRA#run #329

Merged

This was referenced Jul 21, 2025

chore: bump dependencies #310

Merged

Integration testing instead of artifacts #339

Open

feat: algorithm attributions #345

Open

tristan-f-r added the P-high This is a blocker for many PRs/issues/features label Jul 24, 2025

github-actions bot added the merge-conflict This PR has merge conflicts. label Jan 9, 2026

tristan-f-r added 2 commits January 9, 2026 18:47

Merge branch 'main' into hash

eec09f2

test: fix files

a8d71bd

tristan-f-r added the awaiting-author Author of the PR needs to fix something from a review / etc. label Jan 10, 2026

github-actions bot removed the merge-conflict This PR has merge conflicts. label Jan 10, 2026

tristan-f-r added P-high This is a blocker for many PRs/issues/features and removed awaiting-author Author of the PR needs to fix something from a review / etc. P-medium medium prirotity; this is needed for some external service or another PR labels Jan 10, 2026

tristan-f-r mentioned this pull request Jan 13, 2026

refactor: separate statistic computation #411

Open

1 task

tristan-f-r added the tuning Workflow-spanning algorithm tuning label Jan 13, 2026

agitter reviewed Jan 17, 2026

View reviewed changes

tristan-f-r added 5 commits January 17, 2026 17:14

apply suggestions

e12fc75

clean, fix: strip project_directory

977bf5a

fix: correct equality on not SPRAS pyproject.toml

8500bcb

chore: grammar

112db39

chore: move attach_spras_revision out of Snakefile

c7262ed

github-actions bot added the merge-conflict This PR has merge conflicts. label Jan 31, 2026

tristan-f-r added 2 commits January 30, 2026 20:19

Merge branch 'main' into hash

f69a0f3

fix: properly resolve merge conflict

72e30bf

github-actions bot removed the merge-conflict This PR has merge conflicts. label Jan 31, 2026

tristan-f-r added 2 commits January 31, 2026 04:28

fix: undo mistaken merge conflict

c71b652

whoops! accidentally feature-regressed

chore: drop unnecessary self.datasets initialization

6b941e0

agitter reviewed Jan 31, 2026

View reviewed changes

test/analysis/input/egfr.yaml Show resolved Hide resolved

spras/config/config.py Outdated Show resolved Hide resolved

spras/config/config.py Outdated Show resolved Hide resolved

spras/config/config.py Outdated Show resolved Hide resolved

tristan-f-r added 2 commits January 31, 2026 05:04

feat: dynamic spras versioning

fbf0ceb

chore: error handling on setup.pu

edc0369

tristan-f-r added 4 commits January 30, 2026 21:10

docs: note on git commit hashes

3a1251d

chore: drop git magic

d330d6a

feat: correctly parse RECORD

5e31d06

style: fmt

dba2b45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!: SPRAS revision#320

feat!: SPRAS revision#320
tristan-f-r wants to merge 45 commits intoReed-CompBio:mainfrom
tristan-f-r:hash

tristan-f-r commented Jul 9, 2025 •

edited

Loading

Uh oh!

This comment was marked as outdated.

This comment was marked as outdated.

agitter left a comment

Uh oh!

Uh oh!

agitter Jan 17, 2026

Uh oh!

tristan-f-r Jan 17, 2026 •

edited

Loading

Uh oh!

agitter Jan 31, 2026

Uh oh!

Uh oh!

Uh oh!

agitter left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tristan-f-r commented Jan 31, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tristan-f-r commented Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as outdated.

This comment was marked as outdated.

agitter left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

agitter Jan 17, 2026

Choose a reason for hiding this comment

Uh oh!

tristan-f-r Jan 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

agitter Jan 31, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

agitter left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

tristan-f-r commented Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tristan-f-r commented Jul 9, 2025 •

edited

Loading

tristan-f-r Jan 17, 2026 •

edited

Loading

tristan-f-r commented Jan 31, 2026 •

edited

Loading