Fixes and improvements to evaluation parsing by tzschmidt · Pull Request #63 · potassco/benchmark-tool

tzschmidt · 2026-05-22T12:24:14Z

Update evaluation format
Update evaluation parser
Improve documentation

Copilot

Pull request overview

This PR updates the benchmark evaluation XML format and aligns both the writer (runscript) and reader (result parser/classes) to the new structure, with accompanying test and documentation updates.

Changes:

Extend evaluation/result model to capture richer metadata (e.g., cmdline as {pre, post}, memout, template_options, dist_template/dist_options, per-setting encodings).
Refactor the result XML parser into smaller helpers with stricter required-attribute handling and clearer errors.
Update tests, reference XML, and getting-started docs to reflect the new attributes/paths.
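As a rough illustration of "smaller helpers with stricter required-attribute handling", the sketch below shows one way such a helper could look. It is not the code from `parser.py`: the function names `require_attr` and `parse_setting`, the `<setting>`/`<encoding>` element layout, and the `cmdline_post` and `file` attribute names are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET


def require_attr(elem: ET.Element, name: str) -> str:
    """Return a required attribute, or raise an error naming the element and attribute."""
    value = elem.get(name)
    if value is None:
        raise ValueError(f"<{elem.tag}> is missing required attribute '{name}'")
    return value


def parse_setting(elem: ET.Element) -> dict:
    """Parse one hypothetical <setting> element into a plain dict."""
    return {
        "name": require_attr(elem, "name"),
        # cmdline modelled as a {pre, post} pair as in the PR summary;
        # the XML attribute names used here are assumptions.
        "cmdline": {
            "pre": elem.get("cmdline", ""),
            "post": elem.get("cmdline_post", ""),
        },
        # Per-setting encodings; each child must carry a (hypothetical) 'file' attribute.
        "encodings": [require_attr(e, "file") for e in elem.findall("encoding")],
    }


SAMPLE = (
    '<setting name="s1" cmdline="--stats" cmdline_post="-q">'
    '<encoding file="enc.lp"/>'
    "</setting>"
)
parsed = parse_setting(ET.fromstring(SAMPLE))
```

Keeping the required-attribute check in one helper means every missing attribute produces the same kind of error message, which is the "clearer errors" property the summary mentions.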

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 3 comments.


| File | Description |
| --- | --- |
| `src/benchmarktool/result/parser.py` | Refactors XML parsing into helper methods; parses new attributes (cmdline dicts, memout, template options, setting encodings). |
| `src/benchmarktool/result/result.py` | Updates dataclasses (new fields, `kw_only=True` in several types) to represent the new parsed format. |
| `src/benchmarktool/runscript/runscript.py` | Updates XML emission to the new attribute names/structure (e.g., `encoding_tag`, optional `template_options`, instance encodings). |
| `tests/result/test_result_parser.py` | Adjusts expectations for the new parsed structures/fields. |
| `tests/result/test_result_classes.py` | Updates unit tests to match new dataclass signatures/fields (including keyword-only constructors). |
| `tests/runscript/test_runscript_classes.py` | Updates XML-string assertions for renamed/added XML attributes and instance encodings. |
| `tests/ref/test_eval.xml` | Updates the reference evaluation XML fixture to the new format. |
| `docs/getting_started/workflow/index.md` | Rewords/moves an executable-permissions note into an admonition block. |
| `docs/getting_started/gen/runscript.md` | Updates example encoding paths and clarifies path relativity. |



BenKaufmann

Looks ok to me, but let's wait until @rkaminsk has had a look.

tzschmidt added 6 commits May 21, 2026 15:47

Update evaluation format

7bbf537

Update evaluation parsing

67077a1

Fix instance scope

bf35a3e

Update attribute names

55e3199

Update tests

c320cdc

Improve documentation clarity

c4736ac

tzschmidt requested a review from Copilot May 22, 2026 12:24

Copilot started reviewing on behalf of tzschmidt May 22, 2026 12:24

Copilot AI reviewed May 22, 2026


Comment thread src/benchmarktool/runscript/runscript.py

Comment thread docs/getting_started/gen/runscript.md Outdated

Comment thread docs/getting_started/gen/runscript.md Outdated

tzschmidt and others added 3 commits May 22, 2026 14:31

Fix formatting

df48c62

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Satisfy linter

ddab117

Merge branch 'bugfixes' of github.com:potassco/benchmark-tool into bugfixes

6b1ec5c

tzschmidt requested review from BenKaufmann and rkaminsk May 22, 2026 12:56

BenKaufmann approved these changes May 22, 2026

tzschmidt wants to merge 9 commits into master from bugfixes
