Sanity checks of new evaluation scripts/pipelines.
Sanity checks of new evaluation scripts/pipelines.