Python scripts replacing the original notebooks for the curation pipeline.
uv python install 3.12
uv python pin 3.12
uv venv --python 3.12
source .venv/bin/activate
uv pip install -e .Notes:
- Python 3.12 is recommended. Python 3.13+ (including 3.14) may try to build
fionafrom source and require a local GDAL install. - GDAL is required for
osgeobindings and for CLI tools likeogr2ogr.
Install GDAL (includes ogr2ogr) before installing Python deps:
# macOS
brew install gdal
# Ubuntu/Debian
sudo apt-get install -y gdal-bin libgdal-devPython GDAL bindings must match the system GDAL version. If pip/uv fails to build gdal, install a compatible GDAL version first.
scripts/extract_metadata.pyscripts/get_thumbs.pyscripts/create_pm_tiles.pyscripts/export_gdb_feature_classes_to_gpkg.pyscripts/merge_csvs.py
Example:
python scripts/merge_csvs.py people.csv scores.csv id -o combined.csvThe merged output includes match_status and unmatched_source columns so rows that only exist in one CSV are still written to the combined file.
Use --ignore-key-case when values like ABC123 and abc123 should be treated as the same key. Columns that are blank in every output row are omitted automatically.