
HDBSCAN validation paper notebook#274

Open
carolineychen8 wants to merge 23 commits intomainfrom
caroline-hdbscan-benchmarking

Conversation

@carolineychen8
Contributor

No description provided.

carolineychen8 and others added 16 commits April 9, 2026 10:13
…sion tests

A new per-user regression test exposed a real stop-table concat bug when some users had empty outputs.

This commit hardens empty stop-table construction by deriving exact output columns and explicit dtypes from shared helpers, then using those typed empties in stop-detection paths. It also applies reset_index(drop=True) after grouped stop summarization and adds passthrough guards to avoid duplicate user_id columns.

Per-user regression tests were cleaned up and made faster:
- compare labels directly by (user_id, timestamp)
- remove offset/parts-style expectation logic
- run on a 4-user sample
- parameterize n_jobs with 1 and 2

For now, this is prototyped in dbstop.py and sequential.py via the focused per-user regression path that originally surfaced the bug, with shared helper changes ready for wider consolidation.
Replace split empty-stop schema helpers (column names + dtype map) with one shared helper that directly returns a typed empty stop DataFrame.

Update all active stop-detection summarization callsites (dbstop, dbscan, density_based, hdbscan, lachesis, sequential, grid_based) to use the unified helper, removing duplicated empty-frame construction logic.
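The typed-empty construction described above can be sketched as follows. This is a minimal illustration, not nomad's actual helper: the function name `empty_stop_table` and the schema are assumptions.

```python
import pandas as pd

# Hypothetical sketch of the unified helper; the real function name and
# output schema in nomad's shared helpers may differ.
def empty_stop_table() -> pd.DataFrame:
    """Return an empty stop table with exact output columns and explicit dtypes."""
    schema = {
        "user_id": "object",
        "start_timestamp": "int64",
        "end_timestamp": "int64",
        "latitude": "float64",
        "longitude": "float64",
    }
    return pd.DataFrame({col: pd.Series(dtype=dt) for col, dt in schema.items()})

# Typed empties concatenate cleanly with non-empty per-user outputs, avoiding
# the dtype upcasts that an untyped empty frame would introduce.
part = pd.DataFrame({
    "user_id": ["u1"], "start_timestamp": [0], "end_timestamp": [600],
    "latitude": [40.0], "longitude": [-70.0],
})
stops = pd.concat([empty_stop_table(), part]).reset_index(drop=True)
```

Because every column carries an explicit dtype, a user with no detected stops contributes an empty frame that does not change the concatenated result's schema.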
@paco-barreras
Collaborator

We should merge #251 first, after tests pass, then merge with main again.

paco-barreras and others added 6 commits April 23, 2026 15:47
refreshed stop detection utilities and needed to resolve conflicts
This branch cleans up the HDBSCAN validation notebook, along with some deeper
refactors to nomad's functions.

## Validation
The first part of the change improves the validation path around
`compute_visitation_errors`. It now lives with the rest of the
stop-detection validation logic in validation.py, and the overlap /
validation code can take a separate traj_cols mapping for the right-hand
table when the predicted stops and the truth table do not use the same
column names. That let me remove a lot of notebook-side transformations
that were only there to work around that fragile code.

## Notebook
The notebook `hdbscan_validation_paper` is leaner. It no longer passes
default traj_cols mappings into loaders just to restate the defaults,
and it no longer drops diary rows with missing building IDs before
validation. The general metrics now use the full truth diary, while
category-specific slices happen naturally where the categories are
actually used. I also fixed the stale `start_timestamp` / `timestamp`
mismatch after the summarize-stop output switched to
`keep_col_names=True`, and cleaned up the generation path so regenerated
diaries keep `user_id`.

## Plotting
The plotting code also got reorganized. The notebook was mixing up two
different statistical objects: the per-user distribution of a metric,
and uncertainty in the median metric estimate. Those are now shown
separately. `validation.py` now provides a small bootstrap summary
helper plus two plotting helpers: one for per-user boxplots, and one for
bootstrapped median estimates with interval whiskers. The boxplots are
there to show the spread across users; the point-and-whisker plot is
there to compare the estimated medians. That split makes the
interpretation much clearer for this notebook.
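The bootstrap summary helper can be sketched as below. The name `bootstrap_median` and its signature are assumptions for illustration; the helper in validation.py may differ.

```python
import numpy as np

def bootstrap_median(values, n_boot=2000, ci=0.95, seed=0):
    """Bootstrap estimate of the median with a percentile interval.

    Hypothetical sketch: resample the per-user metric values with
    replacement, recompute the median each time, and take quantiles of
    the resampled medians as the interval whiskers.
    """
    rng = np.random.default_rng(seed)
    values = np.asarray(values, dtype=float)
    medians = np.median(
        rng.choice(values, size=(n_boot, values.size), replace=True), axis=1
    )
    lo, hi = np.quantile(medians, [(1 - ci) / 2, 1 - (1 - ci) / 2])
    return {"median": float(np.median(values)), "lo": float(lo), "hi": float(hi)}
```

This is the quantity the point-and-whisker plot shows: uncertainty in the median estimate, which shrinks with the number of users, as opposed to the per-user spread shown by the boxplots, which does not.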

For the grouped colors, the x-axis still uses the registry family labels
such as `lachesis_coarse` and `lachesis_fine`, but the colors are
grouped by the underlying base algorithm. That is piped through from the
registry as `{algo['family']: algo['algorithm']}`, so variants of the
same base method share a hue family without hardcoding the palette in
the notebook.
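The family-to-algorithm color grouping amounts to something like the following. The registry entries and hex colors here are made-up examples; only the `{algo['family']: algo['algorithm']}` mapping comes from the change itself.

```python
# Hypothetical registry entries; the real registry structure may differ.
registry = [
    {"family": "lachesis_coarse", "algorithm": "lachesis"},
    {"family": "lachesis_fine", "algorithm": "lachesis"},
    {"family": "hdbscan_default", "algorithm": "hdbscan"},
]

# Map each x-axis family label to its base algorithm, then assign one
# color per base algorithm so variants share a hue family.
family_to_algo = {algo["family"]: algo["algorithm"] for algo in registry}
base_colors = {"lachesis": "#1f77b4", "hdbscan": "#2ca02c"}  # assumed palette
palette = {fam: base_colors[alg] for fam, alg in family_to_algo.items()}
```

With this, `lachesis_coarse` and `lachesis_fine` share a color while `hdbscan_default` gets its own, and no palette is hardcoded in the notebook.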

I ran the validation notebook end to end on the 250-agent dataset after
these changes. The script completes successfully and writes figures that
make sense to me.
@paco-barreras
Collaborator

paco-barreras commented Apr 24, 2026

@carolineychen8 , I forget if all we wanted from this PR was to touch up this notebook and bring your branch and work up to date. If so, we can merge it?

@paco-barreras
Collaborator

We can't merge yet because we still have 3 failing tests related to hdbscan. I don't know whether they are new failures:

```
======================================================= short test summary info ========================================================
FAILED nomad/tests/test_stop_detection.py::test_hdbscan_labels_single_stop - AssertionError: Expected at least one cluster
FAILED nomad/tests/test_stop_detection.py::test_hdbscan_labels_two_stops - AssertionError: Expected 2 clusters, got 0
FAILED nomad/tests/test_stop_detection.py::test_st_hdbscan_ground_truth - AssertionError: Expected 2-5 stops on ground truth, got 36
================================== 3 failed, 115 passed, 1 xfailed, 58 warnings in 267.39s (0:04:27) ===================================
```

However, we do know that hdbscan is about to change. So let's wait until we are debugging it.
