Ultrasound Doppler angle from B-mode with deep learning

Problem

Spectral-Doppler velocity depends on the beam-to-vessel angle $\theta$ through $f_d = 2 f_0 v \cos\theta / c$, and angle correction is set by hand. I first-authored Patil & Anand (EMBC 2019), where a convolutional network learns $\theta$ directly from a single grayscale B-mode carotid image — no color Doppler, no segmentation. This project is that pipeline rebuilt from scratch on modern infra and carried end to end: why a frozen backbone works at all, how far the estimator climbs once it is tuned, and what a clinic would still need. The full interactive write-up — overview, method, results, a clinical evaluation, and a live beam-angle demo — lives on the project site.

Approach

One typed, test-first library (Keras 3 / JAX, pixi), with the model written once and the backend chosen per machine.
Orientation-preserving grid pooling instead of global average pooling (global pooling is partly rotation-invariant — wrong for an orientation target). This is the load-bearing design choice that makes a frozen backbone work at all.
Two sampling protocols behind a config flag: image-level sampling (the paper’s standard augmented-corpus protocol) and patient-level sampling (cross-subject, holding out whole volunteers) — two complementary lenses, each reported and each tuned to its own best.
Optuna TPE hyperparameter search against cached frozen features (each trial a shallow head fit; one extraction per backbone serves both protocols), then a stacked ensemble of the tuned backbones.
Clinical-grade, post-hoc evaluation, all Keras-free: split-conformal intervals, Bland–Altman, calibration curves, patient-level nested CV, test-time augmentation, a classical structure-tensor prior + fusion, and Grad-CAM.
Every figure is regenerated from results/ by script; the whole thing is reproducible with pixi run all.

Headline results

The core model: a frozen DenseNet201 + grid pooling lands at 5.84% MAPE (3.77° MAE), the paper’s best single-model regime — and it’s the pooling, not the backbone, that gets there (grid pooling lifts the frozen model from ~14% to 5.84%).
Best estimator, image-level sampling: an Optuna-tuned 5-model ensemble reaches 2.79% MAPE / 1.96° MAE ($R^2$ 0.995) — better than the paper’s best single model.
Best estimator, patient-level sampling: the tuned ensemble reaches 8.53% MAPE / 5.93° MAE ($R^2$ 0.952) on the stricter cross-subject regime.
Architecture bake-off: frozen DenseNet201 beats ConvNeXt and EfficientNetV2 — newer is not better for small-data frozen transfer.
Clinical-grade: split-conformal 90% intervals of ±20.5° at 95.2% coverage; on Bland–Altman the model reads about 4.3° below the single reference reading (method-vs-reference, not inter-observer — honestly flagged); test-time augmentation cuts per-image MAE 7.8° → 4.7°.
Honest about the ceiling: end-to-end fine-tuning and modern self-supervised encoders (DINOv2, USFM) are deferred to a CUDA box — documented, not hidden.

Share on

Bluesky Facebook LinkedIn Mastodon X (formerly Twitter)

Ultrasound Doppler angle from B-mode with deep learning

Nilesh Patil

Problem

Approach

Headline results

Links

Share on