Contents

From “how to improve U-Net” to “how not to have to choose”
nnU-Net
The data fingerprint
Supported configurations
The Medical Segmentation Decathlon
Open Source and reproducibility
Expected impact on research
Recognised limits
In the Italian context
Outlook

AgenticHealth

On-premise clinical platform with local LLMs, RAG on FHIR/DICOM data, diagnostic support, remote follow-up. Architecture designed for the MDR pathway.

Discover AgenticHealth →

Digital Health

Medical software development compliant with CE and MDR regulatory standards. Clinical decision support systems, AI integration in clinical workflows.

Discover →

From “how to improve U-Net” to “how not to have to choose”

In the years after U-Net’s publication (2015), the medical image segmentation community has produced hundreds of architectural variants — Attention U-Net, Residual U-Net, Dense U-Net, 3D variants, different encoders, alternative loss functions, ensemble strategies. Each publication presents improvements on specific tasks, but the field suffers a practical problem: what works well on one dataset does not necessarily work on another, and the choice of architecture + preprocessing + hyperparameters + training strategy requires deep learning expertise many clinical centres lack.

A provocative observation, emerging in systematic evaluations in the more active labs, is that recent architectural improvements add little to a well-configured standard U-Net — final performance is dominated by preprocessing choices (spacing, normalisation, patch size), training strategies (data augmentation, loss function, scheduler), and ensembling more than by the architecture itself.

If a “well-configured” standard U-Net is already state-of-the-art, the problem is not finding a new architecture: it is automating configuration. This is nnU-Net’s goal.

nnU-Net

nnU-Net — “no new Net” — was developed at MIC-DKFZ (Medical Image Computing) Heidelberg, in Klaus H. Maier-Hein’s group, primarily by Fabian Isensee. The preprint publication is from September 2018 (arXiv 1809.10486): “nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation”, Isensee et al.

The central thesis: for every new medical dataset, a well-configured pipeline matters more than a new architecture. The original work introduces no new architecture; it introduces a self-configuring system that, given a dataset, automatically derives optimal pipeline choices.

The data fingerprint

nnU-Net’s conceptual core is the data fingerprint — an automatically computed dataset profile including:

Voxel spacing — X, Y, Z resolution of volumes
Image shape distribution — typical dimensions and extremes of volumes
Intensity distribution — mean, standard deviation, percentiles; for CT modalities (HU) vs. MR (non-HU), normalisation strategies differ
Class distribution — how many classes, frequency of each, imbalance
Anatomical region — single-organ vs. extended region dataset

From this fingerprint, nnU-Net infers with heuristic rules a set of decisions:

Resampling strategy — whether to resample isotropically, which target spacing
Patch size — 3D training block size (GPU memory-limited)
Batch size
Network topology — network depth, initial channel count, number of pooling steps (derived from patch size)
Loss function — Dice + cross-entropy, Dice-only for very imbalanced classes
Data augmentation — rotations, scaling, gamma, noise, calibrated on the domain
Training schedule — 1000 epochs, polynomial learning-rate schedule
Number of folds — 5-fold cross-validation

The user provides the annotated dataset; nnU-Net does the rest.

Supported configurations

nnU-Net generates up to three configurations for the same dataset:

2d — classic 2D U-Net, trained slice-by-slice. Fast, baseline
3d_fullres — 3D U-Net at full resolution (or at max target), with patches covering the largest possible volume given memory
3d_lowres + 3d_cascade_fullres — cascade strategy: first 3D network at low resolution produces coarse segmentation, second 3D network at full resolution refines it. Useful when the target organ is large and GPU memory doesn’t allow sufficient patches

After training, auto-selection chooses the best configuration in cross-validation, optionally with ensemble of different configurations for the final result.

The Medical Segmentation Decathlon

nnU-Net’s proving ground is the Medical Segmentation Decathlon (MSD), a challenge organised in 2018 by a consortium of international institutions (Lena Maier-Hein et al., with MIT, KCL, and other representatives) as part of MICCAI 2018. MSD tests methods on 10 very different segmentation tasks:

Brain Tumour (multi-modal MR)
Heart (cardiac MR)
Liver (CT + tumours)
Hippocampus (MR)
Prostate (multi-modal MR)
Lung (CT + nodules)
Pancreas (CT + tumours)
Hepatic Vessel (CT + tumours)
Spleen (CT)
Colon (CT + cancer)

Challenge goal: identify a method that works without being specialised for any particular task — a generalist method. Participants were evaluated on Dice and surface distance.

Result: nnU-Net wins the 2018 Decathlon in both the Task Phase (dedicated training for each task) and the Live Phase (application to new tasks not known in advance), with significant margin over runners-up. It wins with no architectural modifications — using only the auto-derived configuration from data fingerprint.

This result — decisive and reproducible — has an immediate impact: a system with no architectural novelty systematically beats competitors with special architectures. The message is clear: configuration matters more than architecture.

Open Source and reproducibility

nnU-Net is distributed under Apache 2.0, code on GitHub (MIC-DKFZ/nnUNet). Packaging philosophy is plug-and-play:

The user prepares the dataset in a standardised directory structure
Runs nnUNet_plan_and_preprocess — nnU-Net analyses the fingerprint, plans the configuration, runs preprocessing
Runs nnUNet_train for each of 5 cross-validation folds
nnUNet_find_best_configuration picks the best configuration
nnUNet_predict applies the trained model to new cases

No PyTorch code required; configuration is fully declarative via dataset structure. This dramatically lowers the threshold for clinical groups wanting to train a segmenter on their own data.

Expected impact on research

Observable early community reactions:

Universally adopted baseline — nnU-Net will become the reference method against which any new architectural contribution is compared. If a new method doesn’t beat nnU-Net, its scientific value is hard to justify
MICCAI challenges — future challenge organisers will have as implicit standard the presence of an nnU-Net baseline in comparisons
Clinical labs — university hospitals wanting to build internal segmenters on their data (for tumour volumetry, radiotherapy organs at risk, phenotypic analyses) can start with nnU-Net without deep learning expertise
“Data fingerprint” strategy assessment — other medical imaging tasks (detection, classification) may extend the same self-configuring pattern

Recognised limits

nnU-Net has known limits:

Computational cost — full 5-fold cross-validation on a 3D dataset takes days on a single GPU (less with multi-GPU). Not a lightweight system
Dataset uniformity assumption — the data fingerprint assumes the dataset is relatively uniform (same scanner type, coherent population); very heterogeneous datasets may benefit from custom modifications
Not inference-optimised — nnU-Net is research-grade (result quality); clinical products requiring near-real-time inference need specific optimisations (TensorRT, mixed precision)
Fixed U-Net architecture — the framework does not explore radically different architectures (Transformer, Mamba — emerging but not yet applied to medical imaging). The assumption is U-Nets suffice for segmentation; valid but restrictive

In the Italian context

Italian medical imaging research groups — Politecnico di Milano, Universities of Turin, Verona, Bologna, CNR, IRCCS institutes — closely follow nnU-Net development. Adoption for Italian-data research projects is expected in coming months, once the codebase stabilises.

Value for Italian clinics will be mostly indirect: PACS vendors and AI-as-a-Service radiology services entering the Italian market in coming years will likely incorporate nnU-Net-based pipelines (with any necessary adaptations and regulatory qualification).

Outlook

Framework future directions:

Annotation platform integration — reduce manual annotation work, enabling active learning and human-in-the-loop
Multi-dataset transferability — pretrained models transferring across different tasks
Uncertainty estimation — per-pixel confidence estimation, essential for clinical uptake
Pipelines for detection and classification beyond segmentation
MONAI integration — the consortium about to announce MONAI as PyTorch-based reference medical imaging framework is discussing nnU-Net integration
Later versions (nnU-Net v2) with alternative Transformer architectures and more expressive configuration language

nnU-Net represents a paradigm shift from architectural research to system research: not “what is the best network” but “what system best configures the network”. A lesson that could extend beyond medical segmentation.

References: Fabian Isensee et al., “nnU-Net: Self-adapting Framework for U-Net-Based Medical Image Segmentation”, preprint ArXiv 1809.10486 (September 2018). MIC-DKFZ, Heidelberg (Klaus H. Maier-Hein). Medical Segmentation Decathlon (MSD), MICCAI 2018. Code: github.com/MIC-DKFZ/nnUNet. Apache 2.0 licence.

Company

Actions

Links

Products

Solutions

Industries

nnU-Net: self-adapting framework for medical segmentation