U-Net (2015) — Research — PartialObs-PDEBench

TL;DR

U-Net is a symmetric encoder–decoder convolutional network with skip connections that fuse high-resolution features from the encoder into the decoder. While originally proposed for biomedical segmentation, it is widely used as a baseline for dense prediction on grids (including PDE surrogate modeling and masked field reconstruction).

Problem

Provide a fast, high-quality mapping from input images/tensors to output images/tensors when locality and multi-scale context matter. In AI4PDE, U-Net is a common baseline for grid-based surrogates and inpainting.

Benefits vs others

Strong locality + multi-scale inductive bias; often competitive with more complex models on image-like PDE grids.
Easy to train and deploy; compatible with many conditioning signals via concatenation.
Good default backbone for diffusion denoisers and PDE inpainting models.

Interesting detail

Many diffusion models use a U-Net backbone; this makes U-Net a key baseline component for generative PDE inference.

Core method (math)

Template for CNN / U-Net. Paper-specific equations are added when manually curated.

\[\text{Encoder: } h_{l+1} = \sigma(\mathrm{Conv}(h_l))\;\; \xrightarrow{\downarrow}\;\; \text{(downsample)}\] \[\text{Decoder: } g_{l-1} = \sigma(\mathrm{UpConv}(g_l))\;\; \oplus\;\; h_{l-1}\;\; \text{(skip connection)}\]

Main theoretical contribution

Skip connections preserve fine spatial detail while allowing deep receptive fields through downsampling.

Main contribution

Introduced the now-standard U-shaped encoder–decoder with skip connections for dense prediction.
Demonstrated strong segmentation performance with limited labeled data via heavy data augmentation.

Main results (headline)

(Optional) Add main_results for a quick headline summary.

Experiments

PDE problems

(baseline architecture; not PDE-specific)

Tasks

segmentation (original)
grid-to-grid regression (baseline)
inpainting

Experiment setting (high level)

Convolution + ReLU blocks with max-pooling downsampling.
Transposed convolutions (or upsampling+conv) for decoding.
Skip connections concatenate encoder feature maps into decoder.

Comparable baselines

FCN / plain encoder–decoder CNNs

Main results

Why it is used in AI4PDE

Use case	Reason
Surrogate modeling	Fast grid-to-grid mapping; strong locality bias
Masked reconstruction	Skip connections preserve detail in inpainting tasks
Diffusion denoiser backbone	Standard U-Net is a strong denoising architecture

Citation (BibTeX)

@inproceedings{ronneberger2015unet,
  title={U-Net: Convolutional Networks for Biomedical Image Segmentation},
  author={Ronneberger, Olaf and Fischer, Philipp and Brox, Thomas},
  booktitle={International Conference on Medical image computing and computer-assisted intervention},
  pages={234--241},
  year={2015},
  organization={Springer}
}