Deep Image Matting
Authors:
A deep learning approach for accurate alpha matte estimation, addressing challenges in image matting (e.g., overlapping foreground/background colors, complex textures). The method uses a two-stage neural network and introduces a large-scale synthetic dataset (Composition-1k) to improve generalization.
Architecture:
1. Encoder-Decoder Network (VGG-16 backbone) for initial alpha prediction; 2. Refinement Network (4-layer CNN) to enhance details
Input:
RGB image + trimap (user-defined or dilated from ground truth)
Requires Human in Loop:
Yes (trimap creation for inference)
Modality:
Image
Key Innovations:
- Two-stage architecture: Encoder-decoder network + refinement network for sharp edges
- Combined loss functions (alpha prediction + compositional loss). Compositional loss is a creative solution.
- Training images are composited. The paper claims the model is able to generalize to real images.