High-Fidelity 4D Cloth Capture Pipeline with a Two-Level Pattern

We present a 4D (spatio-temporal) cloth capture system that achieves 1mm spatial resolution using only 16 RGB cameras. Our approach uses a two-level marker pattern: sparse, colored L-shaped markers provide robust detection and orientation, while dense noise patterns within each marker enable both marker identification and precise keypoint localization. A physics-based optimization deforms a template mesh to match the captured geometry while maintaining penetration-free constraints and physical plausibility for occluded regions. Our method produces temporally coherent sequences that faithfully capture fine wrinkles and folds even during complex motions with self-contact.

Paper Video

Project Publications