uni4d

Uni4D is a novel method for learning discriminative 4D representations from point cloud videos using a self-disentangled Masked AutoEncoder. It outperforms existing methods.

Uni4D is modular: any component can be swapped for the output of another visual foundation model. For custom video depth estimation and dynamic masks, save them in