Abstract: PointPillars, a voxel-based 3-D object detection model, would encounter the resolution loss after voxelization, leading to the capability reduction in capturing intricate object details.
DEIMv2 is an evolution of the DEIM framework while leveraging the rich features from DINOv3. Our method is designed with various model sizes, from an ultra-light version up to S, M, L, and X, to be ...