PhraseStereo: The First Open-Vocabulary Stereo Image Segmentation Dataset

Abstract

This paper introduces PhraseStereo, the first open-vocabulary stereo image segmentation dataset. The dataset targets the task of segmenting objects in stereo images from natural language descriptions, which enables more flexible and intuitive computer vision applications. The paper also presents a comprehensive benchmark for evaluating stereo segmentation methods in open-vocabulary settings.

Publication
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), X-Sense Workshop, 2025