arxiv Preprint – Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP


In this episode we discuss Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
by Qihang Yu, Ju He, Xueqing Deng, Xiaohui Shen, Liang-Chieh Chen. The paper proposes a single-stage framework for open-vocabulary segmentation using a shared Frozen Convolutional CLIP (FC-CLIP) backbone. FC-CLIP simplifies the pipeline and achieves a better accuracy-cost trade-off compared to existing two-stage approaches. It outperforms previous methods on various benchmarks, sets a new state-of-the-art performance on open-vocabulary semantic segmentation datasets, and is significantly faster and uses fewer parameters.


Posted

in

by

Tags: