Deep Instruction Tuning Enhances Segment Anything Model's Text-Guided Segmentation Capabilities
Deep text instruction tuning is essential to improve the text-guided segmentation capabilities of the Segment Anything Model (SAM), which performs much worse on text-instructed tasks compared to point- and box-guided segmentation.