
Per-pixel Features: Mating Segment-Anything with CLIP

This repository generates per-pixel features using two pretrained models, Segment-Anything (SAM) and CLIP. The pixel-aligned features are useful for downstream tasks such as visual grounding and VQA. First, SAM generates segmentation masks. Then, images cropped around each mask are sent to CLIP to extract semantic features. Finally, each pixel is assigned semantic features according to its associated masks, as sketched below.
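The following is a minimal sketch of that pipeline, not the repository's exact code; the image path, checkpoint path, CLIP variant, and the mask-averaging rule are all assumptions for illustration.

import numpy as np
import torch
import clip
from PIL import Image
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

device = "cuda" if torch.cuda.is_available() else "cpu"
image = np.array(Image.open("example.jpg").convert("RGB"))  # hypothetical input image

# 1. SAM proposes segmentation masks over the whole image.
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth").to(device)
masks = SamAutomaticMaskGenerator(sam).generate(image)

# 2. CLIP encodes a crop around each mask into a semantic feature.
model, preprocess = clip.load("ViT-B/32", device=device)
H, W = image.shape[:2]
pixel_feats = torch.zeros(H, W, 512)          # ViT-B/32 image features are 512-d
counts = torch.zeros(H, W, 1)
for m in masks:
    x, y, w, h = (int(v) for v in m["bbox"])  # mask bounding box in XYWH format
    crop = Image.fromarray(image[y:y + h, x:x + w])
    with torch.no_grad():
        feat = model.encode_image(preprocess(crop).unsqueeze(0).to(device))
    feat = (feat / feat.norm(dim=-1, keepdim=True)).squeeze(0).float().cpu()
    # 3. Every pixel covered by this mask accumulates the crop's feature;
    #    overlapping masks are averaged below (one possible assignment rule).
    seg = torch.from_numpy(m["segmentation"])
    pixel_feats[seg] += feat
    counts[seg] += 1
pixel_feats = pixel_feats / counts.clamp(min=1)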

Here, we show open-vocabulary segmentation without any training or finetuning.

(Figure: input image and the corresponding open-vocabulary segmentation result.)
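Continuing from the sketch above, open-vocabulary segmentation can be obtained by comparing each pixel's feature with CLIP text embeddings; the prompt list and the per-pixel argmax rule here are assumptions, not the repository's configuration.

# Reuses `model`, `device`, and `pixel_feats` from the sketch above.
labels = ["a photo of a dog", "a photo of grass", "a photo of sky"]  # hypothetical prompts
with torch.no_grad():
    text_feats = model.encode_text(clip.tokenize(labels).to(device))
text_feats = (text_feats / text_feats.norm(dim=-1, keepdim=True)).float().cpu()

# Cosine similarity between every pixel feature and every text prompt,
# then label each pixel with its best-matching prompt.
pixel_feats = pixel_feats / pixel_feats.norm(dim=-1, keepdim=True).clamp(min=1e-6)
sims = torch.einsum("hwc,kc->hwk", pixel_feats, text_feats)
segmentation = sims.argmax(dim=-1)  # HxW map of label indices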

Prepare

  1. You may need to install Segment-Anything and CLIP (or OpenCLIP).
  2. Download one of the SAM checkpoints from the SAM repository.

Demo

You can generate per-pixel features for an image:

python feature_autogenerator.py --image_path {image_path} --output_path {output_path} --output_name {feature_file_name} --checkpoint_dir {checkpoint_dir}
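For example, with purely illustrative paths (substitute your own):

python feature_autogenerator.py --image_path ./images/example.jpg --output_path ./outputs --output_name example_features --checkpoint_dir ./checkpoints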

Or generate segmentation results directly from a given config file:

python segment.py --config_path {config_path}

Acknowledgement

  1. Segment-Anything
  2. CLIP
  3. OpenCLIP

Citation

If you find this work useful for your research, please consider citing this repo:

@misc{mingfengli_seganyclip,
  title={Per-pixel Features: Mating Segment-Anything with CLIP},
  author={Li, Ming-Feng},
  url={https://github.com/justin871030/Segment-Anything-CLIP},
  year={2023}
}
