Scripts for use with LongCLIP, including fine-tuning Long-CLIP
Restrict a double-precision floating-point number to a specified range.
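Restricting a float to a range (clamping) can be sketched in a few lines; the function name `clamp` here is illustrative, not taken from the repository:

```python
def clamp(value: float, lower: float, upper: float) -> float:
    """Restrict value to the inclusive range [lower, upper]."""
    if lower > upper:
        raise ValueError("lower bound must not exceed upper bound")
    # min() caps the value at the upper bound, max() raises it to the lower bound
    return max(lower, min(value, upper))
```

For example, `clamp(5.7, 0.0, 1.0)` returns `1.0`, while values already inside the range pass through unchanged.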
Paddle Multimodal Integration and eXploration, supporting mainstream multimodal tasks, including end-to-end large-scale multimodal pretrained models and a diffusion model toolbox, designed for high performance and flexibility.
An open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4V, Gemini, QwenVLPlus, 50+ Hugging Face models, and 20+ benchmarks
[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography
Run zero-shot prediction models on your data
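Zero-shot prediction in the CLIP style reduces to comparing an image embedding against a set of label embeddings by cosine similarity. A minimal NumPy sketch, assuming embeddings have already been produced by some encoder (the function name and toy vectors are illustrative):

```python
import numpy as np

def zero_shot_predict(image_emb: np.ndarray,
                      label_embs: np.ndarray,
                      labels: list[str]) -> str:
    """Return the label whose embedding is most cosine-similar to the image."""
    # L2-normalize so the dot product equals cosine similarity
    image_emb = image_emb / np.linalg.norm(image_emb)
    label_embs = label_embs / np.linalg.norm(label_embs, axis=1, keepdims=True)
    scores = label_embs @ image_emb  # one similarity score per label
    return labels[int(np.argmax(scores))]
```

With a real model the label embeddings typically come from encoding prompts like "a photo of a cat"; here the mechanics are the same regardless of the encoder.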
Mimix: A Text Generation Tool and Pretrained Chinese Models
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral].
Effortless data labeling with AI support from Segment Anything and other awesome models.
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
Fine-tuning code for CLIP models
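Fine-tuning CLIP models generally optimizes a symmetric contrastive (InfoNCE) objective over paired image and text embeddings. A self-contained NumPy sketch of that loss, not the repository's actual implementation (the temperature value and function names are illustrative):

```python
import numpy as np

def clip_contrastive_loss(image_embs: np.ndarray,
                          text_embs: np.ndarray,
                          temperature: float = 0.07) -> float:
    """Symmetric InfoNCE loss over an (N, D) batch of paired embeddings.

    Matching image/text pairs share a row index; each row (and column)
    of the similarity matrix is treated as an N-way classification
    whose correct class is the diagonal entry.
    """
    image_embs = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    text_embs = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = image_embs @ text_embs.T / temperature  # (N, N) similarities
    n = logits.shape[0]

    def nll_diag(mat: np.ndarray) -> float:
        # numerically stable log-softmax per row, then mean NLL of the diagonal
        mat = mat - mat.max(axis=1, keepdims=True)
        log_probs = mat - np.log(np.exp(mat).sum(axis=1, keepdims=True))
        return float(-log_probs[np.arange(n), np.arange(n)].mean())

    # average the image-to-text and text-to-image directions
    return 0.5 * (nll_diag(logits) + nll_diag(logits.T))
```

Perfectly aligned pairs drive the loss toward zero, while mismatched pairs increase it, which is what makes the objective useful as a fine-tuning signal.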
A complete Rust inference engine with multilingual embedding support, leveraging the power of Rust both as a gRPC service and as a standalone library, providing highly efficient text and image embeddings.
An extremely cursed text-to-image AI which generates terrifying parrot abominations.
Famous Vision Language Models and Their Architectures