vision-language-model

Star

Here are 107 public repositories matching this topic...

1adrianb / lasp

Star

pytorch clip zero-shot few-shot-learning vision-language-model

Updated Jul 31, 2023
Python

M3-IT / YING-VLM

Star

Vision Large Language Models trained on M3IT instruction tuning dataset

large-language-models instruction-tuning vision-language-model

Updated Aug 16, 2023
Python

vincentlux / Awesome-Multimodal-LLM

Star

Reading list for Multimodal Large Language Models

machine-learning natural-language-processing computer-vision awesome-list paper-list multimodal-machine-learning large-language-models vision-language-model multimodal-large-language-models

Updated Aug 17, 2023

minhanh151 / PRE

Star

Prompt Learning with Residual Context Optimization for Vision-Language Models (2023)

prompt-learning vision-language-model

Updated Aug 28, 2023
Python

bnabis93 / vision-language-examples

Star

Vision-lanugage model example code.

tutorial example pytorch transformer embedding-models model-acceleration vision-language model-optimization vision-language-model

Updated Sep 6, 2023
Python

FeiElysia / ViECap

Star

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning, ICCV 2023

transferability modality-biases vision-language-model zero-shot-captioning object-hallucination

Updated Sep 28, 2023
Python

MIFA-Lab / InstructionGPT-4

Star

About Implementation for paper "InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4" (https://arxiv.org/abs/2308.12067)

multi-modal-learning vision-language-model minigpt4

Updated Oct 9, 2023
Python

richard-peng-xia / LMPT

Star

LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition

multi-label-image-classification prompt-tuning long-tailed-learning vision-language-model

Updated Oct 11, 2023
Python

VPGTrans / VPGTrans

Star

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

llm vision-language-model large-scale-language-modeling vl-llm

Updated Oct 13, 2023
Python

marthaflinderslewis / clip-binding

Star

Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.

clip vision-language-model

Updated Oct 14, 2023
Python

YonghaoXu / Txt2Img-MHN

Star

[IEEE TIP 2023] Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

remote-sensing hopfield-network image-synthesis text-to-image-generation vision-language-model

Updated Oct 19, 2023
Python

FMXExpress / AI-Vision-Chat

Star

Chat with large languages models about the contents of an image via this native desktop client for Windows, macOS, and Linux.

desktop-app windows macos linux delphi ai vicuna delphi-sample llm vision-language-model llava replicate-api

Updated Oct 20, 2023
Pascal

YiSyuanChen / SINC

Star

Original PyTorch implementation for ICCV 2023 Paper "SINC: Self-Supervised In-Context Learning for Vision-Language Tasks."

low-resource in-context-learning vision-language-model

Updated Oct 23, 2023
Python

yunqing-me / AttackVLM

Star

[NeurIPS-2023] Annual Conference on Neural Information Processing Systems

deep-generative-model adversarial-attack trustworthy-ai foundation-models large-language-models text-to-image-generation generative-ai vision-language-model image-to-text-generation

Updated Oct 30, 2023
Python

Anastasiais-ml / sw_clip

Star

vision-language-model

Updated Oct 31, 2023
Jupyter Notebook

Surrey-UPLab / Recognize-Any-Regions

Star

Recognize Any Regions

open-world object-detection zero-shot instance-segmentation auto-labeling vision-language-pretraining open-vocabulary vision-language-model multimodal-representation-learning vision-foundation-model vision-language-foundation-model

Updated Nov 22, 2023
Python

zhudotexe / kani-vision

Sponsor

Star

Kani extension for supporting vision-language models (VLMs). Comes with model-agnostic support for GPT-Vision and LLaVA.

extension kani large-language-models vision-language-model llava multimodal-llm gpt-vision

Updated Nov 22, 2023
Python

UCSC-VLAA / vllm-safety-benchmark

Star

Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

benchmark safety datasets robustness adversarial-attacks llm vision-language-model multimodal-llm

Updated Nov 28, 2023
Python

SangbumChoi / Florence2

Star

Unofficial repository for building Florence-2 in Microsoft Azure

unofficial-library multi-modal vision-language-model florence2

Updated Nov 29, 2023
Jupyter Notebook

lizhaoliu-Lec / CG-VLM

Star

This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.

vision-language data-efficient contrastive-learning instruction-following data-efficient-learning large-language-models instruction-tuning vision-language-model

Updated Dec 1, 2023

Improve this page

Add a description, image, and links to the vision-language-model topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-language-model topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vision-language-model

Here are 107 public repositories matching this topic...

1adrianb / lasp

M3-IT / YING-VLM

vincentlux / Awesome-Multimodal-LLM

minhanh151 / PRE

bnabis93 / vision-language-examples

FeiElysia / ViECap

MIFA-Lab / InstructionGPT-4

richard-peng-xia / LMPT

VPGTrans / VPGTrans

marthaflinderslewis / clip-binding

YonghaoXu / Txt2Img-MHN

FMXExpress / AI-Vision-Chat

YiSyuanChen / SINC

yunqing-me / AttackVLM

Anastasiais-ml / sw_clip

Surrey-UPLab / Recognize-Any-Regions

zhudotexe / kani-vision

UCSC-VLAA / vllm-safety-benchmark

SangbumChoi / Florence2

lizhaoliu-Lec / CG-VLM

Improve this page

Add this topic to your repo