[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
-
Updated
Jun 4, 2024 - Python
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the features (attributes, style, functionality etc.) of the items and increase online sales by enticing more customers.
A true multimodal LLaMA derivative -- on Discord!
caption generator using lavis and argostranslate
Finetuning Large Visual Models on Visual Question Answering
Caption images across your datasets with state of the art models from Hugging Face and Replicate!
Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost
Too lazy to organize my desktop, make gpt + BLIP-2 do it
Implementation of Qformer from BLIP2 in Zeta Lego blocks.
An end to end Deep Learning based tool for image caption generation.
This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for risk(s).
Modifying LAVIS' BLIP2 Q-former with models pretrained on Japanese datasets.
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
Creating stylish social media captions for an Image using Multi Modal Models and Reinforcement Learning
Add a description, image, and links to the blip2 topic page so that developers can more easily learn about it.
To associate your repository with the blip2 topic, visit your repo's landing page and select "manage topics."