LAVIS - A One-stop Library for Language-Vision Intelligence
Jupyter Notebook - Updated May 19, 2024
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
TensorFlow Implementation of "Show, Attend and Tell"
Simple Swift class providing all the configurations you need to create a custom camera view in your app
Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and other methods.
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Oscar and VinVL
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
This repository explores a variety of techniques and algorithms commonly used in deep learning, with implementations in MATLAB and Python
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Meshed-Memory Transformer for Image Captioning. CVPR 2020
ML data annotation made super easy for teams. Just upload data, add your team, and build training/evaluation datasets in hours.
Image Captioning using InceptionV3 and beam search
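Several of the captioning projects listed here decode with beam search. As a minimal illustration of the idea, here is a hedged sketch of beam-search decoding over a hypothetical hand-coded next-token distribution; a real captioner (e.g. one conditioned on InceptionV3 image features) would compute these log-probabilities with a trained decoder instead:

```python
import math

# Hypothetical next-token log-probabilities; stands in for a real decoder.
LOG_PROBS = {
    "<s>": {"a": math.log(0.6), "the": math.log(0.4)},
    "a": {"dog": math.log(0.7), "cat": math.log(0.3)},
    "the": {"dog": math.log(0.5), "cat": math.log(0.5)},
    "dog": {"</s>": math.log(1.0)},
    "cat": {"</s>": math.log(1.0)},
}

def beam_search(start="<s>", end="</s>", beam_width=2, max_len=5):
    # Each beam is (token sequence, cumulative log-probability).
    beams = [([start], 0.0)]
    for _ in range(max_len):
        candidates = []
        for tokens, score in beams:
            last = tokens[-1]
            if last == end:  # finished caption: carry it forward unchanged
                candidates.append((tokens, score))
                continue
            for tok, lp in LOG_PROBS.get(last, {}).items():
                candidates.append((tokens + [tok], score + lp))
        # Keep only the top-k partial captions by cumulative score.
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:beam_width]
        if all(t[-1] == end for t, _ in beams):
            break
    return beams

best_tokens, best_score = beam_search()[0]
print(" ".join(best_tokens[1:-1]))  # prints "a dog"
```

Greedy decoding is the special case `beam_width=1`; wider beams trade compute for a better chance of finding a higher-probability caption.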
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
An open-source tool for sequence learning in NLP built on TensorFlow.
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. Demos: https://huggingface.co/spaces/TencentARC/Caption-Anything and https://huggingface.co/spaces/VIPLab/Caption-Anything