#

image-text

Here are 37 public repositories matching this topic...

Nexdata-AI / 11000-Image-Video-caption-data-of-human-action

11000-Image-Video-caption-data-of-human-action

computer-vision human-action-recognition image-text text-image caption-data aigc generative-ai

Updated Apr 18, 2024

ppraneeth270 / img2text

textrecognition image-text image2text

Updated May 23, 2021
Python

AkshayBura / Character-Recognition

Character Recognition system using CNN and Streamlit

python deep-neural-networks tensorflow image-processing cnn preprocessing image-text streamlit recognizing-characters

Updated Aug 22, 2023
Jupyter Notebook

yomnaFathy / Text-Detection-and-Recognition

opencv ocr computer-vision deep-learning text-recognition transfer-learning pretrained-models text-detection pytesseract east image-text text-detection-recognition

Updated Oct 20, 2020
Python

waittim / ConVIRT-Colab

Contrastive Learning Representations for Images and Text Pairs. Colab implementation of ConVIRT for transfer learning with insufficient data volume.

colab image-text contrastive-learning

Updated Jan 15, 2022
Jupyter Notebook

Nexdata-AI / 20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

20011--Image-Caption-Data-Of-OCR-In-Natural-Scenes

ocr natural-scenes image-text text-image caption-data generative-ai

Updated Apr 18, 2024

DarkKnightSgh / Text-Image-Text

Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.

python information-retrieval transformers image-text flickr8k-dataset text-image streamlit semantic-embedding huggingface-transformers

Updated Apr 27, 2024
Python

makefile / text_extraction

Windows version of text_extraction(VS2013). This code is the implementation of the method proposed in the paper “Multi-script text extraction from natural scenes” (Gomez & Karatzas) to appear in ICDAR2013 conference.

Updated Aug 19, 2017
C++

CharlesYang030 / MTA

MTA: A Lightweight Multilingual Text Alignment Model for Cross-language Visual Word Sense Disambiguation

multilingual image-text multimodal language-vision visualwsd

Updated May 31, 2023
Jupyter Notebook

awsaf49 / flickr-dataset

Download flickr8k, flickr30k image caption datasets

image flickr dataset clip captioning-images image-text flickr8k flickr30k siglip

Updated Feb 6, 2024

Nexdata-AI / 10000-Image-caption-data-of-gestures

10000-Image-caption-data-of-gestures

gesture-recognition asian image-text caption-data generative-ai

Updated Apr 18, 2024

Nexdata-AI / 10000-Image-caption-data-of-vehicles

10000-Image-caption-data-of-vehicles

image-recognition vehicle-detection image-text caption-data generative-ai

Updated Apr 18, 2024

jianzhnie / MultimodalTransformers

lmmtoolkit is a toolkit for Multi-Modal Learning

image-text text-image multi-modal-learning text-to-video

Updated Nov 21, 2023
Python

xiongshufeng / MTFN-RR-PyTorch-Code

The offical code for paper "Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking", ACM Multimedia 2019 Oral

fusion image-text

Updated Sep 28, 2019
Python

formulae-org / package-graphic-raster-js

Raster graphics package for Fōrmulæ, in JavaScript

javascript formulae graphics graphics-programming turtle-graphics rotating image-transformations image-colors image-text raster-graphics image-coordinates graphic-primitives stroke-imaging xor-mode

Updated May 31, 2024
JavaScript

Nexdata-AI / 10100-Image-caption-data-of-human-face

10100-Image-caption-data-of-human-face

image-recognition image-text caption-data human-face-recognition generative-ai

Updated Apr 18, 2024

ask0ne / ocrator

Scan text from an image and convert into speech/audio of desired language.

natural-language-processing text-to-speech image-recognition pytesseract image-text

Updated Dec 8, 2022
Python

reshalfahsi / image-captioning-mobilenet-llama3

Image Captioning With MobileNet-LLaMA 3

nlp cnn pytorch transformer image-captioning image-text flickr8k-dataset mobilenetv3 pytorch-lightning kv-cache rotary-position-embedding grouped-query-attention rms-norm llama3

Updated May 5, 2024
Jupyter Notebook

miccunifi / QualiCLIP

Quality-Aware Image-Text Alignment for Real-World Image Quality Assessment

computer-vision deep-learning image-processing image-quality clip iqa image-text image-quality-assessment blind-image-quality-assessment low-level-vision image-degradation self-supervised-learning ranking-loss biqa vision-language nr-iqa no-reference-image-quality-assessment opinion-unaware opinion-unaware-nr-iqa

Updated Mar 19, 2024

CharlesYang030 / PolCLIP

PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations

image-text multimodal-wsd

Updated Mar 30, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the image-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the image-text topic, visit your repo's landing page and select "manage topics."