[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
-
Updated
Jun 1, 2024 - Python
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
Label Studio is a multi-type data labeling and annotation tool with standardized output format
⛅ Versatile Data Pipeline (VDP) console website
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
An ASL detection script utilizing a TensorFlow image classification model trained from scratch. It is tailored to recognize American Sign Language (ASL) alphabet letters from live video streams, and provides documentation covering the neural network architecture, installation, dataset details, training procedures, and real-time detection.
The open-source tool for building high-quality datasets and computer vision models
This repository offers a straightforward implementation of Vision Transformers (ViT), specifically designed for computer vision tasks using PyTorch. Dive into efficient and practical transformer applications for image recognition.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Client interface for all things Cleanlab Studio
A fast accurate API for detecting NSFW images.
Predicting & Classifying Brain tumor using CNN model
♻️ Data preparation, image classification and segmentation of images of trash.
Code for classifying hand-sign image dataset using a supervised learning approach: Neural Network with some technique to improve the accuracy metric.
Official Repository for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
23-2 조경미학
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/MaybeShewill-CV/mortred_model_server
Add a description, image, and links to the image-classification topic page so that developers can more easily learn about it.
To associate your repository with the image-classification topic, visit your repo's landing page and select "manage topics."