Monitor the performance of OpenAI's GPT-4V model over time.
Vision utilities for web interaction agents 👀
Control Any Computer Using LLMs
AI Voiceover with GPT4V
The ultimate sketch-to-code app, built with GPT-4 Vision. Choose your desired framework (React, Next, React Native, or Flutter), and it instantly generates code and a sandbox preview from a simple hand-drawn sketch on paper captured from your webcam.
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
Explore the rich flavors of Indian desserts with TunedLlavaDelights. Utilizing LLaVA fine-tuning, our project unveils detailed nutritional profiles, taste notes, and optimal consumption times for beloved sweets. Dive into a fusion of AI innovation and culinary tradition.
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
Implementation of MambaByte from "MambaByte: Token-free Selective State Space Model" in PyTorch and Zeta
Convert different model APIs into the OpenAI API format out of the box.
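As context for adapters like this, a minimal sketch of the OpenAI chat-completions request shape that other providers' APIs get normalized into. The helper name and the comment's base URL are illustrative assumptions, not part of any specific project:

```python
# Sketch of the OpenAI chat-completions request format that API-adapter
# projects typically convert other providers' request formats into.
# build_chat_request is a hypothetical helper for illustration.

def build_chat_request(model: str, user_text: str) -> dict:
    """Build an OpenAI-format chat-completions payload."""
    return {
        "model": model,
        "messages": [
            {"role": "user", "content": user_text},
        ],
    }

# An adapter would POST this JSON to <base_url>/v1/chat/completions and
# translate the backend's native response into OpenAI's
# {"choices": [{"message": ...}], ...} response shape.
payload = build_chat_request("gpt-4-vision-preview", "Describe this image.")
```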
Vision-Assisted Camera Orientation
I GAVE GPT-4 EYES!
[READ-ONLY] Describe images and generate alt tags for visually impaired users.
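Alt-tag generators like this typically send the image to GPT-4V using the vision variant of the chat-completions format, where the message content is a list of text and `image_url` parts. A minimal sketch; the function name, prompt wording, and default model name are assumptions for illustration:

```python
import base64

def build_alt_text_request(image_bytes: bytes,
                           model: str = "gpt-4-vision-preview") -> dict:
    """Build a GPT-4V chat-completions payload asking for an alt tag.

    The content-list structure with an image_url part follows OpenAI's
    vision request format; the prompt text here is illustrative.
    """
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text",
                     "text": "Write a concise alt tag for this image."},
                    {"type": "image_url",
                     "image_url": {"url": f"data:image/png;base64,{b64}"}},
                ],
            }
        ],
    }

# Build a request from raw image bytes (placeholder bytes shown here).
request = build_alt_text_request(b"\x89PNG...")
```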