Digital Artificial Intelligence Agent
-
Updated
Dec 28, 2023 - Python
Digital Artificial Intelligence Agent
Highly efficient and easy-to-use utility functions for common tasks. Includes functions to fetch JPG files from a folder, sort by modification time, and preprocess images in batch for GPT-4o vision API. Also includes optimized directory operations, file handling, image processing, and JSON manipulation with cache and multithreading.
Camera powered with AI on the web
VisionQuery GPT-4v is a cutting-edge tool that combines screenshot-based queries with OpenAI's GPT-4. It enables users to capture screens, ask questions, and receive insightful answers from GPT-4v, revolutionizing digital interaction and understanding.
Towards Explainable Metrics for Conditional Image Synthesis Evaluation (ACL 2024)
Your own personal Ruskin.
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision).
This is a tool that uses GPT4 Vision to operate your computer
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
Convert different model APIs into the OpenAI API format out of the box.
Control Any Computer Using LLMs
Add a description, image, and links to the gpt4vision topic page so that developers can more easily learn about it.
To associate your repository with the gpt4vision topic, visit your repo's landing page and select "manage topics."