Your own personal Ruskin.
-
Updated
Nov 20, 2023 - TypeScript
Your own personal Ruskin.
Digital Artificial Intelligence Agent
VisionQuery GPT-4v is a cutting-edge tool that combines screenshot-based queries with OpenAI's GPT-4. It enables users to capture screens, ask questions, and receive insightful answers from GPT-4v, revolutionizing digital interaction and understanding.
Camera powered with AI on the web
This is a tool that uses GPT4 Vision to operate your computer
Convert different model APIs into the OpenAI API format out of the box.
Capture images with HoloLens and receive descriptive responses from OpenAI's GPT-4V(ision).
Control Any Computer Using LLMs
Highly efficient and easy-to-use utility functions for common tasks. Includes functions to fetch JPG files from a folder, sort by modification time, and preprocess images in batch for GPT-4o vision API. Also includes optimized directory operations, file handling, image processing, and JSON manipulation with cache and multithreading.
Towards Explainable Metrics for Conditional Image Synthesis Evaluation (ACL 2024)
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
Add a description, image, and links to the gpt4vision topic page so that developers can more easily learn about it.
To associate your repository with the gpt4vision topic, visit your repo's landing page and select "manage topics."