Skip to content

Chat for summarization and information retrieval of your audio data like meetups, lectures, or webinars

License

Notifications You must be signed in to change notification settings

Logisx/audio-conversational-agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Conversational agent with RAG on audio files

🚀 Overview

This chat agent is a tool that is able to transcribe audio input, and answer user's questions based on the information in the audio. It is made for efficient summarization, information retrieval, and organization of data after attending events like meetups, lectures, or webinars.

Version MIT License
OpenAI LangChain ChromaDB AssemblyAI Gradio Jupyter Lab

⭐ Example

Demo

🧰 Technologies used

  • OpenAI 🤖
  • LangChain 🔗
  • ChromaDB 💾
  • AssemblyAI 🔊
  • Gradio 💻

📊 How It Works

  1. Audio Input: Input audio is transcribed using AssemblyAI's transcription capabilities.
  2. Storage: Transcribed data is stored in the Chroma vector database.
  3. Query Response: User queries are answered using Retrieval Augmented Generation, providing detailed and accurate information.
  4. Conversation History: Chat is keeping the history of preivous messages and is able to provide more accurate answers based on that.

🗺️ How to Get Started

  1. Open the notebook file
  2. Install dependencies
  3. Insert the links to your audio
  4. Run the cells and start your chat!

⚖️ License

MIT

🔗 Links

linkedin

About

Chat for summarization and information retrieval of your audio data like meetups, lectures, or webinars

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published