Skip to content

Web API that summarizes multimedia from various sources using modern AI tools.

Notifications You must be signed in to change notification settings

oscargullberg/tldwol

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TL;DWOL - Too Long; Didn't Watch or Listen 🎧💡

Web API that summarizes multimedia from various sources using modern AI tools.

flowchart LR
    API[HTTP API]
    Whisper[whisper.cpp]
    Llama[llama.cpp]
    Output

    URL --> API
    API -->|Audio File| Whisper
    Whisper -->|Transcript| Llama
    Llama --> |Summary| Output

Supported inputs

  • YouTube
  • Apple Podcasts
  • Direct file URLs

Prerequisites

Quick Start 🚀

  1. Clone the Repository:

    git clone https://github.com/oscargullberg/tldwol/blob/main/README.md
    cd tldwol
  2. Install Dependencies:

    poetry install

Configuration ⚙

Create a .env file in the project root and add your environment variables:

LLAMA_CPP_DIR_PATH=path/to/llama.cpp
WHISPER_CPP_DIR_PATH=path/to/whisper.cpp
FILES_DL_DIR_PATH=path/to/download/directory
LLAMA_MODEL_PATH=path/to/model
WHISPER_MODEL_PATH=/some/path/bin/whisper.cpp/models/ggml-large.bin

Usage 🎮

  1. Start the Server:

    poetry run start
  2. Request a Summary:

    curl "http://127.0.0.1:8000/?url=https://youtube.com/watch?v=thOifuHs6eY"
    {
      "summary": "The provided content highlights the significance of hexagons in nature and human-made objects. It emphasizes the strength and stability of hexagons and ..."
    }

Releases

No releases published

Packages

No packages published

Languages