A Python 3.10 tool to transcribe audio files (e.g., iPhone voice memos) and summarize them with action items, attendees, time, and date using OpenAI GPT.
- Transcribe audio using local Whisper (openai/whisper or faster-whisper) or the OpenAI Whisper API
- Summarize transcript with GPT (cost-efficient, default: gpt-3.5-turbo)
- Extract action items, attendees, time, and date
- Configurable via CLI or a .env config file
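As a rough illustration of how CLI options and .env settings might be combined, here is a minimal sketch using only the standard library. The key names (`METHOD`, `GPT_MODEL`, `OUTPUT_DIR`), the default method, and the helpers `parse_env_text` and `resolve_config` are assumptions made for this example and may not match the tool's actual config keys or code; only the gpt-3.5-turbo default comes from the feature list above.

```python
# Hypothetical defaults: only the gpt-3.5-turbo default is stated in this README.
DEFAULTS = {
    "METHOD": "local_whisper",
    "GPT_MODEL": "gpt-3.5-turbo",
    "OUTPUT_DIR": "output",
}


def parse_env_text(text):
    """Parse KEY=VALUE lines, skipping blank lines and lines starting with #."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def resolve_config(env_text="", cli_overrides=None):
    """Merge settings so that defaults < .env values < CLI options actually passed."""
    config = dict(DEFAULTS)
    config.update(parse_env_text(env_text))
    for key, value in (cli_overrides or {}).items():
        if value is not None:  # None means the option was not given on the CLI
            config[key] = value
    return config


# Example .env content with a placeholder key (not a real credential).
env_text = "OPENAI_API_KEY=sk-placeholder\nMETHOD=openai_api\n# a comment line\n"
config = resolve_config(env_text, {"METHOD": "faster_whisper", "OUTPUT_DIR": None})
print(config["METHOD"], config["GPT_MODEL"], config["OUTPUT_DIR"])
```

In this sketch a CLI value wins over the .env value, and an option left unset on the CLI falls back to .env and then to the default.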
- Clone the repo and create a new Anaconda environment:
conda create -n transaiscriber python=3.10
conda activate transaiscriber
- Install dependencies:
pip install -r requirements.txt
- Copy and edit the config file:
cp config.example.env .env
# Edit .env to add your OpenAI API key and preferences
python transaiscriber.py path/to/audio.m4a --method local_whisper --output_dir output
- You can also set options in .env instead of on the CLI.
- Supported methods: local_whisper, faster_whisper, openai_api
- Modular code for easy addition of cloud storage or other features.
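The usage command and supported methods above could be parsed with an `argparse` interface along the following lines. This is a sketch under assumptions, not the actual contents of transaiscriber.py: the `build_parser` helper and the `audio_path` argument name are hypothetical, while the `--method` and `--output_dir` flags and the three method names are taken from the usage line.

```python
import argparse

# Method names as listed in this README.
SUPPORTED_METHODS = ("local_whisper", "faster_whisper", "openai_api")


def build_parser():
    """Build a parser mirroring the documented command line (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="transaiscriber.py")
    parser.add_argument("audio_path", help="audio file to transcribe, e.g. an .m4a voice memo")
    # default=None lets a value from .env apply when the flag is omitted.
    parser.add_argument("--method", choices=SUPPORTED_METHODS, default=None,
                        help="transcription backend")
    parser.add_argument("--output_dir", default=None,
                        help="directory for the transcript and summary")
    return parser


args = build_parser().parse_args(
    ["path/to/audio.m4a", "--method", "local_whisper", "--output_dir", "output"]
)
print(args.audio_path, args.method, args.output_dir)
```

Restricting `--method` with `choices` makes argparse reject an unsupported backend name with a usage error before any audio is processed.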
- Python 3.10+
- See requirements.txt
