This guide will walk you through setting up your environment and necessary tools to create an audiobook using Python.
sudo apt get update
sudo apt install ffmpeg
python -m venv venv
source venv/bin/activate
.\venv\Scripts\activate
pip install -r requirements.txt
pip install -r pytorch-requirements.txt
docker compose -f docker/llm-compose.yml up -d
ollama pull deepseek-r1:7b
ollama pull qwen2.5
Pull Necessary TTS and Audio Models: To get started, download the following models using the Hugging Face CLI:
huggingface-cli download parler-tts/parler-tts-mini-v1.1
huggingface-cli download parler-tts/parler-tts-mini-multilingual-v1.1
huggingface-cli download facebook/audiogen-medium
huggingface-cli download facebook/audio-magnet-medium
docker compose -f docker/tts-server-fast.yml up -d
docker compose -f docker/tts-server.yml up -d
docker compose -f docker/audiocraft-server.yml up -d
Environment Variables:
Ensure any required environment variables are set. You can use a .env
file or set them directly in your terminal.
Dependencies:
Ensure all dependencies listed in requirements.txt
and pytorch-requirements.txt
are correct and up-to-date.
Docker Check: Verify that Docker is running and the necessary containers are up and operational.