TTS Generation Pipeline

Data Ingestion
TSV/CSV with text + optional reference audio/text
Data Preprocessing
Input Validation and Sanitization
Task Scheduling
Batch and Sample Management
Inference Engine
GPU-Accelerated Model Execution
Output Post-processing
Audio ID stitching → WAV; filtering; normalization
Output Generation
WAV Files and Reporting
Monitoring & Logging
Live tqdm progress bar, per-sample CSV, aggregate metrics (throughput, failures, GPU utilization)
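
The ingestion, validation, batching, and normalization stages listed above can be sketched in a few lines of standard-library Python. This is only an illustrative sketch, not the pipeline's actual code: the column names (text, ref_audio, ref_text), the Sample class, the helper names, and the 0.95 peak target are all assumptions made for the example, and the model inference step is omitted entirely.

```python
import csv
import io
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Optional


@dataclass
class Sample:
    """One input row. Field names are hypothetical, not taken from the pipeline."""
    text: str
    ref_audio: Optional[str] = None  # optional reference audio path
    ref_text: Optional[str] = None   # optional reference transcript


def read_samples(handle, delimiter: str = "\t") -> Iterator[Sample]:
    """Ingest TSV/CSV rows, skipping rows whose text is empty after trimming."""
    for row in csv.DictReader(handle, delimiter=delimiter):
        text = (row.get("text") or "").strip()
        if not text:
            continue  # validation: drop rows without usable text
        yield Sample(
            text=text,
            ref_audio=(row.get("ref_audio") or "").strip() or None,
            ref_text=(row.get("ref_text") or "").strip() or None,
        )


def batched(samples: Iterable[Sample], batch_size: int) -> Iterator[List[Sample]]:
    """Group samples into lists of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    batch: List[Sample] = []
    for sample in samples:
        batch.append(sample)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch  # final, possibly smaller batch


def peak_normalize(waveform: List[float], peak: float = 0.95) -> List[float]:
    """Scale a waveform so its largest absolute value equals peak."""
    current = max((abs(x) for x in waveform), default=0.0)
    if current == 0.0:
        return list(waveform)  # silence or empty input: nothing to scale
    scale = peak / current
    return [x * scale for x in waveform]


# Small in-memory TSV: one full row, one whitespace-only row, one text-only row.
demo = io.StringIO(
    "text\tref_audio\tref_text\n"
    "Hello there\tspk1.wav\tHi\n"
    "   \t\t\n"
    "Second line\t\t\n"
)
batches = list(batched(read_samples(demo), batch_size=2))
normalized = peak_normalize([0.1, -0.5])
```

In this sketch, rows are validated as they are read so that the scheduler only ever sees usable samples, and batches are built lazily so a large input file does not need to be held in memory at once.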