AUDIO, VIDEO -> TEXT
Transcribe audio to text with speaker diarization
Transcribe audio files or YouTube videos into text
Transcribe audio to text with timestamps