StepAudio 2.5
StepAudio 2.5 TTS adds natural-language control of emotion and delivery
StepFun released StepAudio 2.5, a text-to-speech model that lets you steer emotion and delivery with natural-language instructions. It was covered in the show's Voice & Audio segment as the week's notable speech release.