Enhancing the Naturalness and Expressiveness of Text-to-Speech Output through Prosodic Parameter Manipulation
This project aims to improve the naturalness and expressiveness of Text-to-Speech (TTS) systems by developing a machine learning model that adjusts the prosodic parameters of TTS-generated speech (pitch, duration, and energy) so that the output more closely resembles human speech.
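To make the three prosodic parameters concrete, here is a minimal sketch of what manipulating them could look like on per-phoneme values. It is an illustration only, not the project's model: the `Prosody` container, the `adjust_prosody` function, and the fixed adjustment values are all hypothetical, and in the project itself a learned model would presumably predict such adjustments rather than take them as hand-set constants.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Prosody:
    """Per-phoneme prosodic parameters (hypothetical container for illustration)."""
    f0_hz: List[float]        # fundamental frequency; 0.0 marks an unvoiced phoneme
    duration_s: List[float]   # phoneme durations in seconds
    energy_rms: List[float]   # RMS amplitude per phoneme


def adjust_prosody(p: Prosody,
                   semitones: float = 0.0,
                   duration_scale: float = 1.0,
                   gain_db: float = 0.0) -> Prosody:
    """Return a copy of `p` with pitch, duration, and energy rescaled.

    Assumption: the adjustments are global constants. A learned model
    would more plausibly predict a separate value per phoneme or frame.
    """
    pitch_ratio = 2.0 ** (semitones / 12.0)   # 12 semitones = one octave
    gain = 10.0 ** (gain_db / 20.0)           # dB to linear amplitude ratio
    return Prosody(
        # Leave unvoiced phonemes (f0 == 0) untouched.
        f0_hz=[f * pitch_ratio if f > 0.0 else 0.0 for f in p.f0_hz],
        duration_s=[d * duration_scale for d in p.duration_s],
        energy_rms=[e * gain for e in p.energy_rms],
    )


# Made-up values standing in for the output of a TTS front end.
flat = Prosody(
    f0_hz=[110.0, 0.0, 220.0],
    duration_s=[0.08, 0.05, 0.12],
    energy_rms=[0.10, 0.02, 0.20],
)
expressive = adjust_prosody(flat, semitones=12.0, duration_scale=1.5, gain_db=20.0)
```

Pitch is scaled on a semitone (logarithmic) axis and energy in decibels because equal steps on those scales correspond more closely to equal perceived changes than equal steps in hertz or linear amplitude.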