Hello!
I have a project where I need to control the timing and delays of spoken text.
For example:
[
  {
    "text": "We've got all the information you need here, and we explain why",
    "in": 0.37537499999999996,
    "out": 4.546208333333333
  },
  {
    "text": "melting glaciers could threaten millions of people's lives.",
    "in": 4.587916666666667,
    "out": 8.592916666666666
  },
  {
    "text": "If you're struggling to see how renewable energy",
    "in": 10.676333333333332,
    "out": 13.220541666666666
  },
  {
    "text": "could replace our usual energy sources, check out these solar powered aircraft.",
    "in": 13.220541666666666,
    "out": 18.851166666666664
  }
]
I need the voice to start speaking each text at its "in" point and finish at its "out" point (both in seconds).
Solutions I've brainstormed so far:
- Compute the words per minute needed for each segment and pass that to the API as the speech rate.
- Insert delay blocks into the combined text to pause the speech between segments.
But I don't know whether the information I need for that is available.
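To make the two ideas concrete, here is a rough Java sketch of the arithmetic only. It assumes words are separated by whitespace and that word count is a good enough proxy for speaking time, which is probably optimistic. `CueTiming`, `Cue`, `requiredWpm` and `pauseMillis` are names I made up for illustration; nothing here calls a real speech API, and whether the API accepts a rate or a pause in this form is exactly what I don't know.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: all names are illustrative, not part of any speech API.
final class CueTiming {

    /** One cue from the JSON above: text plus in/out points in seconds. */
    static final class Cue {
        final String text;
        final double in;
        final double out;

        Cue(String text, double in, double out) {
            this.text = text;
            this.in = in;
            this.out = out;
        }
    }

    /** Counts whitespace-separated words (assumption: this approximates spoken length). */
    static int wordCount(String text) {
        String trimmed = text.trim();
        return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
    }

    /** Words per minute needed to speak the cue in exactly (out - in) seconds. */
    static double requiredWpm(Cue cue) {
        double seconds = cue.out - cue.in;
        if (seconds <= 0) {
            throw new IllegalArgumentException("out must be after in");
        }
        return wordCount(cue.text) * 60.0 / seconds;
    }

    /** Silence in milliseconds between the end of one cue and the start of the next. */
    static long pauseMillis(Cue current, Cue next) {
        return Math.max(0L, Math.round((next.in - current.out) * 1000.0));
    }

    public static void main(String[] args) {
        List<Cue> cues = Arrays.asList(
            new Cue("We've got all the information you need here, and we explain why",
                    0.37537499999999996, 4.546208333333333),
            new Cue("melting glaciers could threaten millions of people's lives.",
                    4.587916666666667, 8.592916666666666),
            new Cue("If you're struggling to see how renewable energy",
                    10.676333333333332, 13.220541666666666));

        for (int i = 0; i < cues.size(); i++) {
            Cue cue = cues.get(i);
            System.out.printf("rate %.1f wpm", requiredWpm(cue));
            if (i + 1 < cues.size()) {
                System.out.printf(", then pause %d ms", pauseMillis(cue, cues.get(i + 1)));
            }
            System.out.println();
        }
    }
}
```

The rate would still have to be mapped onto whatever scale the API uses, and the pause would have to be expressed in whatever delay mechanism it supports.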
Does anyone have an idea how to solve this problem? (I'm using Java, but I can port code from other languages, so answers in any language are welcome.)