How to update captions in real time? | Remotion | Page 1

rancid kernel Jul 7, 2024, 2:48 PM

#

I have adapted the word-by-word tiktok captions template to my nextjs project, the thing is that once it generates the subtitles with the sample-video.json file I want the user to have like a list where it comes out from the second to the second that each word goes and to be able to change it, that could be done getting from the json the data but I want that when I change a word it changes in real time, right now if I change it in the json I need to restart the page.

#

How to update captions in real time?

silver cypress Jul 8, 2024, 6:55 AM

#

rancid kernel I have adapted the word-by-word tiktok captions template to my nextjs project, t...

ok.
Have you tried working with Claude or ChatGPT ?
They are usually good helpers

rancid kernel Jul 8, 2024, 8:45 AM

#

silver cypress ok. Have you tried working with Claude or ChatGPT ? They are usually good help...

Yes, I tried but they gave me a helpful answer.

#

Can you help me or give me a good prompt for them to try to make it?

silver cypress Jul 8, 2024, 11:02 AM

#

rancid kernel Can you help me or give me a good prompt for them to try to make it?

you'll have to find your own way.
What I'd recommend is you parse your json and put it ont a reducer and then use the reducer in your inputProps in the <Player />
that way you can do whatever you want

low glade Jul 10, 2024, 12:08 PM

#

rancid kernel I have adapted the word-by-word tiktok captions template to my nextjs project, t...

@rancid kernel there are two different approaches you can take:

if you are building a SaaS, you want to build and editing interface around the <Player>, then I would recommend rendering an input field for each token and displaying the timestamp next to it. you can then edit each token. the changes you would just keep in a useState() and pass as inputProps to the player.
if you are just wanting to edit the props in the studio, you can use the writeStaticFile() API to save the props back. if you want two-way bindings, you can also use watchStaticFile() to listen to changes on disk.
I would recommend to create an overlay for editing the whisper.cpp output - here is one we did for #recorder as a reference: https://www.remotion.dev/recorder/subseditor.webm
the source code for it is also available if you get the recorder.

▶ Play video

rancid kernel Jul 10, 2024, 12:09 PM

#

Im trying to build a video editing SaaS

rancid kernel Jul 10, 2024, 12:09 PM

#

low glade <@1141751263260852248> there are two different approaches you can take: - if yo...

Im doing that

#

Passing the subtitles generated by whisper though an useState

#

To change them and the start time in the interface

#

@low glade my actual doubt

#

is where to deploy whisper.cpp (sub.mjs)

#

To use whisper cpp on an endpoint

#

Because I’m using nextjs and I want to do a SaaS

low glade Jul 10, 2024, 3:40 PM

#

rancid kernel Because I’m using nextjs and I want to do a SaaS

I am also wondering. you can not run whisper.cpp in a serverless function

if anyone has experiences, feel free to share and we make a document

rancid kernel Jul 10, 2024, 3:41 PM

#

low glade I am also wondering. you can not run whisper.cpp in a serverless function if an...

Do you think it’s a good way to use whisper with the OpenAI API?

#

And do not use whisper.cpp

#

I think it’s easier

#

Anyway, I’m going to research and If I find out some information I will let you know

#

To make a document

low glade Jul 10, 2024, 3:44 PM

#

rancid kernel To make a document

yes maybe!
it has a different format but not a bad one

stark quest Jul 12, 2024, 6:44 AM

#

@rancid kernel deepgram offers way cheaper and better transcription

#

https://deepgram.com/

Deepgram

Deepgram Voice AI: Text to Speech + Speech to Text APIs | Deepgram

Power your apps with real-time speech-to-text and text-to-speech APIs powered by Deepgram's voice AI models. Low latency, high quality, and low cost that scales

#

you can pass your audio and you would get a transcription, and it also gives the start and end times,

you can then use durationInFrames or, start and end parameters in remotion to display the subtitles based on the timing.

#

based on the components you use..

rancid kernel Jul 12, 2024, 8:54 AM

#

stark quest <@1141751263260852248> deepgram offers way cheaper and better transcription

Tysm

#

🙂

rancid kernel Jul 12, 2024, 9:26 AM

#

@stark quest

#

I checked it

#

and it only has english voices

#

I'm spaniard

#

and a lot of people of my SaaS also

#

whisper.cpp large its multilengual

stark quest Jul 12, 2024, 1:22 PM

#

@rancid kernel it supports lots of languages

#

where you checked?

#

rancid kernel Jul 12, 2024, 1:37 PM

#

stark quest where you checked?

I upload a Spanish YouTube video

#

And I don’t get the transcription

#

I don’t get any words/transcription

#

I use the demostration without login

#

Idk

stark quest Jul 12, 2024, 1:55 PM

#

you need to send audio to it

#

audio file in buffer format ig

#How to update captions in real time?