I’m not sure yet. Faster-whisper has some benchmarks of the Largev2 model taking about 1 minute for 13 minutes of audio. Smaller models ought to be quicker. Unsure if the specs of the GPU will make much differenece.
I’m not sure yet. Faster-whisper has some benchmarks of the Largev2 model taking about 1 minute for 13 minutes of audio. Smaller models ought to be quicker. Unsure if the specs of the GPU will make much differenece.
It can only translate into English, but the source audio can be a foreign language.
I just tried, Emby won’t actually send out the webook on an action. I can use the test webhook, but it won’t trigger off media actions. Documentation half-implies that it’s a premiere options?
If I knew what the endpoints were, nothing would prohibit it. I can add it to my short list.
It should detect the foreign language and make english subtitles, but I haven’t personally tried it.
I’m not using whisper.cpp anymore. I did some short comparisons between WhisperX and stable-ts and ultimately decided to go with stable-ts. Functionally, I’m sure they’re very similar.
Completed the Bazarr integration. See: https://github.com/McCloudS/subgen/edit/main/README.md#bazarr