Subgen - Auto-generate Plex or Jellyfin Subtitles using Whisper OpenAI!

McCloud@alien.top · 11 months ago

Subgen - Auto-generate Plex or Jellyfin Subtitles using Whisper OpenAI!

Snuupy@alien.top · 11 months ago

Wow, this is great! I’d be interested in doing some subtitles for some non-English shows I have, would you happen to know if translating into English subtitles is supported?

Also, take a look at https://github.com/m-bain/whisperX - subsai uses this and it’s much faster than whisper.cpp

McCloud@alien.top · 11 months ago

It should detect the foreign language and make English subtitles, but I haven’t personally tried it.

I’m not using whisper.cpp anymore. I did some short comparisons between WhisperX and stable-ts and ultimately decided to go with stable-ts. Functionally, I’m sure they’re very similar.

RaiseRuntimeError@alien.top · 11 months ago

I was reading the docs for both openai-whisper and faster-whisper, and both can translate to English.
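For anyone who wants to try the translation path directly, here is a minimal sketch using faster-whisper. It is not Subgen's code, only an illustration of the `task="translate"` option those docs describe. The function names, the `small` model size, and the CPU/int8 settings are assumptions chosen for the example.

```python
from typing import Iterable, Tuple


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: Iterable[Tuple[float, float, str]]) -> str:
    """Turn (start, end, text) tuples into SRT-formatted subtitle text."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    return "\n".join(blocks)


def translate_to_english(media_path: str, model_size: str = "small") -> str:
    """Hypothetical helper: produce English SRT text from non-English audio."""
    # Imported lazily so the helpers above work without faster-whisper installed.
    from faster_whisper import WhisperModel

    # Assumed settings for a CPU-only machine; adjust for a GPU.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # task="translate" requests English output; the default task,
    # "transcribe", keeps the subtitles in the source language.
    segments, info = model.transcribe(media_path, task="translate")
    print(f"Detected language: {info.language}")
    return to_srt((s.start, s.end, s.text) for s in segments)
```

If the subtitles come out in the original language, the task was most likely left at its default of `transcribe`.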

ggfools@alien.top · 11 months ago

this is cool, do you see a lot of incorrect word matches?

TheBigC@alien.top · 11 months ago

This looks very cool, I am interested. Do I install it on the Plex server itself, or on a PC running a Plex client?

TheJambo@alien.top · 11 months ago

Damn this looks good, any chance of it coming to Emby?

McCloud@alien.top · 11 months ago

If I knew what the endpoints were, nothing would prohibit it. I can add it to my short list.

McCloud@alien.top · 11 months ago

I just tried, and Emby won’t actually send out the webhook on an action. I can use the test webhook, but it won’t trigger off media actions. The documentation half-implies that it’s a Premiere option?

spookymulderfbi@alien.top · 11 months ago

Very cool! If Plex still had plugin support, this is the kind of stuff I’d want to see.

Adam_Meshnet@alien.top · 11 months ago

Nice! Do you reckon with GPU you could potentially run it in real time? I’ve set up an endpoint with Whisper to transcribe videos one of my colleagues needed for work on my homelab server, which cumulatively must have saved everyone days worth of time by now.

McCloud@alien.top · 11 months ago

I’m not sure yet. Faster-whisper has some benchmarks of the large-v2 model taking about 1 minute for 13 minutes of audio. Smaller models ought to be quicker. I’m unsure whether the specs of the GPU will make much difference.
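As rough arithmetic on that benchmark figure (about 1 minute of processing for 13 minutes of audio), the sketch below works out the implied speed relative to playback. It assumes processing time scales linearly with audio length, which real hardware may not honor, and the function names and the 45-minute episode are made up for illustration.

```python
def speed_factor(audio_minutes: float = 13.0, processing_minutes: float = 1.0) -> float:
    """How many minutes of audio are processed per minute of compute."""
    return audio_minutes / processing_minutes


def estimated_processing_minutes(episode_minutes: float) -> float:
    """Estimated processing time, assuming linear scaling from the benchmark."""
    return episode_minutes / speed_factor()


factor = speed_factor()
print(f"Speed relative to playback: {factor:.0f}x")
print(f"45-minute episode: about {estimated_processing_minutes(45):.1f} minutes")
```

Anything above 1x means throughput keeps ahead of playback, but that says nothing about start-up latency, so true live subtitling would still need chunked input.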

PoundKitchen@alien.top · 11 months ago

Suhweeet!!! English only or will it handle other languages and translation too, Spanish to English?

McCloud@alien.top · 11 months ago

It can only translate into English, but the source audio can be a foreign language.

PoundKitchen@alien.top · 11 months ago

Great, that’s what I need!

I see a Docker pull in my future.

Totallynotaswede@alien.top · 11 months ago

This is Awesome!

I’d love to help out with this! I was starting to write something similar to add hooks to audiobookshelf so that it could scan through audiobooks and generate correct chapters and timings, but it’s better to implement this here.

A good idea would be to move the GPU/CPU transcoding into its own transcoding container, so that the main container can send work out to your gaming PC when it’s online. The main container could also hold scheduled jobs that it triggers on the transcode nodes when they are available. There’s lots of cool stuff that can be made. Really fun project!

Maybe we can create a discord channel for more people who are interested in developing this.

viceman256@alien.top · 11 months ago

I’m getting all sorts of syntax errors going off your dockerfile.

Maribel-han@alien.top · 11 months ago

How do I know if it’s working/doing its thing? I installed it but it seems to be doing nothing.

Kaikidan@alien.top · 11 months ago

The app works perfectly, really nice idea! But I noticed something on my install. The GitHub page mentions that it will transcribe into English from other languages, but I tried Japanese and Portuguese files and they got transcribed in their native language:

portuguese > portuguese

japanese > japanese

english > english

Is that the expected behavior, or should I add some argument to the docker compose to force translation into English?

fefeh1@alien.top · 10 months ago

I have a suggestion. I have installed it and it seems to be working, but I don’t know which file it is working on at the time. I look at the logs and I can see where it is determining the language and translating and transcribing, but I have no idea which movie/show it is processing.

Thanks for the great app!

McCloud@alien.top · 10 months ago

Unfortunately stable-ts and whisper don’t obviously output which file they are working on, so you’re dependent on trying to decipher it from the logs. I tried to add prints to show which files it has queued and started, but with threading, the stdout sometimes gets lost or buffered in strange ways.
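One common way around lost or interleaved prints in threaded code is Python's standard `logging` module, whose handlers lock around each record and flush after every message. The sketch below is an assumption about how that could look, not Subgen's actual code: the logger name, `process_file`, and `run_queue` are hypothetical, and the transcription step is only a placeholder comment.

```python
import logging
from concurrent.futures import ThreadPoolExecutor

# Include the thread name so each line shows which worker wrote it.
logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s [%(threadName)s] %(message)s",
)
log = logging.getLogger("subgen-sketch")  # hypothetical logger name
log.setLevel(logging.INFO)


def process_file(path: str) -> str:
    """Hypothetical worker: log the file name before and after processing."""
    log.info("Started: %s", path)
    # Placeholder: the real transcription call would go here.
    log.info("Finished: %s", path)
    return path


def run_queue(paths, workers: int = 2):
    """Process the queued files on a small thread pool, in submission order."""
    log.info("Queued %d file(s)", len(paths))
    with ThreadPoolExecutor(max_workers=workers, thread_name_prefix="worker") as pool:
        return list(pool.map(process_file, paths))
```

If plain `print` is kept instead, passing `flush=True` or setting `PYTHONUNBUFFERED=1` in the container environment usually stops output from sitting in a buffer.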

Vogete@alien.top · 11 months ago

I didn’t know this project existed, and I was genuinely thinking of making this tool. This is amazing, thank you! I’ll definitely try it out, especially since I have a hard time finding subtitles with proper sync for a lot of shows.

tablecontrol@alien.top · 11 months ago

Holy crap!! I’m going to try this tonight.

I was having some subtitle timing issues on Breaking Bad that were driving me nuts.

JoNike@alien.top · 11 months ago

What a cool project! Good job!