cm0002@piefed.world to Linux@programming.devEnglish · 1 month agoFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comexternal-linkmessage-square14fedilinkarrow-up165arrow-down15file-text
arrow-up160arrow-down1external-linkFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comcm0002@piefed.world to Linux@programming.devEnglish · 1 month agomessage-square14fedilinkfile-text
https://www.phoronix.com/news/FFmpeg-Vulkan-AV1-Encoding https://www.phoronix.com/news/FFmpeg-Lands-Whisper
minus-squarechrisbtoo@lemmy.calinkfedilinkarrow-up19·1 month agoHopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
minus-squarepirateKaiser@sh.itjust.workslinkfedilinkarrow-up8·1 month agoI’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
minus-squaredata1701d (He/Him)@startrek.websitelinkfedilinkEnglisharrow-up1·1 month agoI was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.
Hopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
I’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
I was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.