Adding realtime diarization to collabora/WhisperLive #178
Unanswered
yehiaabdelm
asked this question in
Q&A
Replies: 1 comment
-
|
Hi @yehiaabdelm, apart from To combine diart with Whisper you can check this article that I wrote on Medium some time ago. It will give you a head start but I'm sure many improvements can be made. You can also check out this gist for the diart+whisper code. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm trying to add diarization to this repo https://github.com/collabora/WhisperLive, which has transcription and also runs a VAD model before passing audio data to the transcriber. I had it working with pyannote-audio, however, the VAD model and the diarization model both run on the CPU so they slow down each other. I was also passing the whole audio file every time to the model so this is obviously not optimal. I was wondering how I can use diart instead of pyannote. Most of the examples I see are directly from microphone. Can anyone please share an example of how I can use it diart with the data being a float 32 numpy array of mono audio instead of a stream from the microphone? Any help is appreciated.
Beta Was this translation helpful? Give feedback.
All reactions