You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First large-v2 does not support the "yue" language token and large-v3 supports.
However when I use faster-whisper to load large-v2 and transcribe sentence with the param language="yue", there is no bug. It works but it does be wrong.
importwhispermodel=whisper.load_model("large-v2",)
input_language='yue'result=model.transcribe(audio, language=input_language)
"""--> [154] sot_sequence.append(sot + 1 + langs.index(self.language))ValueError: tuple.index(x): x not in tuple"""
First
large-v2
does not support the "yue" language token andlarge-v3
supports.However when I use
faster-whisper
to loadlarge-v2
and transcribe sentence with the paramlanguage="yue"
, there is no bug. It works but it does be wrong.openai/whisper
Question:
language-token is put the start of encoded_input in openai/whisper. But it seems that language-token is not parsed to model as expected
The text was updated successfully, but these errors were encountered: