Abstract: This paper introduces WhisperSeg, utilizing the Whisper Transformer pre-trained for Automatic Speech Recognition (ASR) for human and animal Voice Activity Detection (VAD). Contrary to ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results