Voice Activity Detection
PyTorch
pyannote
pyannote.audio
pyannote-audio-model
audio
voice
speech
speaker
speaker-diarization
speaker-change-detection
speaker-segmentation
overlapped-speech-detection
resegmentation
speaker-recognition
speaker-verification
speaker-identification
speaker-embedding
PyTorch
wespeaker
File size: 417 Bytes
c52481c d6d2a6e c52481c d6d2a6e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
model:
_target_: pyannote.audio.models.segmentation.PyanNet.PyanNet
linear:
hidden_size: 128
num_layers: 2
lstm:
batch_first: true
bidirectional: true
dropout: 0.0
hidden_size: 128
monolithic: true
num_layers: 4
num_channels: 1
sample_rate: 16000
sincnet:
sample_rate: 16000
stride: 10
task:
duration: 10.0
max_speakers_per_chunk: 3
max_speakers_per_frame: 2
|