新建了一个文件夹robin存放两个人的音频数据(共2x10条wav音频)做测试,运行./test_cos.sh ~/kaldi/egs/sre16/v2/robin/sub_TIMIT_test
在最后报错:sid/compute_vad_decision.sh: moving data_test/vad.scp to data_test/.backup utils/split_scp.pl: Refusing to split data because number of speakers 2 is less than the number of output .scp files 4
测试到底是如何测试的?至少需要4个人的数据?
再加入所有人(168)的数据,继续试试——还是报错:utils/split_scp.pl: Refusing to split data because number of speakers 2 is less than the number of output .scp files 4
可能是数据存放的位置有严格要求,就移动文件至~/kaldi/data/
目录下。
依旧报错。
仔细按照报错的原因进入~/kaldi/egs/sre16/v2/steps
去修改make_mfcc.sh
里的nj
参数:由nj=4
改为nj=1
,依旧报错:
steps/make_mfcc.sh: Succeeded creating MFCC features for data_test
sid/compute_vad_decision.sh --nj 4 --cmd run.pl data_test exp/make_vad_test mfcc_test
utils/split_scp.pl: Refusing to split data because number of speakers 2 is less than the number of output .scp files 4
后来得知,脚本make_mfcc.sh
里面默认的参数可以不用去管,直接去修改当时./extract_xvectors.sh ~/kaldi/data/TIMIT_test exp/xvector_test
操作中的extract_xvectors.sh
即可:
#!/bin/bash
if [ $# != 2 ]; then
echo "Usage: $0 <data-path>"
echo " $0 ~/kaldi/data/TIMIT_test exp/xvector_T