自然语言处理_样本处理_Stratified k-fold

分层k折交叉验证

最新推荐文章于 2024-11-19 12:21:10 发布

原创最新推荐文章于 2024-11-19 12:21:10 发布 · 263 阅读

0 ·

CC 4.0 BY-SA版权

自然语言处理专栏收录该内容

6 篇文章

订阅专栏

Stratified k-fold
StratifiedKFold is a variation of k-fold which returns stratified folds: each set contains approximately the same percentage of samples of each target class as the complete set.

from sklearn.model_selection import StratifiedKFold, KFold
import numpy as np
X, y = np.ones((50, 1)), np.hstack(([0] * 45, [1] * 5))

skf = StratifiedKFold(n_splits=3)
for train, test in skf.split(X, y):
    print('train -  {}   |   test -  {}'.format(
        np.bincount(y[train]), np.bincount(y[test])))

kf = KFold(n_splits=3)
for train, test in kf.split(X, y):
    print('train -  {}   |   test -  {}'.format(
        np.bincount(y[train]), np.bincount(y[test])))