make_classification
这个函数生成一个随机的n类分类问题。
函数原型:
sklearn.datasets.make_classification(n_samples=100, n_features=20, *, n_informative=2, n_redundant=2, n_repeated=0, n_classes=2, n_clusters_per_class=2, weights=None, flip_y=0.01, class_sep=1.0, hypercube=True, shift=0.0, scale=1.0, shuffle=True, random_state=None)[source]
参数介绍:
- n_samples:int, default=100,指定样本的数量
- n_features:int, default=20,指定特征的数量。These comprise n_informative informative features, n_redundant redundant features, n_repeated duplicated features and n_features-n_informative-n_redundant-n_repeated useless features drawn at random.
- n_classes:int, default=2,指定分类问题中类别的数量
参考:
https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_classification.html