背景
在做点击率预估GBDT+LR模型时,使用lightgbm训练时,报错
# create dataset for lightgbm
lgb_train = lgb.Dataset(X_train, label = y_train)
lgb_eval = lgb.Dataset(X_test, label = y_test)
params = {
'task': 'train',
'boosting_type': 'gbdt',
'objective': 'binary',
'metric': {
'binary_logloss'},
'num_leaves': 64,
'num_trees': 100,
'learning_rate': 0.01,
'feature_fraction': 0.9,
'bagging_fraction': 0.8,
'bagging_freq': 5,
'verbose': 0
}
# number of leaves,will be used in feature transformation
num_leaf = 64
print('Start training...')
# train
gbm = lgb.train(params,
lgb_train,
num_boost_round=100,
valid_sets=lgb_train)
有如下报错
ValueError Traceback (most recent call last)
<ipython-input-25

在构建点击率预估模型时遇到LightGBM训练错误,问题源于输入数据不是二维或为空。通过检查发现,y_train是Series类型而非DataFrame。解决方案包括正确拆分数据集或转换Series为DataFrame。建议使用train_test_split函数划分数据,并在必要时将标签转换为DataFrame。
最低0.47元/天 解锁文章
1929

被折叠的 条评论
为什么被折叠?



