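AdaBoost does not overfit: some of the literature reports that, on well-behaved datasets, AdaBoost's test error rate settles at a stable value and does not rise as more classifiers are added.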
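One way to examine this claim is to record the test error after each boosting round and see whether the curve flattens out. The following is a minimal sketch under stated assumptions, not the code used in these notes: it boosts decision stumps in pure Python on a synthetic two-feature dataset, and the names `make_data`, `best_stump`, `adaboost`, and `error_curve` are all hypothetical. What the curve looks like depends on the data, so no particular result is implied.

```python
import math
import random

def stump_predict(x, dim, thresh, sign):
    """Decision stump: predict `sign` if x[dim] <= thresh, otherwise -sign."""
    return sign if x[dim] <= thresh else -sign

def best_stump(X, y, w):
    """Exhaustively search for the stump with the lowest weighted error."""
    best = None
    for dim in range(len(X[0])):
        for thresh in sorted({x[dim] for x in X}):
            for sign in (1, -1):
                err = sum(wi for xi, yi, wi in zip(X, y, w)
                          if stump_predict(xi, dim, thresh, sign) != yi)
                if best is None or err < best[0]:
                    best = (err, dim, thresh, sign)
    return best

def adaboost(X, y, rounds):
    """Train `rounds` stumps; return a list of (alpha, dim, thresh, sign)."""
    n = len(X)
    w = [1.0 / n] * n  # start from uniform sample weights
    ensemble = []
    for _ in range(rounds):
        err, dim, thresh, sign = best_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, dim, thresh, sign))
        # Raise the weight of misclassified samples, lower the rest.
        w = [wi * math.exp(-alpha * yi * stump_predict(xi, dim, thresh, sign))
             for xi, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def error_curve(ensemble, X, y):
    """Error rate of the first k stumps, for k = 1..len(ensemble)."""
    scores = [0.0] * len(X)
    curve = []
    for alpha, dim, thresh, sign in ensemble:
        scores = [s + alpha * stump_predict(x, dim, thresh, sign)
                  for s, x in zip(scores, X)]
        wrong = sum(1 for s, yi in zip(scores, y)
                    if (1 if s >= 0 else -1) != yi)
        curve.append(wrong / len(X))
    return curve

def make_data(n, rng):
    """Hypothetical toy data: the label is +1 when x0 + x1 > 0, else -1."""
    X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(n)]
    y = [1 if x[0] + x[1] > 0 else -1 for x in X]
    return X, y

rng = random.Random(0)  # fixed seed so repeated runs are comparable
X_train, y_train = make_data(100, rng)
X_test, y_test = make_data(200, rng)
ensemble = adaboost(X_train, y_train, rounds=30)
test_curve = error_curve(ensemble, X_test, y_test)
for k in (1, 10, 20, 30):
    print(k, "classifiers -> test error", test_curve[k - 1])
```

Plotting `test_curve` against the number of rounds, alongside the same curve computed on the training set, makes it easy to see whether the test error levels off or starts to climb on a given dataset.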
# Adaptive data-loading function
def loadDataSet(fileName):  # general function to parse tab-delimited floats
    # Number of fields per line; the last column is the class label.
    numFeat = len(open(fileName).readline().split('\t'))
    dataMat = []; labelMat = []
    fr = open(fileName)
    for line in fr.readlines():
        lineArr = []
        curLine = line.strip().split('\t')
        for i in range(numFeat-