ML之xgboost:基于xgboost(5f-CrVa)算法对HiggsBoson数据集(Kaggle竞赛)训练(模型保存+可视化)实现二分类预测
目录
Dataset之HiggsBoson:Higgs Boson(Kaggle竞赛)数据集的简介、下载、案例应用之详细攻略
更新中……
1、交叉训练时间比较长,大约需要20多分钟。
更新中……
更新中……
- num_round = 1000
- n_estimators = cvresult.shape[0]
- print ('running cross validation, with preprocessing function')
-
- do cross validation, for each fold
- the dtrain, dtest, param will be passed into fpreproc
- then the return value of fpreproc will be used to generate results of that fold
- cvresult = xgb.cv(param, dtrain, num_round, nfold=5,
- metrics={'ams@0.15', 'auc'},
- early_stopping_rounds=10, seed = 0,
- fpreproc = fpreproc)
- print ('finish cross validation','\n',cvresult)
-
-
- print ('train model using the best parameters by cv ... ')
- bst = xgb.train( param, dtrain, n_estimators )
- bst.save_model('data_input/xgboost/data_higgsboson/higgs_cv.model')
网站声明:如果转载,请联系本站管理员。否则一切后果自行承担。
加入交流群
请使用微信扫一扫!