ML之CatboostC:基于titanic泰坦尼克数据集利用catboost算法实现二分类


蓝天会撒娇
蓝天会撒娇 2022-09-19 11:26:12 51373
分类专栏: 资讯

ML之CatboostC:基于titanic泰坦尼克数据集利用catboost算法实现二分类

目录

基于titanic泰坦尼克数据集利用catboost算法实现二分类

设计思路

输出结果

核心代码


相关内容
ML之CatBoost:CatBoost算法的简介、安装、案例应用之详细攻略
ML之CatboostC:基于titanic泰坦尼克数据集利用catboost算法实现二分类
ML之CatboostC:基于titanic泰坦尼克数据集利用catboost算法实现二分类实现

基于titanic泰坦尼克数据集利用catboost算法实现二分类

设计思路

输出结果

  1. Pclass Sex Age SibSp Parch Survived
  2. 0 3 male 22.0 1 0 0
  3. 1 1 female 38.0 1 0 1
  4. 2 3 female 26.0 0 0 1
  5. 3 1 female 35.0 1 0 1
  6. 4 3 male 35.0 0 0 0
  7. Pclass int64
  8. Sex object
  9. Age float64
  10. SibSp int64
  11. Parch int64
  12. Survived int64
  13. dtype: object
  14. object_features_ID: [1]
  15. 0: learn: 0.5469469 test: 0.5358272 best: 0.5358272 (0) total: 98.1ms remaining: 9.71s
  16. 1: learn: 0.4884967 test: 0.4770551 best: 0.4770551 (1) total: 98.7ms remaining: 4.84s
  17. 2: learn: 0.4459496 test: 0.4453159 best: 0.4453159 (2) total: 99.3ms remaining: 3.21s
  18. 3: learn: 0.4331858 test: 0.4352757 best: 0.4352757 (3) total: 99.8ms remaining: 2.4s
  19. 4: learn: 0.4197131 test: 0.4266055 best: 0.4266055 (4) total: 100ms remaining: 1.91s
  20. 5: learn: 0.4085381 test: 0.4224953 best: 0.4224953 (5) total: 101ms remaining: 1.58s
  21. 6: learn: 0.4063807 test: 0.4209804 best: 0.4209804 (6) total: 102ms remaining: 1.35s
  22. 7: learn: 0.4007713 test: 0.4155077 best: 0.4155077 (7) total: 102ms remaining: 1.17s
  23. 8: learn: 0.3971064 test: 0.4135872 best: 0.4135872 (8) total: 103ms remaining: 1.04s
  24. 9: learn: 0.3943774 test: 0.4105674 best: 0.4105674 (9) total: 103ms remaining: 928ms
  25. 10: learn: 0.3930801 test: 0.4099915 best: 0.4099915 (10) total: 104ms remaining: 839ms
  26. 11: learn: 0.3904409 test: 0.4089840 best: 0.4089840 (11) total: 104ms remaining: 764ms
  27. 12: learn: 0.3890830 test: 0.4091666 best: 0.4089840 (11) total: 105ms remaining: 701ms
  28. 13: learn: 0.3851196 test: 0.4108839 best: 0.4089840 (11) total: 105ms remaining: 647ms
  29. 14: learn: 0.3833366 test: 0.4106298 best: 0.4089840 (11) total: 106ms remaining: 600ms
  30. 15: learn: 0.3792283 test: 0.4126097 best: 0.4089840 (11) total: 106ms remaining: 558ms
  31. 16: learn: 0.3765680 test: 0.4114997 best: 0.4089840 (11) total: 107ms remaining: 522ms
  32. 17: learn: 0.3760966 test: 0.4112166 best: 0.4089840 (11) total: 107ms remaining: 489ms
  33. 18: learn: 0.3736951 test: 0.4122305 best: 0.4089840 (11) total: 108ms remaining: 461ms
  34. 19: learn: 0.3719966 test: 0.4101199 best: 0.4089840 (11) total: 109ms remaining: 435ms
  35. 20: learn: 0.3711460 test: 0.4097299 best: 0.4089840 (11) total: 109ms remaining: 411ms
  36. 21: learn: 0.3707144 test: 0.4093512 best: 0.4089840 (11) total: 110ms remaining: 389ms
  37. 22: learn: 0.3699238 test: 0.4083409 best: 0.4083409 (22) total: 110ms remaining: 370ms
  38. 23: learn: 0.3670864 test: 0.4071850 best: 0.4071850 (23) total: 111ms remaining: 351ms
  39. 24: learn: 0.3635514 test: 0.4038399 best: 0.4038399 (24) total: 111ms remaining: 334ms
  40. 25: learn: 0.3627657 test: 0.4025837 best: 0.4025837 (25) total: 112ms remaining: 319ms
  41. 26: learn: 0.3621028 test: 0.4018449 best: 0.4018449 (26) total: 113ms remaining: 304ms
  42. 27: learn: 0.3616121 test: 0.4011693 best: 0.4011693 (27) total: 113ms remaining: 291ms
  43. 28: learn: 0.3614262 test: 0.4011820 best: 0.4011693 (27) total: 114ms remaining: 278ms
  44. 29: learn: 0.3610673 test: 0.4005475 best: 0.4005475 (29) total: 114ms remaining: 267ms
  45. 30: learn: 0.3588062 test: 0.4002801 best: 0.4002801 (30) total: 115ms remaining: 256ms
  46. 31: learn: 0.3583703 test: 0.3997255 best: 0.3997255 (31) total: 116ms remaining: 246ms
  47. 32: learn: 0.3580553 test: 0.4001878 best: 0.3997255 (31) total: 116ms remaining: 236ms
  48. 33: learn: 0.3556808 test: 0.4004169 best: 0.3997255 (31) total: 118ms remaining: 228ms
  49. 34: learn: 0.3536833 test: 0.4003229 best: 0.3997255 (31) total: 119ms remaining: 220ms
  50. 35: learn: 0.3519948 test: 0.4008047 best: 0.3997255 (31) total: 119ms remaining: 212ms
  51. 36: learn: 0.3515452 test: 0.4000576 best: 0.3997255 (31) total: 120ms remaining: 204ms
  52. 37: learn: 0.3512962 test: 0.3997214 best: 0.3997214 (37) total: 120ms remaining: 196ms
  53. 38: learn: 0.3507648 test: 0.4001569 best: 0.3997214 (37) total: 121ms remaining: 189ms
  54. 39: learn: 0.3489575 test: 0.4009203 best: 0.3997214 (37) total: 121ms remaining: 182ms
  55. 40: learn: 0.3480966 test: 0.4014031 best: 0.3997214 (37) total: 122ms remaining: 175ms
  56. 41: learn: 0.3477613 test: 0.4009293 best: 0.3997214 (37) total: 122ms remaining: 169ms
  57. 42: learn: 0.3472945 test: 0.4006602 best: 0.3997214 (37) total: 123ms remaining: 163ms
  58. 43: learn: 0.3465271 test: 0.4007531 best: 0.3997214 (37) total: 124ms remaining: 157ms
  59. 44: learn: 0.3461538 test: 0.4010608 best: 0.3997214 (37) total: 124ms remaining: 152ms
  60. 45: learn: 0.3455060 test: 0.4012489 best: 0.3997214 (37) total: 125ms remaining: 146ms
  61. 46: learn: 0.3449922 test: 0.4013439 best: 0.3997214 (37) total: 125ms remaining: 141ms
  62. 47: learn: 0.3445333 test: 0.4010754 best: 0.3997214 (37) total: 126ms remaining: 136ms
  63. 48: learn: 0.3443186 test: 0.4011180 best: 0.3997214 (37) total: 126ms remaining: 132ms
  64. 49: learn: 0.3424633 test: 0.4016071 best: 0.3997214 (37) total: 127ms remaining: 127ms
  65. 50: learn: 0.3421565 test: 0.4013135 best: 0.3997214 (37) total: 128ms remaining: 123ms
  66. 51: learn: 0.3417523 test: 0.4009993 best: 0.3997214 (37) total: 128ms remaining: 118ms
  67. 52: learn: 0.3415669 test: 0.4009101 best: 0.3997214 (37) total: 129ms remaining: 114ms
  68. 53: learn: 0.3413867 test: 0.4010833 best: 0.3997214 (37) total: 130ms remaining: 110ms
  69. 54: learn: 0.3405166 test: 0.4014830 best: 0.3997214 (37) total: 130ms remaining: 107ms
  70. 55: learn: 0.3401535 test: 0.4015556 best: 0.3997214 (37) total: 131ms remaining: 103ms
  71. 56: learn: 0.3395217 test: 0.4021097 best: 0.3997214 (37) total: 132ms remaining: 99.4ms
  72. 57: learn: 0.3393024 test: 0.4023377 best: 0.3997214 (37) total: 132ms remaining: 95.8ms
  73. 58: learn: 0.3389909 test: 0.4019616 best: 0.3997214 (37) total: 133ms remaining: 92.3ms
  74. 59: learn: 0.3388494 test: 0.4019746 best: 0.3997214 (37) total: 133ms remaining: 88.9ms
  75. 60: learn: 0.3384901 test: 0.4017470 best: 0.3997214 (37) total: 134ms remaining: 85.6ms
  76. 61: learn: 0.3382250 test: 0.4018783 best: 0.3997214 (37) total: 134ms remaining: 82.4ms
  77. 62: learn: 0.3345761 test: 0.4039633 best: 0.3997214 (37) total: 135ms remaining: 79.3ms
  78. 63: learn: 0.3317548 test: 0.4050218 best: 0.3997214 (37) total: 136ms remaining: 76.3ms
  79. 64: learn: 0.3306501 test: 0.4036656 best: 0.3997214 (37) total: 136ms remaining: 73.3ms
  80. 65: learn: 0.3292310 test: 0.4034339 best: 0.3997214 (37) total: 137ms remaining: 70.5ms
  81. 66: learn: 0.3283600 test: 0.4033661 best: 0.3997214 (37) total: 137ms remaining: 67.6ms
  82. 67: learn: 0.3282389 test: 0.4034237 best: 0.3997214 (37) total: 138ms remaining: 64.9ms
  83. 68: learn: 0.3274603 test: 0.4039310 best: 0.3997214 (37) total: 138ms remaining: 62.2ms
  84. 69: learn: 0.3273430 test: 0.4041663 best: 0.3997214 (37) total: 139ms remaining: 59.6ms
  85. 70: learn: 0.3271585 test: 0.4044144 best: 0.3997214 (37) total: 140ms remaining: 57.1ms
  86. 71: learn: 0.3268457 test: 0.4046981 best: 0.3997214 (37) total: 140ms remaining: 54.6ms
  87. 72: learn: 0.3266497 test: 0.4042724 best: 0.3997214 (37) total: 141ms remaining: 52.1ms
  88. 73: learn: 0.3259684 test: 0.4048797 best: 0.3997214 (37) total: 141ms remaining: 49.7ms
  89. 74: learn: 0.3257845 test: 0.4044766 best: 0.3997214 (37) total: 142ms remaining: 47.3ms
  90. 75: learn: 0.3256157 test: 0.4047031 best: 0.3997214 (37) total: 143ms remaining: 45.1ms
  91. 76: learn: 0.3251433 test: 0.4043698 best: 0.3997214 (37) total: 144ms remaining: 42.9ms
  92. 77: learn: 0.3247743 test: 0.4041652 best: 0.3997214 (37) total: 144ms remaining: 40.6ms
  93. 78: learn: 0.3224876 test: 0.4058880 best: 0.3997214 (37) total: 145ms remaining: 38.5ms
  94. 79: learn: 0.3223339 test: 0.4058139 best: 0.3997214 (37) total: 145ms remaining: 36.3ms
  95. 80: learn: 0.3211858 test: 0.4060056 best: 0.3997214 (37) total: 146ms remaining: 34.2ms
  96. 81: learn: 0.3200423 test: 0.4067103 best: 0.3997214 (37) total: 147ms remaining: 32.2ms
  97. 82: learn: 0.3198329 test: 0.4069039 best: 0.3997214 (37) total: 147ms remaining: 30.1ms
  98. 83: learn: 0.3196561 test: 0.4067853 best: 0.3997214 (37) total: 148ms remaining: 28.1ms
  99. 84: learn: 0.3193160 test: 0.4072288 best: 0.3997214 (37) total: 148ms remaining: 26.1ms
  100. 85: learn: 0.3184463 test: 0.4077451 best: 0.3997214 (37) total: 149ms remaining: 24.2ms
  101. 86: learn: 0.3175777 test: 0.4086243 best: 0.3997214 (37) total: 149ms remaining: 22.3ms
  102. 87: learn: 0.3173824 test: 0.4082013 best: 0.3997214 (37) total: 150ms remaining: 20.4ms
  103. 88: learn: 0.3172840 test: 0.4083946 best: 0.3997214 (37) total: 150ms remaining: 18.6ms
  104. 89: learn: 0.3166252 test: 0.4086761 best: 0.3997214 (37) total: 151ms remaining: 16.8ms
  105. 90: learn: 0.3164144 test: 0.4083237 best: 0.3997214 (37) total: 151ms remaining: 15ms
  106. 91: learn: 0.3162137 test: 0.4083699 best: 0.3997214 (37) total: 152ms remaining: 13.2ms
  107. 92: learn: 0.3155611 test: 0.4091627 best: 0.3997214 (37) total: 152ms remaining: 11.5ms
  108. 93: learn: 0.3153976 test: 0.4089484 best: 0.3997214 (37) total: 153ms remaining: 9.76ms
  109. 94: learn: 0.3139281 test: 0.4116939 best: 0.3997214 (37) total: 154ms remaining: 8.08ms
  110. 95: learn: 0.3128878 test: 0.4146652 best: 0.3997214 (37) total: 154ms remaining: 6.42ms
  111. 96: learn: 0.3127863 test: 0.4145767 best: 0.3997214 (37) total: 155ms remaining: 4.78ms
  112. 97: learn: 0.3126696 test: 0.4142118 best: 0.3997214 (37) total: 155ms remaining: 3.17ms
  113. 98: learn: 0.3120048 test: 0.4140831 best: 0.3997214 (37) total: 156ms remaining: 1.57ms
  114. 99: learn: 0.3117563 test: 0.4138267 best: 0.3997214 (37) total: 156ms remaining: 0us
  115. bestTest = 0.3997213503
  116. bestIteration = 37
  117. Shrink model to first 38 iterations.

核心代码

  1. class CatBoostClassifier Found at: catboost.core
  2. class CatBoostClassifier(-title class_ inherited__">CatBoost):
  3. _estimator_type = 'classifier'
  4. """
  5. Implementation of the scikit-learn API for CatBoost classification.
  6. Parameters
  7. ----------
  8. iterations : int, [default=500]
  9. Max count of trees.
  10. range: [1,+inf]
  11. learning_rate : float, [default value is selected automatically for
  12. binary classification with other parameters set to default. In all
  13. other cases default is 0.03]
  14. Step size shrinkage used in update to prevents overfitting.
  15. range: (0,1]
  16. depth : int, [default=6]
  17. Depth of a tree. All trees are the same depth.
  18. range: [1,+inf]
  19. l2_leaf_reg : float, [default=3.0]
  20. Coefficient at the L2 regularization term of the cost function.
  21. range: [0,+inf]
  22. model_size_reg : float, [default=None]
  23. Model size regularization coefficient.
  24. range: [0,+inf]
  25. rsm : float, [default=None]
  26. Subsample ratio of columns when constructing each tree.
  27. range: (0,1]
  28. loss_function : string or object, [default='Logloss']
  29. The metric to use in training and also selector of the machine
  30. learning
  31. problem to solve. If string, then the name of a supported
  32. metric,
  33. optionally suffixed with parameter description.
  34. If object, it shall provide methods 'calc_ders_range' or
  35. 'calc_ders_multi'.
  36. border_count : int, [default = 254 for training on CPU or 128 for
  37. training on GPU]
  38. The number of partitions in numeric features binarization.
  39. Used in the preliminary calculation.
  40. range: [1,65535] on CPU, [1,255] on GPU
  41. feature_border_type : string, [default='GreedyLogSum']
  42. The binarization mode in numeric features binarization. Used
  43. in the preliminary calculation.
  44. Possible values:
  45. - 'Median'
  46. - 'Uniform'
  47. - 'UniformAndQuantiles'
  48. - 'GreedyLogSum'
  49. - 'MaxLogSum'

网站声明:如果转载,请联系本站管理员。否则一切后果自行承担。

本文链接:https://www.xckfsq.com/news/show.html?id=2051
赞同 0
评论 0 条
蓝天会撒娇L0
粉丝 0 发表 7 + 关注 私信
上周热门
如何使用 StarRocks 管理和优化数据湖中的数据?  2672
【软件正版化】软件正版化工作要点  2637
统信UOS试玩黑神话:悟空  2532
信刻光盘安全隔离与信息交换系统  2216
镜舟科技与中启乘数科技达成战略合作,共筑数据服务新生态  1092
grub引导程序无法找到指定设备和分区  743
WPS City Talk · 校招西安站来了!  15
金山办公2024算法挑战赛 | 报名截止日期更新  15
看到某国的寻呼机炸了,就问你用某水果手机发抖不?  14
有在找工作的IT人吗?  13
本周热议
我的信创开放社区兼职赚钱历程 40
今天你签到了吗? 27
信创开放社区邀请他人注册的具体步骤如下 15
如何玩转信创开放社区—从小白进阶到专家 15
方德桌面操作系统 14
我有15积分有什么用? 13
用抖音玩法闯信创开放社区——用平台宣传企业产品服务 13
如何让你先人一步获得悬赏问题信息?(创作者必看) 12
2024中国信创产业发展大会暨中国信息科技创新与应用博览会 9
中央国家机关政府采购中心:应当将CPU、操作系统符合安全可靠测评要求纳入采购需求 8

加入交流群

请使用微信扫一扫!