ISSN 1009-6248CN 61-1149/P 双月刊

主管单位:中国地质调查局

主办单位:中国地质调查局西安地质调查中心
中国地质学会

    高级检索

    融合机器学习的找矿靶区优选模型研究与应用

    An Integrated Machine Learning Model for Optimizing Mineral Exploration Targets and Its Application

    • 摘要:
      针对传统找矿靶区优选方法多源信息融合不足、主观性强、定量化程度低及不确定性难量化等问题,构建融合改进模糊层次分析(HI-FAHP)、蒙特卡洛模拟与机器学习的概率化决策模型,实现多源信息系统融合与结果概率化表达,提升勘查初期找矿风险评估的科学性。
      以河南栾川祖师庙矿区为对象,综合改进AHP、主成分分析与模糊评价建立HI-FAHP体系,客观确定地质指标(赋矿地层、断裂构造、侵入岩、物探异常)与化探指标(Ag-Pb-Sb、Au-Mo-As-Bi、Cu-V-Zn、W)权重。引入蒙特卡洛模拟,对隶属度(扰动系数0.08,三角分布)与等级边界值(均匀分布)随机扰动,生成各靶区综合隶属度概率分布与排序概率。采用决策树、随机森林和XGBoost等3种模型,基于HI-FAHP权重生成5000组随机得分数据训练,优选XGBoost反演地质与化探权重,计算成矿潜力指数,模型圈定5个找矿靶区。蒙特卡洛模拟得综合隶属度排序:Ⅰ号(79.55)>Ⅳ号(76.70)>Ⅱ号(74.50)>Ⅲ号(71.30)>Ⅴ号(67.20)。XGBoost拟合最优(R2=0.90),反演权重:地质0.25,化探0.75,成矿潜力指数排序一致。工程验证:Ⅰ号靶区揭露方铅矿体,Ⅳ号靶区圈定铜矿体,其余靶区矿化弱,与预测高度吻合。 HI-FAHP-蒙特卡洛-机器学习融合模型实现了多源信息客观融合与不确定性量化,XGBoost可有效反演特征权重,经工程验证可靠,适用于勘查初期多源数据整合与找矿风险评估。

       

      Abstract: To address insufficient multi-source data fusion, strong subjectivity, low quantification, and difficulty in uncertainty quantification in traditional mineral exploration target optimization methods, this study constructs a probabilistic decision-making model integrating Hybrid Improved Fuzzy Analytic Hierarchy Process (HI-FAHP), Monte Carlo simulation, and machine learning. The model aims to achieve systematic fusion of multi-source information and probabilistic expression of results, improving the scientific reliability of exploration risk assessment in the early exploration stage. Taking the Zushimiao polymetallic mining area in Luanchuan County, Henan Province, as a case study, a HI-FAHP system is established by combining improved AHP, principal component analysis, and fuzzy evaluation. Weights are objectively determined for geological indicators (ore-bearing strata, fault structures, intrusive rocks, geophysical anomalies) and geochemical indicators (four groups: Ag-Pb-Sb, Au-Mo-As-Bi, Cu-V-Zn, W). Monte Carlo simulation is introduced to randomly perturb membership degrees (perturbation coefficient 0.08, triangular distribution) and grade boundary values (uniform distribution), generating probability distributions and ranking probabilities for comprehensive membership degrees of each target area. Three machine learning models—Decision Tree, Random Forest, and XGBoost—are trained on 5,000 randomly generated score datasets based on HI-FAHP weights. XGBoost is selected via error analysis to invert geological and geochemical weights and calculate the mineralization potential index. Five prospecting target areas (Ⅰ–Ⅴ) are delineated. Monte Carlo simulation yields comprehensive membership rankings: Target Ⅰ (79.55)>Target Ⅳ (76.70)>Target Ⅱ (74.50)>Target Ⅲ (71.30)>Target Ⅴ (67.20). XGBoost shows the best fit (R2=0.90), with inverted weights of 0.25 for geological and 0.75 for geochemical features, and consistent mineralization potential index ranking. Engineering verification confirms that Target I exposes galena ore bodies, Target IV delineates copper ore bodies, while the other targets show weak mineralization, highly consistent with predictions. The HI-FAHP-Monte Carlo-machine learning integrated model achieves objective multi-source data fusion and uncertainty quantification. XGBoost effectively inverts physically meaningful feature weights. Verified by engineering results, the model is reliable and suitable for multi-source data integration and exploration risk assessment in the early exploration stage..

       

    /

    返回文章
    返回