ISSN 1009-6248CN 61-1149/P 双月刊

主管单位:中国地质调查局

主办单位:中国地质调查局西安地质调查中心
中国地质学会

    高级检索

    基于BERT的三维地质建模约束信息抽取方法及意义

    BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling

    • 摘要: 地质报告中地质体的几何、拓扑及属性信息是三维地质建模过程中重要约束性信息。但传统的属性信息抽取方法存在覆盖率有限、局限于人工设计特征及模型泛化能力差等问题。面向三维建模任务,总结了地质报告中地质体的几何、拓扑及属性文本的特点,提出了一种基于BERT-BiLSTM-CRF的三维地质建模信息抽取方法;基于BERT预训练模型,构建融合BiLSTM和CRF的深度学习模型,通过BERT模型获取动态字符深层次语义信息,弥补静态词向量无法解决一词多义的问题,提高地质体复杂建模信息的抽取能力。以43篇地质报告为数据源进行模型性能评估,实验结果表明所提出的方法对于地质体三类属性信息抽取准确率达到90%以上,对于三维地质建模具有重要支撑作用。

       

      Abstract: The geometry, topology and attribute information of geological bodies in geological reports are important constraint information in the 3D geological modeling process. However, the traditional attribute information extraction methods have problems such as limited coverage, limited to artificial design features and poor model generalization ability. Facing the 3D modeling task, the geometry, topology and attribute text characteristics of geological bodies in geological reports are summarized, and a 3D geological modeling information extraction method based on BERT-BiLSTM-CRF is proposed; based on the BERT pre-training model, a deep learning model integrating BiLSTM and CRF is constructed to obtain deep semantic information of dynamic characters through the BERT model to make up for the static word vector cannot solve the problem of multiple meanings of a word, and improve the extraction ability of complex modeling information of geological bodies. The model performance is evaluated with 43 geological reports as the data source, and the experimental results show that the proposed method has an accuracy rate of over 90% for extracting three types of attribute information of geological bodies, which is an important support for 3D geological modeling.

       

    /

    返回文章
    返回