ISSN 1009-6248 | CN 61-1149/P | Bimonthly

Supervised by: China Geological Survey

Sponsored by: Xi'an Center of China Geological Survey
Geological Society of China

    • Chinese Core Journal
    • Indexed in CSCD
    • Chinese Science and Technology Core Journal
    • Indexed in Scopus

BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling

QIU Qinjun, MA Kai, ZHU Henghua, LIU Chunhua, XIE Zhong, TAN Yongjian, TAO Liufeng

Citation: QIU Qinjun, MA Kai, ZHU Henghua, et al. BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling[J]. Northwestern Geology, 2022, 55(4): 124-132. DOI: 10.19751/j.cnki.61-1149/p.2022.04.011


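Funding:

Jointly funded by the National Natural Science Foundation of China projects "Representation Model and Crowd-Intelligence Collaborative Construction of Earth Science Knowledge Graphs" (42050101) and "Research on 3D Geological Model Construction Based on Multimodal Data Understanding and Fusion" (41871311); the Jinan Urban Four-Dimensional Geological Environment Visualization Information System Platform Construction Project (2018GDCG01Z0301); the Shandong Province Key R&D Program (Major Scientific and Technological Innovation Project) "Digital Twin City Four-Dimensional Visualization Information System and Its Application in the Jinan Urban Area" (2019JZZY020105); and the China Postdoctoral Science Foundation (2021M702991).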

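Details
    About the first author:

    QIU Qinjun (born 1988), male, PhD, associate researcher. Main research area: geological big data mining and information extraction. E-mail: qiuqinjun@cug.edu.cn

    Corresponding author:

    TAO Liufeng (born 1984), male, PhD, associate researcher. Main research area: geological big data mining and information extraction. E-mail: taoliufeng@cug.edu.cn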

  • Chinese Library Classification (CLC) number: P628+.3

BERT-based Method and Significance of Constraint Information Extraction for 3D Geological Modelling

  • Abstract: The geometric, topological and attribute information of geological bodies recorded in geological reports provides important constraints for 3D geological modeling. However, traditional attribute information extraction methods suffer from limited coverage, reliance on hand-crafted features and poor model generalization. Targeting the 3D modeling task, this paper summarizes the characteristics of the text that describes the geometry, topology and attributes of geological bodies in geological reports, and proposes a 3D geological modeling information extraction method based on BERT-BiLSTM-CRF. Building on the pre-trained BERT model, a deep learning model that integrates BiLSTM and CRF is constructed. BERT supplies dynamic, character-level deep semantic representations, which compensates for the inability of static word vectors to handle polysemy and improves the extraction of complex modeling information about geological bodies. Model performance is evaluated with 43 geological reports as the data source. The experimental results show that the proposed method extracts the three types of attribute information of geological bodies with an accuracy above 90%, which provides important support for 3D geological modeling.
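The last stage of a BERT-BiLSTM-CRF tagger can be illustrated with a minimal sketch. It is not the authors' implementation: the per-character emission scores below are made-up numbers standing in for BERT + BiLSTM outputs, the label `ATT` (attribute) is a hypothetical tag name, and the learned CRF transition matrix is replaced by a hard BIO constraint (an `I-X` tag may only follow `B-X` or `I-X`). The paper works on Chinese characters; the toy input here uses ASCII characters for brevity.

```python
from typing import Dict, List, Tuple

NEG_INF = float("-inf")


def allowed(prev: str, curr: str) -> bool:
    """BIO constraint: I-X may only follow B-X or I-X."""
    if curr.startswith("I-"):
        return prev in ("B-" + curr[2:], "I-" + curr[2:])
    return True


def viterbi(emissions: List[Dict[str, float]], tags: List[str]) -> List[str]:
    """Return the highest-scoring tag path that respects the BIO constraint."""
    # A sequence may not start with an I- tag.
    score = {t: (NEG_INF if t.startswith("I-") else emissions[0][t]) for t in tags}
    back: List[Dict[str, str]] = []
    for em in emissions[1:]:
        new_score, pointers = {}, {}
        for curr in tags:
            best_score, best_prev = max(
                (score[p], p) for p in tags if allowed(p, curr)
            )
            new_score[curr] = best_score + em[curr]
            pointers[curr] = best_prev
        score = new_score
        back.append(pointers)
    # Follow the back-pointers from the best final tag.
    path = [max(tags, key=lambda t: score[t])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


def spans(chars: List[str], path: List[str]) -> List[Tuple[str, str]]:
    """Collect (label, text) pairs from a BIO-tagged character sequence."""
    out, buf, label = [], "", None
    for ch, tag in zip(chars, path):
        if tag.startswith("B-"):
            if buf:
                out.append((label, buf))
            buf, label = ch, tag[2:]
        elif tag.startswith("I-") and buf:
            buf += ch
        else:
            if buf:
                out.append((label, buf))
            buf, label = "", None
    if buf:
        out.append((label, buf))
    return out


TAGS = ["O", "B-ATT", "I-ATT"]
chars = list("h=30m")  # toy input: a thickness statement
# Hypothetical emission scores, one dict per character.
emissions = [
    {"O": 2.0, "B-ATT": 0.5, "I-ATT": 0.1},
    {"O": 2.0, "B-ATT": 0.3, "I-ATT": 0.2},
    {"O": 0.2, "B-ATT": 1.0, "I-ATT": 1.5},  # per-character argmax would pick I-ATT
    {"O": 0.3, "B-ATT": 0.4, "I-ATT": 1.8},
    {"O": 0.5, "B-ATT": 0.2, "I-ATT": 1.6},
]
path = viterbi(emissions, TAGS)
entities = spans(chars, path)
print(path)      # ['O', 'O', 'B-ATT', 'I-ATT', 'I-ATT']
print(entities)  # [('ATT', '30m')]
```

At the third character, picking the highest emission score independently would yield `I-ATT` directly after `O`, which is not a valid BIO sequence. Decoding over the whole sequence selects `B-ATT` instead, which is the kind of label consistency a CRF layer is meant to enforce on top of the BiLSTM outputs.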

Publication history
  • Received: 2022-01-25
  • Revised: 2022-06-19
  • Published online: 2023-01-09
