面向高铁列控车载设备故障知识图谱构建方法

A fault knowledge graph construction method for train-controlled on-board equipment

  • 摘要: 为了从繁杂的高铁列控车载设备故障数据中提取出有效信息、以及提高处理高铁事故和查询故障知识的效率,提出一种列控车载设备故障知识图谱构建方法. 首先,引入深度学习模型BERT-BIGRU-CRF,将其用于实体识别;其次,采用基于规则的方法进行关系抽取;然后,对文字表达相近的实体采用余弦相似度算法进行实体对齐操作;最后,将实体和关系组成的三元组存储到Neo4j数据库. 结果表明:实体识别时,在同一数据集下用提出的模型与主流模型BERT-BILSTM-CRF对比,精确率 P、 召回率 R 、 F1 值分别提高2.58、1.7、2.14个百分点;最终构造图谱中有丰富的故障数据,包括有946个实体、3 527条关系;另外利用Neo4j的Cypher语句可有效进行故障知识查询.

     

    Abstract: In order to extract effective information from the complex fault data of high-speed railway train-controlled on-board equipment, as well as to improve the efficiency of processing high-speed railway accidents and querying fault knowledge, a fault knowledge graph construction method for train-controlled on-board equipment is proposed. Firstly, the deep learning model BERT-BIGRU-CRF is introduced, which is used for entity recognition. Secondly, a rule-based method is used for relationship extraction. Then, the cosine similarity algorithm is used for entity alignment operation for entities with similar textual expressions. Finally, the ternary composed of entities and relationships is stored in the Neo4j database. The results show that for entity recognition, comparing with the mainstream model BERT-BILSTM-CRF with the proposed model under the same dataset, the precision P, recall R and F1 value are improved by 2.58, 1.7 and 2.14 percentage points respectively; and there are rich fault data in the final constructed graph, including there are 946 entities and 3 527 relations; and in addition, the fault knowledge queries can be efficiently performed by using the Cypher statement of the Neo4j.

     

/

返回文章
返回