基于BERT的诉讼案件违法事实要素自动抽取
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391.1

基金项目:


Automatic illegal fact extraction of lawsuit case based on BERT
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对诉讼案件违法事实要素抽取效果依赖领域专业知识的特点,提出一种基于BERT的诉讼案件违法事实要素自动抽取方法。首先通过构建领域知识并采用谷歌BERT预训练语言模型进行训练得到拟合诉讼案件领域数据的模型参数和中文预训练字嵌入向量作为模型的输入,得到具有上下文相关的语义表示,以提高词嵌入的上下文语义质量,然后采用循环卷积神经网络对文本进行编码并获取在文本分类任务中扮演关键角色的信息,提升案件违法事实要素抽取的效果,最后采用focal 函数作为损失函数关注难以区分的样本。违法事实要素抽取的工作是通过对文本标签进行分类得到的。实验测试表明,该方法对诉讼案件要素抽取的F1值为86.41%,相比其他方法性能均有提高。对模型注入领域内知识,也可以提高模型抽取准确率。

    Abstract:

    Aiming at the fact that the extraction of illegal fact elements in lawsuit cases depends on special professional knowledge, an automatic illegal fact elements extraction of lawsuit cases based on BERT is proposed. First, by constructing domain knowledge and using Google BERT pre-training language model for training, model parameters fitting the domain data of lawsuit cases and embedding vector of Chinese pre-training words are obtained as the input of the model, the contextual representation is obtained to improve the quality of the context semantic of word embedding, and then the text is encoded by the cyclic convolutional neural network and the information that plays a key role in the text classification task is obtained. Finally, focal function is adopted as the loss function to focus on the indistinguishable samples. The work of extracting elements of illegal facts is obtained by classifying text labels. Experimental tests show that the F1 value of the method is 86.41%, which is better than other methods. The accuracy of model extraction can also be improved by injecting domain knowledge into the model.

    参考文献
    相似文献
    引证文献
引用本文

崔斌,邹蕾,徐明月. 基于BERT的诉讼案件违法事实要素自动抽取[J]. 科学技术与工程, 2021, 21(9): 3669-3675.
Cui Bin, Zou Lei, Xu Mingyue. Automatic illegal fact extraction of lawsuit case based on BERT[J]. Science Technology and Engineering,2021,21(9):3669-3675.

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2020-07-01
  • 最后修改日期:2021-01-12
  • 录用日期:2020-09-21
  • 在线发布日期: 2021-04-19
  • 出版日期:
×
律回春渐,新元肇启|《科学技术与工程》编辑部恭祝新岁!
亟待确认版面费归属稿件,敬请作者关注