Department of Law, Universidade Federal do Rio Grande do Norte, Caicó, RN, Brazil.
Department of Private Law, Universidade Federal do Rio Grande do Norte, Natal, RN, Brazil.
PLoS One. 2022 Jul 28;17(7):e0272287. doi: 10.1371/journal.pone.0272287. eCollection 2022.
Legal scholars have been trying to predict the outcomes of trials for a long time. In recent years, researchers have been harnessing advancements in machine learning to predict the behavior of natural and social processes. At the same time, the Brazilian judiciary faces a challenging number of new cases every year, which generates the need to improve the throughput of the justice system. Based on those premises, we trained three deep learning architectures, ULMFiT, BERT, and Big Bird, on 612,961 Federal Small Claims Courts appeals within the Brazilian 5th Regional Federal Court to predict their outcomes. We compare the predictive performance of the models to the predictions of 22 highly skilled experts. All models outperform human experts, with the best one achieving a Matthews Correlation Coefficient of 0.3688 compared to 0.1253 from the human experts. Our results demonstrate that natural language processing and machine learning techniques provide a promising approach for predicting legal outcomes. We also release the Brazilian Courts Appeal Dataset for the 5th Regional Federal Court (BrCAD-5), containing data from 765,602 appeals to promote further developments in this area.
法律学者长期以来一直试图预测审判结果。近年来,研究人员一直在利用机器学习的进步来预测自然和社会过程的行为。与此同时,巴西司法机构每年都会面临大量新案件,这就需要提高司法系统的吞吐量。基于这些前提,我们在巴西第 5 地区联邦法院的 612961 件联邦小额索赔法院上诉案件中训练了三个深度学习架构,即 ULMFiT、BERT 和 Big Bird,以预测其结果。我们将模型的预测性能与 22 名经验丰富的专家的预测进行了比较。所有模型都优于人类专家,其中表现最好的模型的马修斯相关系数为 0.3688,而人类专家的马修斯相关系数为 0.1253。我们的研究结果表明,自然语言处理和机器学习技术为预测法律结果提供了一种有前途的方法。我们还发布了巴西第 5 地区联邦法院(BrCAD-5)的法庭上诉数据集,其中包含 765602 起上诉案件的数据,以促进该领域的进一步发展。