| 112 | 4 | 82 |
| 下载次数 | 被引频次 | 阅读次数 |
为了降低楹联文化的学习门槛,激发年轻人对楹联文化的兴趣,提出了一种基于序列到序列预训练神经网络语言模型的楹联自动生成算法。该算法将楹联应对任务建模为一个序列到序列的生成问题,将楹联的上联作为输入,并自递归地(auto-regressively)生成出符合楹联标准要求的下联。与现有神经网络方法不同,该算法模型在楹联生成任务上的训练前,在大规模无监督语料上进行预训练(pre-train),在楹联监督数据上进行微调(fine-tune)。在公开数据集上的实验证明,该算法在测试集上的BLEU值与人工评估指标相对基线模型均有明显提升,证明了该算法的有效性。
Abstract:In order to reduce the obstacles of writing Chinese couplets, and stimulate young people's interest in couplet culture, this paper proposes an automatic couplet generation algorithm based on sequence-to-sequence pre-trained neural network language model. The algorithm models the task as a sequence-to-sequence generation problem, takes the first line of the couplet as input, and auto-regressively generates the second line that meets the requirements of the Chinese couplet standard. The pre-training model used by the algorithm is composed of Transformers. During training, it is pre-trained on large-scale unsupervised corpus, and fine-tuned on the supervised data of Chinese couplets. Experiments on the public dataset show that the BLEU score and human evaluation score on test dataset are improved obviously from the baseline model, which demonstrate the effectiveness of the algorithm.
[1] 张开旭,孙茂松.统计与规则相结合的古文对联应对模型[J].中文信息学报,2009,23(1):100-105.
[2] JIANG L,ZHOU M.Generating Chinese Couplets using a Statistical MT Approach[C]//Proceedings of the 22nd International Conference on Computational Linguistics-COLING’08.August 18-22,2008.Manchester,United Kingdom.Morristown,NJ,USA:Association for Computational Linguistics,2008:377-384.
[3] 朱媛媛.基于循环神经网络的对联生成模型研究[D].长春:吉林大学,2018.
[4] 张江.基于深度神经网络的对联生成系统的研究与实现[D].南京:南京邮电大学,2020.
[5] 卫万成.基于神经网络的诗歌与对联自动生成方法研究[D].桂林:桂林电子科技大学,2019.
[6] 卫万成,黄文明,王晶,等.基于多任务学习的古诗和对联自动生成[J].中文信息学报,2019,33(11):115-124.
[7] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need,in ‘Advances in Neural Information Processing Systems’[C/EB].(2017-07-30).https://www.bibsonomy.org/bibtex/23562677774cce 5dcf84a5cde13f0867d/ dallmann:5998-6008.
[8] BENGIO Y,RéJEAN D,VINCENT P,et al.A Neural Probabilistic Language Model[J].Journal of Machine Learning Research,2003.
[9] PETERS M E,NEUMANN M,LYYER M,et al.Deep contextualized word representations[C/EB].(2017-10-27).https://www.bibsonomy.org/bibtex/ 2a08c93d224dfcfb83550246c3d6a178f/stefan.ernst.
基本信息:
中图分类号:H0-05;TP391.1
引用信息:
[1]乔露.基于序列到序列预训练语言模型的楹联自动生成算法[J].微型电脑应用,2022,38(12):42-44.
基金信息:
陕西省教育厅2020年一般专项科研项目(20JK0392)
2022-12-20
2022-12-20