| 34 | 0 | 38 |
| 下载次数 | 被引频次 | 阅读次数 |
为了快速统计分析电力调度数据,明确电力调度状态,针对电力调度人员声纹特征提取精度和速率较低的问题,提出一种基于卷积神经网络的声纹特征AI回溯提取方法。运用模拟数字转换器获得调度人员语音数据信号序列,计算信号不同帧的短时能量与短时过零率均值,检测声纹信号端点,在Mel频率下依照人耳听觉特性,基于线性预测系数,采取非线性转换,通过微分、高阶微分等步骤获得声纹特征参数;运用AI技术中的卷积神经网络训练调度人员语音数据,引入连续片段说话人识别模型确定目标说话人身份,创建调度人员声纹判断函数,实现声纹特征AI回溯提取目标。仿真结果表明,所提方法声纹特征提取精度高、速率快,为电力调度系统的科学化管理提供重要决策思路。
Abstract:In order to quickly analyze the power dispatching data and clarify the power dispatching state, aiming at the low accuracy and speed of voiceprint feature extraction of power dispatching personnel, an AI backtracking extraction method of voiceprint feature based on convolution neural network is proposed. The analog-to-digital converter is used to obtain the dispatcher's voice data signal sequence, calculate the short-time energy and short-time zero crossing rate mean value of different frames of the signal, detect the voiceprint signal endpoint. At Mel frequency, according to the human ear auditory characteristics, based on the linear prediction coefficient, nonlinear conversion is adopted to obtain the voiceprint characteristic parameters through differentiation, high-order differentiation and other steps. The convolution neural network in AI technology is used to train the dispatcher's voice data, the continuous segment speaker recognition model is introduced to determine the target speaker's identity, and the dispatcher's voiceprint judgment function is created to realize the AI backtracking extraction of voiceprint features. The simulation results show that the proposed method has high accuracy and fast speed of voiceprint feature extraction, and provides an important decision-making idea for the scientific management of power dispatching system.
[1] 陈钦柱,姚冬,陈林聪.构建电网调度控制系统架构的关键技术研究分析[J].微型电脑应用,2020,36(10):168-170.
[2] 何欣,尚天婷,黄一秀,等.电网调度员培训仿真系统设计与实现[J].微型电脑应用,2021,37(9):73-76.
[3] 丁勇,李佳慧,唐士杰,等.基于随机映射技术的声纹识别模板保护[J].计算机研究与发展,2020,57(10):2201-2208.
[4] 陈莹,陈湟康.基于多模态生成对抗网络和三元组损失的说话人识别[J].电子与信息学报,2020,42(2):379-385.
[5] 许春冬,凌贤鹏,应冬文,等.基于时频感知神经网络的语音频带扩展[J].信号处理,2021,37(10):2004-2012.
[6] 雷娅,方勇,张立明.基于Takenaka-Malmquist系的语音信号压缩与降噪方法[J].上海大学学报(自然科学版),2020,26(1):33-46.
[7] 胡德生,张雪英,张静,等.基于主辅网络特征融合的语音情感识别[J].太原理工大学学报,2021,52(5):769-774.
[8] 刘航,李扬,袁浩期,等.基于生成对抗网络的语音信号分离[J].计算机工程,2020,46(1):302-308.
[9] 黄洋,赵风海,卢景.语音信号处理中双门限端点检测算法的改进[J].南开大学学报(自然科学版),2021,54(2):58-62.
[10] 艾佳琪,左毅,刘君霞,等.基于余弦相似度的动态语音特征提取算法[J].计算机应用研究,2020,37(S2):147-149.
[11] 孙汉文,李喆,林睿,等.基于新奇检测的两级电气故障声纹识别算法[J].电网技术,2021,45(7):2888-2895.
[12] 肖金壮,李瑞鹏,纪盟盟.应用AAM损失函数的无文本说话人识别[J].激光杂志,2021,42(11):87-91.
[13] 徐晓梦,谭振华,李欣书.基于小波包全频分解的耐噪声纹识别算法[J].深圳大学学报(理工版),2020,37(S1):84-91.
[14] 李丽亚,闫宏印.融合递归求逆滤波的机器人混合语音识别方法[J].计算机仿真,2020,37(8):277-280.
[15] 刘云鹏,王博闻,岳浩天,等.基于50 Hz倍频倒谱系数与门控循环单元的变压器偏磁声纹识别[J].中国电机工程学报,2020,40(14):4681-4694.
基本信息:
中图分类号:TM73;TN912.3;TP18
引用信息:
[1]郭萌,徐胜,陈锦龙,等.电力调度系统人员声纹特征AI回溯提取方法[J].微型电脑应用,2025,41(01):134-137.
2025-01-20
2025-01-20