| 30 | 1 | 82 |
| 下载次数 | 被引频次 | 阅读次数 |
智能音视频分析系统中的声源运动目标检测与识别是通过估计多快拍的声源方位角实现的。为了在多快拍的前提下尽量发挥阵列行信号的稀疏先验作用,提出基于自适应套索的稀疏贝叶斯学习(aLASSO-SBL)方案。但由于该方案框架的参数更新需要复杂的矩阵运算,因此提出空间轮换变元(SRV)的优化方案,结合声源独立的特性轮换进行变量的更新,从而降低算法的运算复杂度。仿真实验结果表明,在信噪比为0 dB~20 dB和连续快拍数量小于5的条件下,所提算法具有明显的优势,能够更加准确地识别声源方位角。
Abstract:The detection and recognition of sound source moving target in intelligent audio and video analysis system is realized by estimating the sound source azimuth of multiple snapshots. In order to maximize the sparse prior function of array row signals under the premise of multiple snapshots, an adaptive LASSO based sparse Bayesian learning(aLASSO-SBL) scheme is proposed. However, since the parameter update of the scheme framework requires complex matrix operations, an optimization scheme of space rotation variable(SRV) is proposed, and the variables are updated in combination with the independent characteristics of the sound source, thereby reducing the computational complexity of the algorithm. The simulation results show that the proposed algorithm has obvious advantages under the conditions of signal-to-noise ratio of 0 dB~20 dB and the number of snapshots less than 5. It can identify the sound source azimuth more accurately.
[1] 王永良,陈辉,彭应宁,等.空间谱估计理论与算法[M].北京:清华大学出版社,2004:45-49.
[2] 陈心昭,毕传兴.近场声全息技术及其应用[M].北京:科学出版社,2013:23-51.
[3] 余恺.低成本无线声阵列网络的实时高效DOA估计研究[D].杭州:浙江大学,2016:1-117.
[4] 张磊.基于声音的智能移动终端室内定位关键技术研究[D].杭州:浙江大学,2019:1-126.
[5] 路敬祎,叶东,陈刚,等.双五元十字阵被动声定位融合算法及性能分析[J].仪器仪表学报,2016,37(4):827-833.
[6] 唐琳,孙叶,周鹏,等.基于卷积神经网络的多媒体视频压缩研究[J].微型电脑应用,2022,38(2):70-72.
[7] 柳艾飞,杨德森,时胜国,等.各向同性噪声场中单矢量传感器虚源消除MUSIC测向方法[J].声学学报,2019,44(4):698-706.
[8] 束长干.基于麦克风阵列的室内声学定位关键技术研究[D].北京:北京邮电大学,2016:1-121.
[9] STRAUSS M,MORDEL P,MIGUET V,et al.DREGON:Dataset and Methods for UAV-embedded Sound Source Localization[C]//2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).Octoter 1-5,2018,Madrid,Spain.IEEE,2018:1-8.
[10] DELEFORGE A,DI CARLO D D,STRAUSS M,et al.Audio-based Search and Rescue with a Drone:Highlights from the IEEE Signal Processing Cup 2019 Student Competition[SP competitions][J].IEEE Signal Processing Magazine,2019,36(5):138-144.
[11] XENAKI A,BOLDT J B,CHRISTENSEN M G.Sound Source Localization and Speech Enhancement with Sparse Bayesian Learning Beamforming[J].The Journal of the Acoustical Society of America,2018,143(6):3912-3921.
基本信息:
中图分类号:TN912.3;TP18
引用信息:
[1]邱兆新,关雪敏.智能音视频分析系统声源运动目标的检测与识别技术[J].微型电脑应用,2025,41(05):129-133.
基金信息:
广西民族师范学院科研项目(2020YB002)
2025-05-20
2025-05-20