收藏切换
Spoof Speech Detection with Channel-temporal Attention and Depthwise Separable Convolutions
收藏切换
PDF
Jia-qi FENG, Hua-peng WANG*, Tian-ci LIU
Science Technology and Engineering | 2025, 25(22) : 9427 - 9435
Less
收藏切换
Science Technology and Engineering | 2025, 25(22): 9427-9435
Papers·Automation and Computational Technology
Spoof Speech Detection with Channel-temporal Attention and Depthwise Separable Convolutions
Full
Jia-qi FENG, Hua-peng WANG*, Tian-ci LIU
Affiliations
  • College of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang 110854, China
Published: 2025-08-08 doi: 10.12404/j.issn.1671-1815.2409674
Outline
收藏切换

The growing sophistication of deepfake speech poses significant security threats to ASV(automatic speaker verification) systems. Current anti-spoofing models based on CNNs(convolutional neural networks) are constrained by inadequate global feature extraction and limited generalization capability against unseen spoofing attacks. To address these challenges, a novel network architecture integrating CT-DSCNet(channel-temporal attention mechanisms with depthwise separable convolutions) was proposed. Building upon the RawNet2 framework, the developed model incorporates dual-domain attention modules to enhance discriminative feature representation while suppressing irrelevant acoustic artifacts. Furthermore, depthwise separable convolutional residual blocks were strategically implemented to optimize computational efficiency and real-time processing capabilities. Comprehensive evaluations were conducted across three benchmark datasets: ASVspoof2019 LA, ASVspoof2021 DF, and FMFCC-A. Experimental results demonstrate state-of-the-art performance with EER(equal error rate) of 1.53% on ASVspoof2019 LA, representing a 70.58% relative improvement over baseline systems. Notably, the proposed architecture exhibits superior cross-dataset generalization, achieving a 25.35% lower EER on the FMFCC-A evaluation set compared with conventional approaches. These findings validate the effectiveness of the hybrid attention-convolution design in advancing spoofing detection robustness and domain adaptability.

deepfake speech  /  attention mechanism  /  depthwise separable convolution  /  speech anti-spoofing
Jia-qi FENG, Hua-peng WANG, Tian-ci LIU. Spoof Speech Detection with Channel-temporal Attention and Depthwise Separable Convolutions[J]. Science Technology and Engineering, 2025 , 25 (22) : 9427 -9435 . DOI: 10.12404/j.issn.1671-1815.2409674
Year 2025 volume 25 Issue 22
PDF
145
61
Cite this Article
BibTeX
Article Info
doi: 10.12404/j.issn.1671-1815.2409674
  • Receive Date:2024-12-29
  • Online Date:2026-02-11
  • Published:2025-08-08
Article Data
Affiliations
History
  • Received:2024-12-29
  • Revised:2025-05-19
Funding
Affiliations
    College of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang 110854, China
References
Share
https://castjournals.cast.org.cn/joweb/kxjsygc/EN/10.12404/j.issn.1671-1815.2409674
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT