Spoof Speech Detection with Channel-temporal Attention and Depthwise Separable Convolutions

Jia-qi FENG, Hua-peng WANG^*, Tian-ci LIU

Science Technology and Engineering | 2025, 25(22): 9427-9435

Papers · Automation and Computational Technology

Affiliations

College of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang 110854, China

Published: 2025-08-08 | DOI: 10.12404/j.issn.1671-1815.2409674

Abstract

The growing sophistication of deepfake speech poses a significant security threat to automatic speaker verification (ASV) systems. Current anti-spoofing models based on convolutional neural networks (CNNs) are constrained by inadequate global feature extraction and limited generalization to unseen spoofing attacks. To address these challenges, CT-DSCNet, a network architecture that integrates channel-temporal attention mechanisms with depthwise separable convolutions, is proposed. Building on the RawNet2 framework, the model incorporates dual-domain (channel and temporal) attention modules to enhance discriminative feature representation while suppressing irrelevant acoustic artifacts. In addition, depthwise separable convolutional residual blocks are used to improve computational efficiency and real-time processing capability. The model is evaluated on three benchmark datasets: ASVspoof2019 LA, ASVspoof2021 DF, and FMFCC-A. Experimental results demonstrate state-of-the-art performance, with an equal error rate (EER) of 1.53% on ASVspoof2019 LA, a 70.58% relative improvement over baseline systems. Notably, the proposed architecture also generalizes better across datasets, achieving a 25.35% lower EER on the FMFCC-A evaluation set than conventional approaches. These findings support the effectiveness of the hybrid attention-convolution design in improving the robustness and domain adaptability of spoofing detection.
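
As a rough illustration of why depthwise separable convolutions can reduce computational cost, the sketch below compares the weight count of a standard 1-D convolution with that of a depthwise convolution followed by a 1x1 pointwise convolution. The function names and the layer sizes (128 channels, kernel size 3) are hypothetical choices for illustration and are not taken from the paper; the actual CT-DSCNet configuration may differ.

```python
# Illustrative sketch only: the layer sizes below are hypothetical and are
# not taken from the paper.

def standard_conv1d_params(in_channels: int, out_channels: int, kernel_size: int) -> int:
    """Weight count of a standard 1-D convolution (bias omitted)."""
    return in_channels * out_channels * kernel_size


def depthwise_separable_conv1d_params(in_channels: int, out_channels: int, kernel_size: int) -> int:
    """Weight count of a depthwise conv followed by a 1x1 pointwise conv (bias omitted)."""
    depthwise = in_channels * kernel_size    # one k-tap filter per input channel
    pointwise = in_channels * out_channels   # 1x1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    c_in, c_out, k = 128, 128, 3  # hypothetical layer sizes
    std = standard_conv1d_params(c_in, c_out, k)
    sep = depthwise_separable_conv1d_params(c_in, c_out, k)
    print(f"standard: {std} weights, separable: {sep} weights, ratio: {sep / std:.3f}")
```

In general the ratio of separable to standard weights is 1/out_channels + 1/kernel_size, so the saving grows with the kernel size and the number of output channels.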

Key words

deepfake speech / attention mechanism / depthwise separable convolution / speech anti-spoofing

Cite this Article

Jia-qi FENG, Hua-peng WANG, Tian-ci LIU. Spoof Speech Detection with Channel-temporal Attention and Depthwise Separable Convolutions[J]. Science Technology and Engineering, 2025, 25(22): 9427-9435. DOI: 10.12404/j.issn.1671-1815.2409674

Article Info

Received: 2024-12-29
Online: 2026-02-11
Published: 2025-08-08

History

Received: 2024-12-29
Revised: 2025-05-19

https://castjournals.cast.org.cn/joweb/kxjsygc/EN/10.12404/j.issn.1671-1815.2409674
