Wei Zou
PKU, Samsung, Baidu, Didi, Ke
Title
Cited by
Year
GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
Cited by: 161, Year: 2021
Improving transformer-based speech recognition using unsupervised pre-training
D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li
arXiv preprint arXiv:1910.09932, 2019
Cited by: 99, Year: 2019
Speech SimCLR: Combining contrastive and reconstruction objective for self-supervised speech representation learning
D Jiang, W Li, M Cao, W Zou, X Li
arXiv preprint arXiv:2010.13991, 2020
Cited by: 70, Year: 2020
Towards end-to-end code-switching speech recognition
N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li
arXiv preprint arXiv:1810.13091, 2018
Cited by: 59, Year: 2018
A further study of unsupervised pretraining for transformer based speech recognition
D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 41, Year: 2021
Comparable study of modeling units for end-to-end Mandarin speech recognition
W Zou, D Jiang, S Zhao, G Yang, X Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
Cited by: 34, Year: 2018
Transformer based unsupervised pre-training for acoustic representation learning
R Zhang, H Wu, W Li, D Jiang, W Zou, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 33, Year: 2021
DiDiSpeech: A large scale Mandarin speech corpus
T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 29, Year: 2021
Audio deepfake detection system with neural stitching for ADD 2022
R Yan, C Wen, S Zhou, T Guo, W Zou, X Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 15, Year: 2022
DELTA: A deep learning based language technology platform
K Han, J Chen, H Zhang, H Xu, Y Peng, Y Wang, N Ding, H Deng, Y Gao, ...
arXiv preprint arXiv:1908.01853, 2019
Cited by: 11, Year: 2019
Semantic data augmentation for end-to-end Mandarin speech recognition
J Sun, Z Tang, H Yin, W Wang, X Zhao, S Zhao, X Lei, W Zou, X Li
arXiv preprint arXiv:2104.12521, 2021
Cited by: 9, Year: 2021
ChatHome: Development and evaluation of a domain-specific language model for home renovation
C Wen, X Sun, S Zhao, X Fang, L Chen, W Zou
arXiv preprint arXiv:2307.15290, 2023
Cited by: 6, Year: 2023
Audio-visual wake word spotting system for MISP Challenge 2021
Y Xu, J Sun, Y Han, S Zhao, C Mei, T Guo, S Zhou, C Xie, W Zou, X Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 6, Year: 2022
KeSpeech: An open source speech dataset of Mandarin and its eight subdialects
Z Tang, D Wang, Y Xu, J Sun, X Lei, S Zhao, C Wen, X Tan, C Xie, S Zhou, ...
Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021
Cited by: 6, Year: 2021
TMT: A transformer-based modal translator for improving multimodal sequence representations in audio visual scene-aware dialog
W Li, D Jiang, W Zou, X Li
arXiv preprint arXiv:2010.10839, 2020
Cited by: 5, Year: 2020
From LLM to conversational agent: A memory enhanced architecture with fine-tuning of large language models
N Liu, L Chen, X Tian, W Zou, K Chen, M Cui
arXiv preprint arXiv:2401.02777, 2024
Cited by: 4, Year: 2024
Time domain adversarial voice conversion for ADD 2022
C Wen, T Guo, X Tan, R Yan, S Zhou, C Xie, W Zou, X Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 4, Year: 2022
Cross-task pre-training for on-device acoustic scene classification
R Zhang, W Zou, X Li
arXiv preprint arXiv:1910.09935, 2019
Cited by: 3, Year: 2019
An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition
D Jiang, W Zou, S Zhao, G Yang, X Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
Cited by: 2, Year: 2018
DUMA: a Dual-Mind Conversational Agent with Fast and Slow Thinking
X Tian, L Chen, N Liu, Y Liu, W Zou, K Chen, M Cui
arXiv preprint arXiv:2310.18075, 2023
Cited by: 1, Year: 2023