Wei Zou

Cited by

	All	Since 2019
Citations	599	597
h-index	10	10
i10-index	10	10

220

110

165

20182019202020212022202320242 13 41 100 164 202 74

Public access

View all

0 articles

1 article

available

not available

Based on funding mandates

Co-authors

Shuaijiang ZhaoKE DIDI BAIDUVerified email at ke.com
Cheng Wen（文成）Beike, DiDi AI Lab, BITVerified email at ke.com
Kun HanFacebookVerified email at cse.ohio-state.edu
Shuran ZhouUniversity of WashingtonVerified email at uw.edu
Jan "Yenda" TrmalAssociate Research Scientist at Johns Hopkins UniversityVerified email at jhu.edu
Jiayu DUAlibaba DAMO AcademyVerified email at alibaba-inc.com
Guanbo WangJohns Hopkins UniversityVerified email at jhu.edu
Dan SuTencent AI LabVerified email at tencent.com
Daniel PoveyChief Speech Scientist, Xiaomi Corp.Verified email at xiaomi.com
Wei-Qiang Zhang (张卫强)Tsinghua University (清华大学)Verified email at tsinghua.edu.cn
zhao youtencent ai-labVerified email at tencent.com
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Guoguo ChenSeasalt.ai, Vobil.com, Baidu, KITT.AIVerified email at seasalt.ai
Sanjeev KhudanpurThe Johns Hopkins UniversityVerified email at jhu.edu
Zhiyuan TangTencent, Tsinghua University, University of Chinese Academy of SciencesVerified email at tsinghua.edu.cn
Haiyang XuAlibaba Group, DIDI AI LABS, SEUVerified email at seu.edu.cn
Ying LyuDiDi Research America, University of Southern CaliforniaVerified email at airbnb.com
Longbiao WangProfessor, Tianjin UniversityVerified email at tju.edu.cn
Meng GeTianjin University; CUHK-Shenzhen; National University of SingaporeVerified email at nus.edu.sg
Yingkui WangTianjin universityVerified email at tju.edu.cn

Wei Zou

PKU、Samsung、Baidu、Didi、Ke

No verified email

Speech NLP LLM


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	161	2021
Improving transformer-based speech recognition using unsupervised pre-training D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li arXiv preprint arXiv:1910.09932, 2019	99	2019
Speech simclr: Combining contrastive and reconstruction objective for self-supervised speech representation learning D Jiang, W Li, M Cao, W Zou, X Li arXiv preprint arXiv:2010.13991, 2020	70	2020
Towards end-to-end code-switching speech recognition N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li arXiv preprint arXiv:1810.13091, 2018	59	2018
A further study of unsupervised pretraining for transformer based speech recognition D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	41	2021
Comparable study of modeling units for end-to-end mandarin speech recognition W Zou, D Jiang, S Zhao, G Yang, X Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018	34	2018
Transformer based unsupervised pre-training for acoustic representation learning R Zhang, H Wu, W Li, D Jiang, W Zou, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	33	2021
Didispeech: A large scale mandarin speech corpus T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	29	2021
Audio deepfake detection system with neural stitching for add 2022 R Yan, C Wen, S Zhou, T Guo, W Zou, X Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	15	2022
Delta: A deep learning based language technology platform K Han, J Chen, H Zhang, H Xu, Y Peng, Y Wang, N Ding, H Deng, Y Gao, ... arXiv preprint arXiv:1908.01853, 2019	11	2019
Semantic data augmentation for end-to-end mandarin speech recognition J Sun, Z Tang, H Yin, W Wang, X Zhao, S Zhao, X Lei, W Zou, X Li arXiv preprint arXiv:2104.12521, 2021	9	2021
Chathome: Development and evaluation of a domain-specific language model for home renovation C Wen, X Sun, S Zhao, X Fang, L Chen, W Zou arXiv preprint arXiv:2307.15290, 2023	6	2023
Audio-visual wake word spotting system for misp challenge 2021 Y Xu, J Sun, Y Han, S Zhao, C Mei, T Guo, S Zhou, C Xie, W Zou, X Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	6	2022
Kespeech: An open source speech dataset of mandarin and its eight subdialects Z Tang, D Wang, Y Xu, J Sun, X Lei, S Zhao, C Wen, X Tan, C Xie, S Zhou, ... Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021	6	2021
TMT: A transformer-based modal translator for improving multimodal sequence representations in audio visual scene-aware dialog W Li, D Jiang, W Zou, X Li arXiv preprint arXiv:2010.10839, 2020	5	2020
From llm to conversational agent: A memory enhanced architecture with fine-tuning of large language models N Liu, L Chen, X Tian, W Zou, K Chen, M Cui arXiv preprint arXiv:2401.02777, 2024	4	2024
Time domain adversarial voice conversion for ADD 2022 C Wen, T Guo, X Tan, R Yan, S Zhou, C Xie, W Zou, X Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	4	2022
Cross-task pre-training for on-device acoustic scene classification R Zhang, W Zou, X Li arXiv preprint arXiv:1910.09935, 2019	3	2019
An Analysis of Decoding for Attention-Based End-to-End Mandarin Speech Recognition D Jiang, W Zou, S Zhao, G Yang, X Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018	2	2018
DUMA: a Dual-Mind Conversational Agent with Fast and Slow Thinking X Tian, L Chen, N Liu, Y Liu, W Zou, K Chen, M Cui arXiv preprint arXiv:2310.18075, 2023	1	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors