Qiushi Zhu

Cited by

	All	Since 2019
Citations	181	181
h-index	8	8
i10-index	6	6

120

20222023202414 115 52

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Qiushi Zhu

University of Science and Technology of China

Verified email at mail.ustc.edu.cn

speech recognition self-supervised pre-training


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A noise-robust self-supervised pre-training model based speech representation learning for automatic speech recognition QS Zhu, J Zhang, ZQ Zhang, MH Wu, X Fang, LR Dai ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	41	2022
A Joint Speech Enhancement and Self-Supervised Representation Learning Framework for Noise-Robust Speech Recognition QS Zhu, J Zhang, ZQ Zhang, LR Dai IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	32*	2023
Robust data2vec: Noise-robust speech representation learning for asr by combining regression and improved contrastive learning QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	25	2023
VatLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning Q Zhu, L Zhou, Z Zhang, S Liu, B Jiao, J Zhang, L Dai, D Jiang, J Li, F Wei IEEE Transactions on Multimedia, 2023	23	2023
Gradient remedy for multi-task learning in end-to-end noise-robust speech recognition Y Hu, C Chen, R Li, Q Zhu, ES Chng ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	15	2023
Supervised and self-supervised pretraining based COVID-19 detection using acoustic breathing/cough/speech signals XY Chen, QS Zhu, J Zhang, LR Dai *:Equal Contribution; ICASSP 2022-2022 IEEE International Conference on …, 2022	12	2022
Wav2code: Restore clean speech representations via codebook lookup for noise-robust asr Y Hu, C Chen, Q Zhu, ES Chng IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023	8	2023
BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions J Zhang, QT Xu, QS Zhu, ZH Ling Interspeech 2023, 2023	8	2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition Y Hu, R Li, C Chen, H Zou, Q Zhu, ES Chng IJCAI 2023, 2023	4	2023
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition. Q Zhu, J Zhang, M Wu, X Fang, LR Dai Interspeech, 4334-4338, 2021	4	2021
Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization XY Zhao, QS Zhu, J Zhang 2022 Asia-Pacific Signal and Information Processing Association Annual …, 2022	3	2022
Noise-aware Speech Enhancement using Diffusion Probabilistic Model Y Hu, C Chen, R Li, Q Zhu, ES Chng arXiv preprint arXiv:2307.08029, 2023	2	2023
Rep2wav: Noise Robust text-to-speech Using self-supervised representations Q Zhu, Y Gu, C Weng, Y Hu, L Dai, J Zhang arXiv preprint arXiv:2308.14553, 2023	1	2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition Y Hu, R Li, C Chen, C Qin, Q Zhu, ES Chng ACL 2023, 2023	1	2023
Eeg2vec: Self-Supervised Electroencephalographic Representation Learning Q Zhu, X Zhao, J Zhang, Y Gu, C Weng, Y Hu arXiv preprint arXiv:2305.13957, 2023	1	2023
A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text Y Du, J Zhang, Q Zhu, L Dai, MH Wu, X Fang, ZW Yang Proc. Interspeech 2022, 2613-2617, 2022	1	2022
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Y Gu, Q Zhu, G Lei, C Weng, D Su ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
An Experimental Comparison of Noise-Robust Text-To-Speech Synthesis Systems Based On Self-Supervised Representation X Zhao, Q Zhu, Y Hu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation L Zhu, Qiushi and Zhang, Jie and Gu, Yu and Hu, Yuchen and Dai Proceedings of the AAAI Conference on Artificial Intelligence 38, 19768-19776, 2024		2024
Speech Enhancement with Multi-granularity Vector Quantization X Zhao, Q Zhu, J Zhang, Y Zhou, P Liu 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by