Shaofei Zhang

Cited by

	All	Since 2019
Citations	202	165
h-index	7	6
i10-index	5	5

Citations per year
Year	Citations
2015	2
2016	13
2017	13
2018	9
2019	20
2020	19
2021	24
2022	28
2023	59
2024	15

Public access (based on funding mandates)
3 articles available
1 article not available

Co-authors

Lei Xie	Northwestern Polytechnical University	Verified email at nwpu.edu.cn
Lei He	Principal Scientist Manager, Microsoft	Verified email at microsoft.com
Haohan Guo	Chinese University of Hong Kong	Verified email at se.cuhk.edu.hk
Yihan Wu	Renmin University of China	Verified email at ruc.edu.cn
Yougen Yuan	Tencent, Beijing	Verified email at nwpu-aslp.org
Pengcheng Zhu	Fuxi AI Lab, NetEase Inc.	Verified email at corp.netease.com

Senior Software Engineer, Microsoft

Verified email at microsoft.com

Speech Synthesis, Natural Language Processing, Pronunciation/Prosody Assessment


Title	Cited by	Year
Conversational end-to-end TTS for voice agents. H Guo, S Zhang, FK Soong, L He, L Xie. 2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021	57	2021
Exemplar-based sparse representation of timbre and prosody for voice conversion. H Ming, D Huang, L Xie, S Zhang, M Dong, H Li. 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	47	2016
Fundamental frequency modeling using wavelets for emotional voice conversion. H Ming, D Huang, M Dong, H Li, L Xie, S Zhang. 2015 International Conference on Affective Computing and Intelligent …, 2015	42	2015
ParaTTS: Learning linguistic and prosodic cross-sentence information in paragraph-based TTS. L Xue, FK Soong, S Zhang, L Xie. IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 2854-2864, 2022	14	2022
Self-supervised context-aware style representation for expressive speech synthesis. Y Wu, X Wang, S Zhang, L He, R Song, JY Nie. arXiv preprint arXiv:2206.12559, 2022	13	2022
An automatic voice conversion evaluation strategy based on perceptual background noise distortion and speaker similarity. DY Huang, L Xie, S Zhang, YSW Lee, J Wu, H Ming, X Tian, C Ding, M Li, ...	9	2016
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation. S Zhang, D Huang, L Xie, ES Chng, H Li, M Dong. 2015 Asia-Pacific Signal and Information Processing Association Annual …, 2015	9	2015
A hybrid virtual bass system with improved phase vocoder and high efficiency. S Zhang, L Xie, ZH Fu, Y Yuan. The 9th International Symposium on Chinese Spoken Language Processing, 401-405, 2014	5	2014
StyleSpeech: Self-supervised style enhancing with VQ-VAE-based pre-training for expressive audiobook speech synthesis. X Chen, X Wang, S Zhang, L He, Z Wu, X Wu, H Meng. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	2	2024
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023. Z Xu, S Zhang, X Wang, J Zhang, W Wei, L He, S Zhao. arXiv preprint arXiv:2309.02743, 2023	1	2023
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading. Y Xiao, S Zhang, X Wang, X Tan, L He, S Zhao, FK Soong, T Lee. arXiv preprint arXiv:2307.00782, 2023	1	2023
Paragraph synthesis with cross utterance features for neural TTS. S Zhang, L He. US Patent App. 17/631,695, 2022	1	2022
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation. S Zhang, DY Huang, L Xie, ES Chng, H Li, M Dong. Sixteenth Annual Conference of the International Speech Communication …, 2015	1	2015
Large-Scale Automatic Audiobook Creation. B Walsh, M Hamilton, G Newby, X Wang, S Ruan, S Zhao, L He, S Zhang, ... arXiv preprint arXiv:2309.03926, 2023		2023
