Haohan Guo

Cited by

	All	Since 2019
Citations	212	212
h-index	7	7
i10-index	6	6

2019202020212022202320248 22 41 46 75 16

Co-authors

Lei HePrincipal Scientist Manager, MicrosoftVerified email at microsoft.com
Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Xixin WuThe Chinese University of Hong KongVerified email at se.cuhk.edu.hk
Feng-Long xieXiaohongshuVerified email at xiaohongshu.com
Shaofei ZhangSenior Software Engineer, MicrosoftVerified email at microsoft.com
Shan YangTencent AI LabVerified email at nwpu-aslp.org
Dan SuTencent AI LabVerified email at tencent.com
Chunlei ZhangTencent AI Lab, Bellevue.Verified email at global.tencent.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Jiawen KangThe Chinese University of Hong KongVerified email at se.cuhk.edu.hk
Yujia XiaoThe Chinese University of Hong KongVerified email at link.cuhk.edu.hk

Haohan Guo

Chinese University of Hong Kong

Verified email at se.cuhk.edu.hk - Homepage

Speech Synthesis Voice Conversion Speech Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A new gan-based end-to-end tts training algorithm H Guo, FK Soong, L He, L Xie INTERSPEECH, 2019	58	2019
Conversational end-to-end tts for voice agents H Guo, S Zhang, FK Soong, L He, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 403-409, 2021	57	2021
Exploiting syntactic features in a parsed tree to improve end-to-end TTS H Guo, FK Soong, L He, L Xie INTERSPEECH, 2019	37	2019
Improving adversarial waveform generation based singing voice conversion with harmonic signals H Guo, Z Zhou, F Meng, K Liu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	14	2022
Feature reinforcement with word embedding and parsing information in neural TTS H Ming, L He, H Guo, FK Soong arXiv preprint arXiv:1901.00707, 2019	14	2019
Phonetic posteriorgrams based many-to-many singing voice conversion via adversarial training H Guo, H Lu, N Hu, C Zhang, S Yang, L Xie, D Su, D Yu arXiv preprint arXiv:2012.01837, 2020	10	2020
A multi-stage multi-codebook VQ-VAE approach to high-performance neural TTS H Guo, F Xie, FK Soong, X Wu, H Meng arXiv preprint arXiv:2209.10887, 2022	7	2022
MSMC-TTS: Multi-stage multi-codebook VQ-VAE based neural TTS H Guo, F Xie, X Wu, FK Soong, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1811-1824, 2023	6	2023
A multi-scale time-frequency spectrogram discriminator for GAN-based non-autoregressive TTS H Guo, H Lu, X Wu, H Meng arXiv preprint arXiv:2203.01080, 2022	5	2022
BASE TTS: Lessons from building a billion-parameter text-to-speech model on 100K hours of data M Łajszczak, G Cámbara, Y Li, F Beyhan, A van Korlaar, F Yang, A Joly, ... arXiv preprint arXiv:2402.08093, 2024	2	2024
QS-TTS: towards semi-supervised text-to-speech synthesis via vector-quantized self-supervised speech representation learning H Guo, F Xie, J Kang, Y Xiao, X Wu, H Meng arXiv preprint arXiv:2309.00126, 2023	1	2023
Towards High-Quality Neural TTS for Low-Resource Languages by Learning Compact Speech Representations H Guo, F Xie, X Wu, H Lu, H Meng arXiv preprint arXiv:2210.15131, 2022	1	2022
Unifying One-Shot Voice Conversion and Cloning with Disentangled Speech Representations H Lu, X Wu, H Guo, S Liu, Z Wu, H Meng ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition J Kang, L Meng, M Cui, H Guo, X Wu, X Liu, H Meng arXiv preprint arXiv:2401.04152, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–14

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors