Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ... arXiv preprint arXiv:1712.05884, 2017 | 3291 | 2017 |
TACOTRON: TOWARDS END-TO-END SPEECH SYN Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ... arXiv preprint arXiv:1703.10135, 2017 | 2494* | 2017 |
A leaf recognition algorithm for plant classification using probabilistic neural network SG Wu, FS Bao, EY Xu, YX Wang, YF Chang, QL Xiang Signal Processing and Information Technology, 2007 IEEE International …, 2007 | 1251 | 2007 |
On training targets for supervised speech separation Y Wang, A Narayanan, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 1201 | 2014 |
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition Q Kong, Y Cao, T Iqbal, Y Wang, W Wang, MD Plumbley arXiv preprint arXiv:1912.10211, 2019 | 1174 | 2019 |
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ... arXiv preprint arXiv:1803.09017, 2018 | 981 | 2018 |
Complex ratio masking for monaural speech separation DS Williamson, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (3), 483-492, 2016 | 845 | 2016 |
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ... arXiv preprint arXiv:1803.09047, 2018 | 694 | 2018 |
Towards scaling up classification-based speech separation Y Wang, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (7), 1381-1390, 2013 | 546 | 2013 |
Learning spectral mapping for speech dereverberation and denoising K Han, Y Wang, DL Wang, WS Woods, I Merks, T Zhang IEEE Transactions on Audio, Speech, and Language Processing 23 (6), 982-992, 2015 | 315 | 2015 |
An algorithm to improve speech recognition in noise for hearing-impaired listeners EW Healy, SE Yoho, Y Wang, DL Wang The Journal of the Acoustical Society of America 134 (4), 3029-3038, 2013 | 281 | 2013 |
Hierarchical Generative Modeling for Controllable Speech Synthesis WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ... arXiv preprint arXiv:1810.07217, 2018 | 280 | 2018 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 253 | 2014 |
A feature study for classification-based speech separation at low signal-to-noise ratios J Chen, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 …, 2014 | 253 | 2014 |
Exploring monaural features for classification-based speech segregation Y Wang, K Han, DL Wang IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 270-279, 2013 | 250 | 2013 |
Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises J Chen, Y Wang, SE Yoho, DL Wang, EW Healy The Journal of the Acoustical Society of America 139 (5), 2604-2612, 2016 | 209 | 2016 |
Robust speaker identification in noisy and reverberant conditions X Zhao, Y Wang, DL Wang IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP) 22 (4 …, 2014 | 182 | 2014 |
Trainable frontend for robust and far-field keyword spotting Y Wang, P Getreuer, T Hughes, RF Lyon, RA Saurous Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 181 | 2017 |
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis D Stanton, Y Wang, RJ Skerry-Ryan arXiv preprint arXiv:1808.01410, 2018 | 147 | 2018 |
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan arXiv preprint arXiv:1808.10128, 2018 | 141 | 2018 |