Ye Jia
Google Brain
Verified email at google.com
Title | Cited by | Year
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Y Jia, Y Zhang, RJ Weiss, Q Wang, J Shen, F Ren, Z Chen, P Nguyen, ...
Advances in Neural Information Processing Systems, 2018
Cited by: 114; Year: 2018
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis
Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ...
arXiv preprint arXiv:1803.09017, 2018
Cited by: 111; Year: 2018
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
Cited by: 42; Year: 2018
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
Cited by: 29; Year: 2018
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 24; Year: 2019
Leveraging weakly supervised data to improve end-to-end speech-to-text translation
Y Jia, M Johnson, W Macherey, RJ Weiss, Y Cao, CC Chiu, N Ari, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 20; Year: 2019
LibriTTS: A corpus derived from librispeech for text-to-speech
H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu
arXiv preprint arXiv:1904.02882, 2019
Cited by: 20; Year: 2019
Direct speech-to-speech translation with a sequence-to-sequence model
Y Jia, RJ Weiss, F Biadsy, W Macherey, M Johnson, Z Chen, Y Wu
arXiv preprint arXiv:1904.06037, 2019
Cited by: 15; Year: 2019
Parrotron: An end-to-end speech-to-speech conversion model and its applications to hearing-impaired speech and speech separation
F Biadsy, RJ Weiss, PJ Moreno, D Kanevsky, Y Jia
arXiv preprint arXiv:1904.04169, 2019
Cited by: 6; Year: 2019
Speech Recognition with Augmented Synthesized Speech
A Rosenberg, Y Zhang, B Ramabhadran, Y Jia, P Moreno, Y Wu, Z Wu
arXiv preprint arXiv:1909.11699, 2019
Cited by: 2; Year: 2019
Learning to speak fluently in a foreign language: Multilingual speech synthesis and cross-language voice cloning
Y Zhang, RJ Weiss, H Zen, Y Wu, Z Chen, RJ Skerry-Ryan, Y Jia, ...
arXiv preprint arXiv:1907.04448, 2019
Cited by: 2; Year: 2019
The ASVspoof 2019 database
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
arXiv preprint arXiv:1911.01601, 2019
Cited by: 1; Year: 2019
Articles 1–12