Daniel Povey
Chief Speech Scientist, Xiaomi Corp.
Verified email at xiaomi.com
Title
Cited by
Year
The Kaldi speech recognition toolkit
D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, ...
IEEE 2011 workshop on automatic speech recognition and understanding, 2011
Cited by: 6155; Year: 2011
The HTK book
S Young, G Evermann, M Gales, T Hain, D Kershaw, X Liu, G Moore, ...
Cambridge university engineering department 3 (175), 12, 2002
Cited by: 4455; Year: 2002
Librispeech: an ASR corpus based on public domain audio books
V Panayotov, G Chen, D Povey, S Khudanpur
2015 IEEE international conference on acoustics, speech and signal …, 2015
Cited by: 3344; Year: 2015
X-vectors: Robust DNN embeddings for speaker recognition
D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 1892; Year: 2018
A time delay neural network architecture for efficient modeling of long temporal contexts
V Peddinti, D Povey, S Khudanpur
Sixteenth annual conference of the international speech communication …, 2015
Cited by: 1020; Year: 2015
Audio augmentation for speech recognition
T Ko, V Peddinti, D Povey, S Khudanpur
Sixteenth annual conference of the international speech communication …, 2015
Cited by: 913; Year: 2015
Minimum phone error and I-smoothing for improved discriminative training
D Povey, PC Woodland
2002 IEEE international conference on acoustics, speech, and signal …, 2002
Cited by: 903; Year: 2002
Sequence-discriminative training of deep neural networks.
K Veselý, A Ghoshal, L Burget, D Povey
Interspeech 2013, 2345-2349, 2013
Cited by: 863; Year: 2013
Purely sequence-trained neural networks for ASR based on lattice-free MMI.
D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ...
Interspeech, 2751-2755, 2016
Cited by: 844; Year: 2016
Deep neural network embeddings for text-independent speaker verification.
D Snyder, D Garcia-Romero, D Povey, S Khudanpur
Interspeech 2017, 999-1003, 2017
Cited by: 764; Year: 2017
MUSAN: A music, speech, and noise corpus
D Snyder, G Chen, D Povey
arXiv preprint arXiv:1510.08484, 2015
Cited by: 753; Year: 2015
Discriminative training for large vocabulary speech recognition
D Povey
University of Cambridge, 2005
Cited by: 680; Year: 2005
Strategies for training large scale neural network language models
T Mikolov, A Deoras, D Povey, L Burget, J Černocký
2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 196-201, 2011
Cited by: 605; Year: 2011
A study on data augmentation of reverberant speech for robust speech recognition
T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 582; Year: 2017
Large scale discriminative training of hidden Markov models for speech recognition
PC Woodland, D Povey
Computer Speech & Language 16 (1), 25-47, 2002
Cited by: 550; Year: 2002
Boosted MMI for model and feature-space discriminative training
D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, ...
2008 IEEE International Conference on Acoustics, Speech and Signal …, 2008
Cited by: 481; Year: 2008
Semi-orthogonal low-rank matrix factorization for deep neural networks.
D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur
Interspeech, 3743-3747, 2018
Cited by: 419; Year: 2018
Improving deep neural network acoustic models using generalized maxout networks
X Zhang, J Trmal, D Povey, S Khudanpur
2014 IEEE international conference on acoustics, speech and signal …, 2014
Cited by: 376; Year: 2014
fMPE: Discriminatively trained features for speech recognition
D Povey, B Kingsbury, L Mangu, G Saon, H Soltau, G Zweig
Proceedings.(ICASSP'05). IEEE International Conference on Acoustics, Speech …, 2005
Cited by: 371; Year: 2005
Deep neural network-based speaker embeddings for end-to-end speaker verification
D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ...
2016 IEEE Spoken Language Technology Workshop (SLT), 165-170, 2016
Cited by: 369; Year: 2016