Samuel Thomas
Title
Cited by
Year
The subspace Gaussian mixture model—A structured model for speech recognition
D Povey, L Burget, M Agarwal, P Akyazi, F Kai, A Ghoshal, O Glembek, ...
Computer Speech & Language 25 (2), 404-439, 2011
Cited by: 313; Year: 2011
English Conversational Telephone Speech Recognition by Humans and Machines
G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ...
arXiv preprint arXiv:1703.02136, 2017
Cited by: 253; Year: 2017
Subspace Gaussian mixture models for speech recognition
D Povey, L Burget, M Agarwal, P Akyazi, K Feng, A Ghoshal, NK Goel, ...
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
Cited by: 207; Year: 2010
Multilingual acoustic modeling for speech recognition based on subspace Gaussian mixture models
L Burget, P Schwarz, M Agarwal, P Akyazi, K Feng, A Ghoshal, N Goel, ...
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
Cited by: 177; Year: 2010
Deep neural network features and semi-supervised training for low resource speech recognition
S Thomas, ML Seltzer, K Church, H Hermansky
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 108; Year: 2013
Recognition of reverberant speech using frequency domain linear prediction
S Thomas, S Ganapathy, H Hermansky
IEEE Signal Processing Letters 15, 681-684, 2008
Cited by: 98; Year: 2008
Multilingual MLP features for low-resource LVCSR systems
S Thomas, S Ganapathy, H Hermansky
Cited by: 96; Year: 2012
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition.
A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ...
ICASSP, 8111-8115, 2013
Cited by: 93; Year: 2013
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions
S Thomas, S Ganapathy, G Saon, H Soltau
ICASSP, 2014
Cited by: 77; Year: 2014
Rapid evaluation of speech representations for spoken term discovery
MA Carlin, S Thomas, A Jansen, H Hermansky
Twelfth Annual Conference of the International Speech Communication Association, 2011
Cited by: 77; Year: 2011
Cross-lingual and multi-stream posterior features for low resource LVCSR systems
S Thomas, S Ganapathy, H Hermansky
Eleventh Annual Conference of the International Speech Communication Association, 2010
Cited by: 69; Year: 2010
Speech recognition with segmental conditional random fields: a summary of the JHU CLSP 2010 summer workshop
G Zweig, P Nguyen, D Van Compernolle, K Demuynck, L Atlas, P Clark, ...
Proc. ICASSP, 2011
Cited by: 58*; Year: 2011
Weak top-down constraints for unsupervised acoustic model training
A Jansen, S Thomas, H Hermansky
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 56; Year: 2013
The IBM Speech Activity Detection System for the DARPA RATS Program
G Saon, S Thomas, H Soltau, S Ganapathy, B Kingsbury
Cited by: 53; Year: 2013
Annealed dropout training of deep networks
SJ Rennie, V Goel, S Thomas
2014 IEEE Spoken Language Technology Workshop (SLT), 159-164, 2014
Cited by: 51; Year: 2014
Invariant Representations for Noisy Speech Recognition
D Serdyuk, K Audhkhasi, P Brakel, B Ramabhadran, S Thomas, Y Bengio
arXiv preprint arXiv:1612.01928, 2016
Cited by: 49; Year: 2016
Text-to-Speech Synthesis using syllable-like units
MN Rao, S Thomas, T Nagarajan, HA Murthy
Proceedings of National Conference on Communications, IIT, India, 277-280, 2005
Cited by: 48; Year: 2005
Phoneme recognition using spectral envelope and modulation frequency features
S Thomas, S Ganapathy, H Hermansky
2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009
Cited by: 44; Year: 2009
Improvements to the IBM speech activity detection system for the DARPA RATS program
S Thomas, G Saon, M Van Segbroeck, SS Narayanan
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 42; Year: 2015
Temporal envelope compensation for robust phoneme recognition using modulation spectrum
S Ganapathy, S Thomas, H Hermansky
The Journal of the Acoustical Society of America 128 (6), 3769-3780, 2010
Cited by: 42; Year: 2010