gpuRIR: A python library for room impulse response simulation with GPU acceleration D Diaz-Guerra, A Miguel, JR Beltran Multimedia Tools and Applications 80 (4), 5653-5671, 2021 | 146 | 2021 |
Voice pathology detection on the Saarbrücken voice database with calibration and fusion of scores using multifocal toolkit D Martínez, E Lleida, A Ortega, A Miguel, J Villalba Advances in Speech and Language Technologies for Iberian Languages …, 2012 | 127 | 2012 |
Robust sound source tracking using SRP-PHAT and 3D convolutional neural networks D Diaz-Guerra, A Miguel, JR Beltran IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 300-311, 2020 | 109 | 2020 |
Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge J Villalba, A Miguel, A Ortega, E Lleida Proc. Interspeech 2015, 2067-2071, 2015 | 97 | 2015 |
Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA JM Benedí, E Lleida, A Varona, MJ Castro, I Galiano, R Justo, I López, ... Fifth International Conference on Language Resources and Evaluation (LREC …, 2006 | 89 | 2006 |
AV@CAR: A Spanish Multichannel Multimodal Corpus for In-Vehicle Automatic Audio-Visual Speech Recognition. A Ortega, F Sukno, E Lleida, AF Frangi, A Miguel, L Buera, E Zacur LREC, 2004 | 67 | 2004 |
Intelligibility assessment and speech recognizer word accuracy rate prediction for dysarthric speakers in a factor analysis subspace D Martínez, E Lleida, P Green, H Christensen, A Ortega, A Miguel ACM Transactions on Accessible Computing (TACCESS) 6 (3), 1-21, 2015 | 57 | 2015 |
Albayzin 2018 evaluation: the IberSpeech-RTVE challenge on speech technologies for Spanish broadcast media E Lleida, A Ortega, A Miguel, V Bazán-Gil, C Pérez, M Gómez, ... Applied Sciences 9 (24), 5412, 2019 | 55 | 2019 |
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida EURASIP Journal on Audio, Speech, and Music Processing 2020, 1-19, 2020 | 52 | 2020 |
Cepstral vector normalization based on stereo data for robust speech recognition L Buera, E Lleida, A Miguel, A Ortega, Ó Saz IEEE Transactions on Audio, Speech, and Language Processing 15 (3), 1098-1113, 2007 | 51 | 2007 |
Detecting replay attacks in audiovisual identity verification H Bredin, A Miguel, IH Witten, G Chollet 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 47 | 2006 |
Prosodic features and formant modeling for an ivector-based language recognition system D Martinez, E Lleida, A Ortega, A Miguel 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 41 | 2013 |
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification V Mingote, A Miguel, A Ortega, E Lleida Computer Speech & Language 63, 101078, 2020 | 40 | 2020 |
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge. I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida Interspeech, 2803-2807, 2018 | 32 | 2018 |
Audio segmentation-by-classification approach based on factor analysis in broadcast news domain D Castán, A Ortega, A Miguel, E Lleida EURASIP Journal on Audio, Speech, and Music Processing 2014, 1-13, 2014 | 31 | 2014 |
Multi-environment models based linear normalization for speech recognition in car conditions L Buera, E Lleida, A Miguel, A Ortega 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004 | 31 | 2004 |
Unsupervised data-driven feature vector normalization with acoustic model adaptation for robust speech recognition L Buera, A Miguel, Ó Saz, A Ortega, E Lleida IEEE Transactions on Audio, Speech, and Language Processing 18 (2), 296-309, 2009 | 28 | 2009 |
Optimization of False Acceptance/Rejection Rates and Decision Threshold for End-to-End Text-Dependent Speaker Verification Systems. V Mingote, A Miguel, D Ribas, AO Giménez, E Lleida INTERSPEECH, 2903-2907, 2019 | 26 | 2019 |
Direction of arrival estimation of sound sources using icosahedral CNNs D Diaz-Guerra, A Miguel, JR Beltran IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 313-321, 2022 | 25 | 2022 |
Augmented state space acoustic decoding for modeling local variability in speech. A Miguel, E Lleida, RC Rose, L Buera, A Ortega INTERSPEECH, 3009-3012, 2005 | 25 | 2005 |