Seguir
Pablo Gimeno
Pablo Gimeno
Otros nombresPablo Gimeno Jordán
ELSA Corp.
Dirección de correo verificada de elsanow.io
Título
Citado por
Citado por
Año
Multiclass audio segmentation based on recurrent neural networks for broadcast domain data
P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida
EURASIP Journal on Audio, Speech, and Music Processing 2020, 1-19, 2020
442020
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge.
I Vinals, P Gimeno, A Ortega, A Miguel, E Lleida
Interspeech, 2803-2807, 2018
322018
Phonetically-Aware Embeddings, Wide Residual Networks with Time-Delay Neural Networks and Self Attention Models for the 2018 NIST Speaker Recognition Evaluation.
I Viñals, D Ribas, V Mingote, J Llombart, P Gimeno, A Miguel, ...
Interspeech, 4310-4314, 2019
112019
Generalizing AUC optimization to multiclass classification for audio segmentation with limited training data
P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida
IEEE Signal Processing Letters 28, 1135-1139, 2021
102021
ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. Interspeech 2019, 988-992, 2019
102019
In-domain Adaptation Solutions for the RTVE 2018 Diarization Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2018, 220-223, 2018
102018
Partial AUC Optimisation using Recurrent Neural Networks for Music Detection with Limited Training Data
P Gimeno, V Mingote, A Ortega, A Miguel, E Lleida
Interspeech, 3067-3071, 2020
92020
Convolutional Recurrent Neural Networks for Speech Activity Detection in Naturalistic Audio from Apollo Missions
P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida
Iberspeech 2021, 2021
82021
A Recurrent Neural Network Approach to Audio Segmentation for Broadcast Domain Data
P Gimeno, I Viñals, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2018, 87-91, 2018
72018
Unsupervised Representation Learning for Speech Activity Detection in the Fearless Steps Challenge 2021
P Gimeno, A Ortega, A Miguel, E Lleida
Proc. Interspeech 2021, 4359-4363, 2021
52021
Improved cross-lingual transfer learning for automatic speech translation
S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ...
arXiv preprint arXiv:2306.00789, 2023
32023
Multimodal diarization systems by training enrollment models as identity representations
V Mingote, I Viñals, P Gimeno, A Miguel, A Ortega, E Lleida
Applied Sciences 12 (3), 1141, 2022
32022
ViVoLAB Multimodal Diarization System for RTVE 2020 Challenge
V Mingote, I Vinals, P Gimeno, A Miguel, A Ortega, E Lleida
Iberspeech 2021, 2021
22021
Unsupervised adaptation of deep speech activity detection models to unseen domains
P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida
Applied Sciences 12 (4), 1832, 2022
12022
Diarization and Identity Attribution Compatibility in the Albayzin 2020 Challenge
I Viñals, P Gimeno, A Ortega, A Miguel, E Lleida
Proc. IberSPEECH 2021, 94-98, 2021
12021
ViVoVAD: a Voice Activity Detection Tool based on Recurrent Neural Networks
PG Jordán, IV Bailo, AO Giménez, AM Artiaga, EL Solano
Jornada de Jóvenes Investigadores del I3A 7, 2019
12019
Cross-Lingual Transfer Learning for Low-Resource Speech Translation
S Khurana, N Dawalatabad, A Laurent, L Vicente, P Gimeno, V Mingote, ...
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024
2024
Direct Text to Speech Translation System Using Acoustic Units
V Mingote, P Gimeno, L Vicente, S Khurana, A Laurent, J Duret
IEEE Signal Processing Letters, 2023
2023
Advances in Binary and Multiclass Audio Segmentation with Deep Learning Techniques
P Gimeno, A Ortega
2023
Multi-lingual Speech to Speech Translation for Under-Resourced Languages
A Larcher, Y Estève, M Rouvier, N Tomashenko, J Duret, G Laperriere, ...
Le Mans Université, 2022
2022
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20