Pan Zexu

Cited by

	Total	Since 2019
Citations	434	434
h-index	9	9
i10-index	9	9

Citations per year: 2021: 21; 2022: 121; 2023: 202; 2024: 89

Public access

8 articles available
0 articles not available

Based on funding mandates

Co-authors

Haizhou Li, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, Singapore. Verified email at u.nus.edu
Tao Ruijie, National University of Singapore, ECE department. Verified email at u.nus.edu
Xinyuan Qian, Associate Professor, University of Science and Technology Beijing, China. Verified email at nus.edu.sg
Meng Ge, Tianjin University; CUHK-Shenzhen; National University of Singapore. Verified email at nus.edu.sg
Chenglin Xu, Kuaishou Technology, China. Verified email at kuaishou.com
Jonathan Le Roux, MERL. Verified email at merl.com
Zhaojie Luo, Assistant Professor, Osaka University. Verified email at irl.sys.es.osaka-u.ac.jp

Pan Zexu

MERL; National University of Singapore

Verified email at u.nus.edu

Multi-media Speaker extraction


Title	Authors	Venue	Cited by	Year
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection	R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li	Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021	145	2021
Multi-modal Attention for Speech Emotion Recognition	Z Pan, Z Luo, J Yang, H Li	Proc. Interspeech 2020, 364-368, 2020	74	2020
Selective listening by synchronizing speech with lips	Z Pan, R Tao, C Xu, H Li	IEEE/ACM Transactions on Audio, Speech and Language Processing 30, 1650-1664, 2022	36	2022
Muse: Multi-modal target speaker extraction with visual cues	Z Pan, R Tao, C Xu, H Li	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	36	2021
Multi-target DoA estimation with an audio-visual fusion mechanism	X Qian, M Madhavi, Z Pan, J Wang, H Li	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	34	2021
USEV: Universal speaker extraction with visual cue	Z Pan, M Ge, H Li	IEEE/ACM Transactions on Audio, Speech and Language Processing 30, 3032-3045, 2022	30	2022
Speaker Extraction with Co-Speech Gestures Cue	Z Pan, X Qian, H Li	IEEE Signal Processing Letters 29, 1467-1471, 2022	17	2022
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction	Z Pan, M Ge, H Li	Proc. Interspeech 2022, 2022	10	2022
VCSE: Time-Domain Visual-Contextual Speaker Extraction Network	J Li, M Ge, Z Pan, L Wang, J Dang	Proc. Interspeech 2022, 906-910, 2022	10	2022
Target active speaker detection with audio-visual cues	Y Jiang, R Tao, Z Pan, H Li	arXiv preprint arXiv:2305.12831, 2023	9	2023
Time-domain speech separation networks with graph encoding auxiliary	T Wang, Z Pan, M Ge, Z Yang, H Li	IEEE Signal Processing Letters 30, 110-114, 2023	6	2023
Is someone speaking	R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li	Proceedings of the 29th ACM International Conference on Multimedia, Oct, 2021	6	2021
Rethinking the visual cues in audio-visual speaker extraction	J Li, M Ge, Z Pan, R Cao, L Wang, J Dang, S Zhang	arXiv preprint arXiv:2306.02625, 2023	5	2023
NeuroHeed: Neuro-steered speaker extraction using EEG signals	Z Pan, M Borsdorf, S Cai, T Schultz, H Li	arXiv preprint arXiv:2307.14303, 2023	4	2023
Towards end-to-end speaker diarization in the wild	Z Pan, G Wichern, FG Germain, A Subramanian, J Le Roux	arXiv preprint arXiv:2211.01299, 2022	4	2022
ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting	Z Pan, W Wang, M Borsdorf, H Li	ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2022	3	2022
NeuroHeed+: Improving neuro-steered speaker extraction with joint auditory attention detection	Z Pan, G Wichern, FG Germain, S Khurana, J Le Roux	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
Generation or Replication: Auscultating Audio Latent Diffusion Models	D Bralios, G Wichern, FG Germain, Z Pan, S Khurana, C Hori, J Le Roux	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism	Y Chen, X Qian, Z Pan, K Chen, H Li	ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
Restoring Speaking Lips from Occlusion for Audio-Visual Speech Recognition	J Wang, Z Pan, M Zhang, RT Tan, H Li	Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19144 …, 2024	1	2024
