Taejin Park

Cited by

	All	Since 2019
Citations	1057	981
h-index	14	14
i10-index	17	17

Citations per year

Year	Citations
2015	4
2016	16
2017	25
2018	26
2019	37
2020	48
2021	140
2022	198
2023	286
2024	266

Public access

3 articles available, 1 article not available (based on funding mandates)

Co-authors

Name	Affiliation	Verified email
Shrikanth (Shri) Narayanan	University Professor and Niki & Max Nikias Chair in Engineering, University of Southern California	sipi.usc.edu
Keunwoo Choi	Prescient Design, Genentech	gene.com
Boris Ginsburg	NVIDIA	nvidia.com
Nithin Rao Koluguri	NVIDIA Corporation	nvidia.com
Kyu Jeong Han	Amazon Web Services (AWS)	amazon.com
Panayiotis (Panos) Georgiou	Apple/University of Southern California	apple.com
Shinji Watanabe	Carnegie Mellon University	cmu.edu
Raghuveer Peri	Applied Scientist, Amazon	amazon.com
Naoyuki Kanda	Microsoft	microsoft.com
Manoj Kumar	University of Southern California	usc.edu
Seungkwon Beack	ETRI	

Taejin Park

NVIDIA

Verified email at nvidia.com

Research interests: Speech Signal Processing, Audio Signal Processing, Machine Learning


Title	Cited by	Year
A review of speaker diarization: Recent advances with deep learning. TJ Park, N Kanda, D Dimitriadis, KJ Han, S Watanabe, S Narayanan. Computer Speech & Language 72, 101317, 2022	359	2022
Auto-tuning spectral clustering for speaker diarization using normalized maximum eigengap. TJ Park, KJ Han, M Kumar, S Narayanan. IEEE Signal Processing Letters 27, 381-385, 2019	130	2019
Titanet: Neural model for speaker representation with 1d depth-wise separable convolutions and global context. NR Koluguri, T Park, B Ginsburg. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	94	2022
Binaural rendering method and apparatus for decoding multi channel audio. YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ... US Patent 9,319,819, 2016	50	2016
Musical instrument sound classification with deep convolutional neural network using feature fusion approach. T Park, T Lee. arXiv preprint arXiv:1512.07370, 2015	49	2015
Multimodal speaker segmentation and diarization using lexical and acoustic cues via sequence to sequence neural networks. TJ Park, P Georgiou. arXiv preprint arXiv:1805.10731, 2018	43	2018
Speaker diarization with lexical information. TJ Park, KJ Han, J Huang, X He, B Zhou, P Georgiou, S Narayanan. arXiv preprint arXiv:2004.06756, 2020	38	2020
Speaker diarization using latent space clustering in generative adversarial network. M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan. ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	26	2020
Meta-learning with latent space clustering in generative adversarial network for speaker diarization. M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan. IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021	24	2021
Multi-scale speaker diarization with dynamic scale weighting. TJ Park, NR Koluguri, J Balam, B Ginsburg. arXiv preprint arXiv:2203.15974, 2022	23	2022
Tackling dynamics in federated incremental learning with variational embedding rehearsal. TJ Park, K Kumatani, D Dimitriadis. arXiv preprint arXiv:2110.09695, 2021	18	2021
Multi-scale speaker diarization with neural affinity score fusion. TJ Park, M Kumar, S Narayanan. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	17	2021
Automatic prediction of suicidal risk in military couples using multimodal interaction cues from couples conversations. SN Chakravarthula, M Nasir, SY Tseng, H Li, TJ Park, B Baucom, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	15	2020
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech. A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ... Interspeech, 2463-2467, 2019	15	2019
Encoding/decoding apparatus for processing channel signal and method therefor. JI Seo, SK Beack, DY Jang, KO Kang, TJ Park, YJ Lee, KW Choi, JW Kim. US Patent 10,068,579, 2018	13	2018
Enhancing speaker diarization with large language models: A contextual beam search approach. TJ Park, K Dhawan, N Koluguri, J Balam. ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	12	2024
The Second DIHARD Challenge: System Description for USC-SAIL Team. TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ... INTERSPEECH, 998-1002, 2019	11	2019
Robust multi-channel speech recognition using frequency aligned network. T Park, K Kumatani, M Wu, S Sundaram. ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	8	2020
Binaural rendering method and apparatus for decoding multi channel audio. YJ Lee, JI Seo, JH Yoo, SK Beack, JM Sung, TJ Lee, KO Kang, JW Kim, ... US Patent 10,199,045, 2019	8	2019
Apparatus for processing audio signal for sound bar and method therefor. JI Seo, DY Jang, TJ Park, KW Choi, KO Kang, JW Kim. US Patent App. 14/760,770, 2015	8	2015
