Publications

Show all

2023

Sharma, Rahul; Narayanan, Shrikanth

Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection Journal Article

In: IEEE Open Journal of Signal Processing , pp. 225-232, 2023.

Abstract | Links | BibTeX | Tags: active speaker localization, computational media understanding, cross-modal learning, multimedia understanding

2022

Sharma, Rahul; Somandepalli, Krishna; Narayanan, Shrikanth

Cross modal video representations for weakly supervised active speaker localization Journal Article

In: IEEE Transactions on Multimedia, Early Access , pp. 1-12, 2022.

Abstract | Links | BibTeX | Tags: active speaker localization, cross-modal learning, multiple instance learning, weakly supervised learning