Publications

2023

Hebbar, Rajat; Bose, Digbalay; Narayanan, Shrikanth

SEAR: Semantically-grounded Audio Representations Conference

ACM Multimedia , 2023.

BibTeX | Tags: computational media understanding, multimodal, self-supervision

Bose, Digbalay; Hebbar, Rajat; Feng, Tiantian; Somandepalli, Krishna; Xu, Anfeng; Narayanan, Shrikanth

MM-AU: Towards Multimodal Understanding of Advertisement Videos Conference

ACM Multimedia , 2023.

BibTeX | Tags: advertisements, computational media understanding, content analysis, multimedia understanding, multimodal

Sharma, Rahul; Narayanan, Shrikanth

Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection Journal Article

In: IEEE Open Journal of Signal Processing , pp. 225-232, 2023.

Abstract | Links | BibTeX | Tags: active speaker localization, computational media understanding, cross-modal learning, multimedia understanding

Bose, Digbalay; Hebbar, Rajat; Somandepalli, Krishna; Narayanan, Shrikanth

Contextually-rich human affect perception using multimodal scene information Conference

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , 2023.

Abstract | Links | BibTeX | Tags: emotion recognition, multimedia understanding, multimodal

Avramidis, Kleanthis; Stewart, Shanti; Narayanan, Shrikanth

On the Role of Visual Context in Enriching Music Representations Conference

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , 2023.

Abstract | Links | BibTeX | Tags: multimedia understanding, multimodal, music representations

Hebbar, Rajat; Bose, Digbalay; Somandepalli, Krishna; Vijai, Veena; Narayanan, Shrikanth

A dataset for Audio-Visual Sound Event Detection in Movies Conference

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , 2023.

Abstract | Links | BibTeX | Tags: audio-visual event detection, multimodal

Baruah, Sabyasachee; Narayanan, Shrikanth

Character Coreference Resolution in Movie Screenplays Inproceedings

In: Findings of the Association for Computational Linguistics: ACL 2023, pp. 10300–10313, 2023.

Abstract | BibTeX | Tags: content analysis, coreference resolution, multimedia understanding

Greer, Timothy; Shi, Xuan; Ma, Benjamin; Narayanan, Shrikanth

Creating musical features using multi-faceted, multi-task encoders based on transformers Journal Article

In: Scientific Reports, 13 (1), pp. 10713, 2023.

Abstract | BibTeX | Tags: autoencoders, music representations, self-supervision

2022

Martinez, Victor; Somandepalli, Krishna; Narayanan, Shrikanth

Boys don’t cry (or kiss or dance): A computational linguistic lens into gendered actions in film Journal Article

In: PLoS One, 2022.

Abstract | BibTeX | Tags: gendered analysis, multimedia understanding, semantic role labeling

Sharma, Rahul; Somandepalli, Krishna; Narayanan, Shrikanth

Cross modal video representations for weakly supervised active speaker localization Journal Article

In: IEEE Transactions on Multimedia, Early Access , pp. 1-12, 2022.

Abstract | Links | BibTeX | Tags: active speaker localization, cross-modal learning, multiple instance learning, weakly supervised learning

Bose, Digbalay; Hebbar, Rajat; Somandepalli, Krishna; Zhang, Haoyang; Cui, Yin; Cole-McLaughlin, Kree; Wang, Huisheng; Narayanan, Shrikanth

MovieCLIP: Visual Scene Recognition in Movies Conference

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2023), 2022.

Abstract | Links | BibTeX | Tags: taxonomy curation, visual scene recognition

Tóth, Gábor Mihály; Hempel, Tim; Somandepalli, Krishna; Narayanan, Shrikanth

Studying Large-Scale Behavioral Differences in Auschwitz-Birkenau with Simulation of Gendered Narratives Journal Article

In: Digital Humanities Quarterly, 16 (3), 2022.

Abstract | Links | BibTeX | Tags: Auschwitz, computational narrative modeling, survivor testimonies

Baruah, Sabyasachee; Somandepalli, Krishna; Narayanan, Shrikanth

Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis Journal Article

In: PLoS ONE, 2022.

Abstract | Links | BibTeX | Tags: Media, Professions

2021

Baruah, Sabyasachee; Chakravarthula, Sandeep Nallan; Narayanan, Shrikanth

Annotation and Evaluation of Coreference Resolution in Screenplays Inproceedings

In: pp. 2004–2010, Association for Computational Linguistics, 2021.

Abstract | Links | BibTeX | Tags: coreference resolution

Somandepalli, Krishna; Hebbar, Rajat; Narayanan, Shrikanth S

Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos Journal Article

In: IEEE Transactions on Multimedia, 2021, ISSN: 1520-9210.

Abstract | Links | BibTeX | Tags:

Somandepalli, Krishna; Hebbar, Rajat; Narayanan, Shrikanth

Robust Character Labeling in Movie Videos: Data Resources and Self-supervised Feature Adaptation. Journal Article

In: IEEE Transactions on Multimedia, 24 , pp. 3355 - 3368, 2021.

Abstract | Links | BibTeX | Tags: computational media understanding, face clustering, face diarization, multiview correlation, self-supervision, triplet loss, video character labeling

Hebbar, Rajat; Somandepalli, Krishna; Peri, Raghuveer; Travadi, Ruchir; Tuplin, Tracy; Rivera, Fernando; Narayanan, Shrikanth

A Computational Tool to Study Vocal Participation of Women in UN-ITU Meetings Inproceedings

In: 2021 International Conference on Content-Based Multimedia Indexing (CBMI), pp. 1–4, IEEE 2021.

BibTeX | Tags:

Knox, Dillon; Greer, Timothy; Ma, Benjamin; Kuo, Emily; Somandepalli, Krishna; Narayanan, Shrikanth

Loss Function Approaches for Multi-label Music Tagging Inproceedings

In: 2021 International Conference on Content-Based Multimedia Indexing (CBMI), pp. 1–4, IEEE 2021.

BibTeX | Tags:

Ma, Benjamin; Greer, Timothy; Knox, Dillon; Narayanan, Shrikanth

A computational lens into how music characterizes genre in film Journal Article

In: PloS one, 16 (4), pp. e0249957, 2021.

BibTeX | Tags:

Somandepalli, Krishna; Guha, Tanaya; Martinez, Victor R; Kumar, Naveen; Adam, Hartwig; Narayanan, Shrikanth

Computational media intelligence: human-centered machine analysis of media Journal Article

In: Proceedings of the IEEE, 2021.

BibTeX | Tags:

2020

Martinez, Victor; Somandepalli, Krishna; Tehranian-Uhls, Yalda; Narayanan, Shrikanth

Joint Estimation and Analysis of Risk Behavior Ratings in Movie Scripts Inproceedings

In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 4780–4790, 2020.

BibTeX | Tags:

Ramakrishna, Anil Kumar; Gupta, Rahul; Narayanan, Shrikanth

Joint Multi-Dimensional Model for Global and Time-Series Annotations Journal Article

In: IEEE Transactions on Affective Computing, 2020.

BibTeX | Tags:

Narayanan, Shrikanth S; Madni, Asad M

Inclusive Human centered Machine Intelligence Journal Article

In: The Bridge, 50 , pp. 113-116, 2020.

BibTeX | Tags:

2019

Martinez, Victor R; Somandepalli, Krishna; Singla, Karan; Ramakrishna, Anil; Uhls, Yalda T; Narayanan, Shrikanth

Violence rating prediction from movie scripts Inproceedings

In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 671–678, 2019.

BibTeX | Tags:

Sharma, Rahul; Somandepalli, Krishna; Narayanan, Shrikanth

Toward visual voice activity detection for unconstrained videos Inproceedings

In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 2991–2995, IEEE 2019.

BibTeX | Tags:

Hebbar, Rajat; Somandepalli, Krishna; Narayanan, Shrikanth

Robust speech activity detection in movie audio: Data resources and experimental evaluation Inproceedings

In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4105–4109, IEEE 2019.

BibTeX | Tags:

Somandepalli, Krishna; Narayanan, Shrikanth

Reinforcing self-expressive representation with constraint propagation for face clustering in movies Inproceedings

In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4065–4069, IEEE 2019.

BibTeX | Tags:

Somandepalli, Krishna; Kumar, Naveen; Travadi, Ruchir; Narayanan, Shrikanth

Multimodal representation learning using deep multiset canonical correlation Journal Article

In: arXiv preprint arXiv:1904.01775, 2019.

BibTeX | Tags:

2018

Somandepalli, Krishna; Martinez, Victor; Kumar, Naveen; Narayanan, Shrikanth

Multimodal Representation of Advertisements Using Segment-Level Autoencoders Inproceedings

In: Proceedings of the 20th ACM International Conference on Multimodal Interaction, pp. 418–422, Association for Computing Machinery, Boulder, CO, USA, 2018, ISBN: 9781450356923.

Abstract | Links | BibTeX | Tags: advertisements, autoencoders, multimodal joint representation

Hebbar, Rajat; Somandepalli, Krishna; Narayanan, Shrikanth

Improving Gender Identification in Movie Audio Using Cross-Domain Data Inproceedings

In: Proc. Interspeech 2018, pp. 282–286, 2018.

Links | BibTeX | Tags:

Somandepalli, Krishna; Kumar, Naveen; Guha, Tanaya; Narayanan, Shrikanth S

Unsupervised Discovery of Character Dictionaries in Animation Movies Journal Article

In: IEEE Transactions on Multimedia, 20 (3), pp. 539-551, 2018.

Links | BibTeX | Tags:

2017

Ramakrishna, Anil; Martinez, Victor R; Malandrakis, Nikolaos; Singla, Karan; Narayanan, Shrikanth

Linguistic analysis of differences in portrayal of movie characters Inproceedings

In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1669–1678, Vancouver, Canada, 2017.

Abstract | Links | BibTeX | Tags:

2016

Tadimari, Adarsh; Kumar, Naveen; Guha, Tanaya; Narayanan, Shrikanth S

Opening big in box office? Trailer content can help Inproceedings

In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2777-2781, 2016.

Links | BibTeX | Tags:

Nasir, Md; Kumar, Naveen; Georgiou, Panayiotis; Narayanan, Shrikanth S

Robust Multichannel Gender Classification from Speech in Movie Audio Inproceedings

In: Proceedings of Interspeech, 2016.

Links | BibTeX | Tags:

Goyal, Ankit; Kumar, Naveen; Guha, Tanaya; Narayanan, Shrikanth S

A multimodal mixture-of-experts model for dynamic emotion prediction in movies Inproceedings

In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2822-2826, 2016.

Links | BibTeX | Tags:

2015

Guha, Tanaya; Kumar, Naveen; Narayanan, Shrikanth S; Smith, Stacy L

Computationally deconstructing movie narratives: An informatics approach Inproceedings

In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2264-2268, 2015.

Links | BibTeX | Tags:

Guha, Tanaya; Huang, Che-Wei; Kumar, Naveen; Zhu, Yan; Narayanan, Shrikanth S

Gender Representation in Cinematic Content: A Multimodal Approach Inproceedings

In: Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, pp. 31–34, Association for Computing Machinery, Seattle, Washington, USA, 2015, ISBN: 9781450339124.

Abstract | Links | BibTeX | Tags: content analysis, gender representation, movie, multimodal