Hello! I’m a third-year PhD Candidate at the Penn State College of Information Sciences and Technology, co-advised by Prof. Dongwon Lee and Prof. Sharon Huang. My research interests lie at the intersection of Vision & Language, with a focus on multi-modal generative models and 3D Computer Vision. Prior to my PhD, I received my B.E in Computer Science and Engineering from SSN College of Engineering, Chennai, India.
As a member of the PIKE Research Group, I work on enhancing contextual alignment of Vision & Language Models in neural news media generation problems. Currently, my research is aimed at improving sentence understanding of Text-to-Image generators when prompted with Abstractive News Captions. Recognizing the security risks these domain-specific Language Models pose such as fake news synthesis and copyright infringement, I am also keen on exploring robust text watermarking strategies for downstream generation tasks.
In the past, I have had the oppotunity to do internships at The Washington Post, Ford Motor Company and other Computer Vision startups. I’m always open to collaboration and research discussions on topics related to Vision & Language tasks. Feel free to drop me an email or ping me on LinkedIn if you wish to get in touch with me!
PhD in Informatics, Currently Pursuing
The Pennsylvania State University
B.E in Computer Science and Engineering, 2021
SSN College of Engineering, Anna University