Aashish Anantha Ramakrishnan

Aashish Anantha Ramakrishnan

Informatics PhD Candidate

The Pennsylvania State University

About Me

Hello! I’m a third-year PhD Candidate at the Penn State College of Information Sciences and Technology, co-advised by Prof. Dongwon Lee and Prof. Sharon Huang. My research interests lie at the intersection of Vision & Language, with a focus on multi-modal generative models and 3D Computer Vision. Prior to my PhD, I received my B.E in Computer Science and Engineering from SSN College of Engineering, Chennai, India.

As a member of the PIKE Research Group, I work on enhancing contextual alignment of Vision & Language Models in neural news media generation problems. Currently, my research is aimed at improving sentence understanding of Text-to-Image generators when prompted with Abstractive News Captions. Recognizing the security risks these domain-specific Language Models pose such as fake news synthesis and copyright infringement, I am also keen on exploring robust text watermarking strategies for downstream generation tasks.

In the past, I have had the opportunity to do internships at The Washington Post, Ford Motor Company, and other Computer Vision startups. I’m always open to collaboration and research discussions on topics related to Vision & Language tasks. Feel free to drop me an email or ping me on LinkedIn if you wish to get in touch with me!

  • Computer Vision
  • Natural Language Processing
  • Multi-modal Generative AI
  • PhD in Informatics, Currently Pursuing

    The Pennsylvania State University

  • B.E in Computer Science and Engineering, 2021

    SSN College of Engineering, Anna University

Recent Posts

Recent Publications and Pre-Prints

Quickly discover relevant content by filtering publications.
(2023). ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions. arXiv [cs.CV].

Cite Dataset Arxiv