Gowthami Somepalli

Post-training · Diffusion Models|

Gowthami Somepalli

Multimodal AI researcher obsessed with how machines perceive, remember, and generate the world. Based in Mountain View, CA.

PhD from UMD focused on diffusion model memorization — built memorization evals for diffusion models and CSD, a widely-used style similarity metric. Also built evals for video understanding: CinePile (long-video QA benchmark, Best Paper at CVPR 2024 SynCV) and ARGUS (hallucination/omission eval for dense captions). (Friends call me the "Evals Shill" for a reason.)

Before academia: did SGD in industry for a while in India, IIT Madras alum, founded a Fashion AI startup that was way too early to the party.

Open to collabs on generative modeling (evals + post-training). Hit me up: gowthami [dot] somepalli [at] gmail.com

// papers

all papers →