Blog · Gowthami Somepalli

Mar 2026

Latent Scaffolding

A series exploring emergent capabilities hidden inside vision-language (VL) conditioned single-stream diffusion transformers. No fine-tuning and no additional training, just scaffolding: architectural hacking and careful probing of what these models already know.

diffusion image-to-image emergent behavior series

Mar 2026

Latent Scaffolding: Token Dropout for Diverse Image Variations

Vision-only token dropout solves mode collapse in spliced image-to-image (I2I) generation. Along the way: hunting attention sinks, discovering which conditioning tokens are load-bearing, and finding two orthogonal knobs for controlling diversity.

diffusion image-to-image token dropout attention sinks

Mar 2026

Latent Scaffolding: Z-Image Is Secretly an Image-to-Image Model

Z-Image Base and Turbo are text-to-image models, but they're secretly image-to-image models too. A simple architectural splice unlocks zero-shot image variations with no training.

diffusion image-to-image emergent behavior
