Blog

Notes on generative models, multimodal learning, and research ideas.

Continuous Latent Diffusion Language Model

Unified Multimodal Flow Matching Based on Cola DLM

2026 · Unified Multimodal Pretraining · Latent Diffusion · Flow Matching

An exploration of unified text-vision modeling with Cola DLM, using continuous latent spaces and a shared block-causal MMDiT for understanding and generation.