Name: Parallel Sessions 1K: Optimization for GenAI -- diffusion models and LLMs
Start: 2025-07-21T10:30:00-0700
End: 2025-07-21T11:45:00-0700

Monday July 21, 2025 10:30am - 11:45am PDT

Taper Hall (THH) 118

Session: Optimization for GenAI -- diffusion models and LLMs
Chair: Wenpin Tang
Cluster: Optimization For Data Science

Talk 1: Gradient Guidance for Diffusion Models: An Optimization Perspective
Speaker: Minshuo Chen
Abstract: Diffusion models have demonstrated empirical success in various applications and can be adapted to task-specific needs via guidance. This talk introduces a form of gradient guidance for adapting or fine-tuning diffusion models towards user-specified optimization objectives. We study the theoretical aspects of a guided score-based sampling process, linking the gradient-guided diffusion model to first-order optimization. We show that adding gradient guidance to the sampling process of a pre-trained diffusion model is essentially equivalent to solving a regularized optimization problem, where the regularization term acts as a prior determined by the pre-training data. We further consider an iteratively fine-tuned version of gradient-guided diffusion, where one can query gradients at newly generated data points and update the score network using new samples. This process mimics a first-order optimization iteration in expectation, for which we prove an O(1/K) convergence rate to the global optimum when the objective function is concave.
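
Illustration (not from the talk): the link between guidance and regularized optimization can be seen in a one-dimensional toy problem. The sketch below is an assumption-laden stand-in, not the speaker's method: it replaces the diffusion sampler with plain Langevin dynamics, uses an exact Gaussian score in place of a trained score network, and picks a hypothetical concave objective f(x) = -(x - TARGET)^2 / 2. The names MU, SIGMA, TARGET, and LAM are all made up for the example. Adding LAM * grad f to the score shifts the samples toward the maximizer of LAM * f(x) + log p(x), so the data distribution plays the role of the regularizer.

```python
import math
import random

# Hypothetical toy setup (assumptions, not from the talk):
MU, SIGMA = 0.0, 1.0      # "pre-training" distribution p = N(MU, SIGMA^2)
TARGET, LAM = 4.0, 1.0    # objective f(x) = -(x - TARGET)^2 / 2, guidance strength LAM

def score(x):
    """Exact score d/dx log p(x) of N(MU, SIGMA^2); stands in for a trained score network."""
    return -(x - MU) / SIGMA**2

def grad_f(x):
    """Gradient of the concave toy objective f(x) = -(x - TARGET)^2 / 2."""
    return -(x - TARGET)

def guided_langevin_mean(steps=20000, burn_in=2000, eta=0.01, seed=0):
    """Run Langevin dynamics with drift score + LAM * grad_f; return the post-burn-in sample mean."""
    rng = random.Random(seed)
    x = rng.gauss(MU, SIGMA)
    total = 0.0
    for k in range(steps + burn_in):
        drift = score(x) + LAM * grad_f(x)
        x = x + eta * drift + math.sqrt(2.0 * eta) * rng.gauss(0.0, 1.0)
        if k >= burn_in:
            total += x
    return total / steps

def regularized_optimum():
    """Closed-form maximizer of LAM * f(x) + log p(x) for this Gaussian/quadratic pair."""
    precision = 1.0 / SIGMA**2 + LAM
    return (MU / SIGMA**2 + LAM * TARGET) / precision

if __name__ == "__main__":
    print("regularized optimum:", regularized_optimum())
    print("guided sample mean: ", guided_langevin_mean())
```

In this toy case the guided samples concentrate around a point between the data mean and the maximizer of f, which is the compromise a prior-regularized objective would be expected to produce.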

Talk 2: RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Speaker: Hanyang Zhao
Abstract: Recently, numerous preference optimization algorithms have been introduced as extensions to the Direct Preference Optimization (DPO) family. While these methods have successfully aligned models with human preferences, there is a lack of understanding regarding the contributions of their additional components. Moreover, fair and consistent comparisons are scarce, making it difficult to discern which components genuinely enhance downstream performance. In this work, we propose RainbowPO, a unified framework that demystifies the effectiveness of existing DPO methods by categorizing their key components into seven broad directions. We integrate these components into a single cohesive objective, enhancing the performance of each individual element. Through extensive experiments, we demonstrate that RainbowPO outperforms existing DPO variants. Additionally, we provide insights to guide researchers in developing new DPO methods and assist practitioners in their implementations.
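
Illustration (not from the talk): for readers unfamiliar with the baseline that these variants extend, the standard DPO loss for a single preference pair can be written in a few lines. This is a minimal sketch of plain DPO only; it does not implement RainbowPO or any of the seven component directions mentioned in the abstract. The function and argument names are hypothetical, and the inputs are assumed to be summed log-probabilities of the chosen and rejected responses under the policy and under a frozen reference model.

```python
import math

def log_sigmoid(z):
    """Numerically stable log(sigmoid(z))."""
    return -(max(-z, 0.0) + math.log1p(math.exp(-abs(z))))

def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    The margin is the policy's log-ratio against the reference on the chosen
    response minus the same log-ratio on the rejected response.
    """
    margin = (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    return -log_sigmoid(beta * margin)

if __name__ == "__main__":
    # Policy identical to the reference: zero margin, loss equals log(2).
    print(dpo_loss(-1.0, -1.0, -1.0, -1.0))
    # Policy favors the chosen response more than the reference does: lower loss.
    print(dpo_loss(-1.0, -3.0, -2.0, -2.0))
```

The loss falls below log(2) when the policy prefers the chosen response more strongly than the reference does, and rises above it in the opposite case.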

Talk 3: A preliminary study on the generation process of diffusion models with different noise distributions
Speaker: Nanshan Jia
Abstract: We propose a class of structured diffusion models in which the prior distribution is chosen as a mixture of Gaussians rather than a standard Gaussian distribution. The specific Gaussian mixture used as the prior can be chosen to incorporate certain structured information about the data. We develop a simple-to-implement training procedure that smoothly accommodates the use of a Gaussian mixture as the prior. Theory is provided to quantify the benefits of our proposed models compared to classical diffusion models. Numerical experiments with synthetic, image, and operational data are conducted to show the comparative advantages of our model. Our method is shown to be robust to misspecification and is particularly suited to situations where training resources are limited or faster training in real time is desired.
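
Illustration (not from the talk): the only ingredient the abstract pins down is that the prior is a Gaussian mixture instead of a standard Gaussian. The sketch below shows what drawing from such a prior might look like in one dimension. The centers, weights, and shared standard deviation are hypothetical values chosen for the example, and how the prior enters the speaker's training procedure is not specified in the abstract, so none of that is modeled here.

```python
import random

def sample_mixture_prior(centers, weights, sigma, rng):
    """Draw one sample from a 1-D Gaussian mixture with a shared standard deviation."""
    center = rng.choices(centers, weights=weights, k=1)[0]  # pick a component
    return rng.gauss(center, sigma)                         # sample around its center

# Hypothetical mixture: two well-separated components with equal weight.
CENTERS = [-5.0, 5.0]
WEIGHTS = [0.5, 0.5]
SIGMA = 0.1

rng = random.Random(0)
samples = [sample_mixture_prior(CENTERS, WEIGHTS, SIGMA, rng) for _ in range(1000)]

if __name__ == "__main__":
    near_left = sum(1 for s in samples if s < 0)
    print("samples near -5:", near_left, "| samples near +5:", len(samples) - near_left)
```

If the centers were placed near clusters in the data, each sample would start closer to a plausible data region than a draw from a standard Gaussian would, which is one intuitive reading of the "structured information" the abstract mentions.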

Speakers

Minshuo Chen

Name: Dr. Slothington "Slow Convergence" McNapface
Title: Distinguished Professor of Continuous Optimization & Energy Minimization
Affiliation: The Lush Canopy Institute of Sluggish Algorithms
Bio: Dr. Slothington McNapface is a leading expert in continuous optimization, specializing...

ICCOPT2025USC

Minshuo Chen

Wenpin Tang

Hanyang Zhao

Nanshan Jia
