Poster

Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation

Zongrui Li ⋅ Minghui Hu ⋅ Qian Zheng ⋅ Xudong Jiang

Strong blind review: This paper was not made available on public preprint services during the review process

Strong Double Blind

2024 Poster

Paper PDF [ Poster] [ Supplemental]

Abstract

Recent advancements in text-to-3D generation significantly improved outcome quality despite issues like oversaturation and missing detail. Upon thoroughly examining score distillation, we've identified similarities between score and consistency distillation methods. In this work, We first elucidate the equivalent formulation of score distillation via the consistency function format, which mitigates the divide between text-to-image distillation and text-to-3D distillation and benefits the research in both fields. Our investigation further reveals the current methods under consistency distillation framework for 3D contexts still have limitations, such as the distillation errors and inconsistencies of trajectories. To address this issue, we propose Guided Consistency Sampling (GCS), which is seamlessly integrated with advanced rendering techniques and Gaussian Splatting (GS). GCS consists of three parts: A compact consistency loss leads to a better generator for the origin, a conditional guidance score enriches the detail, and a constrain on the pixel domain enhances the color and light impact. Furthermore, we have observed persistent 3D rendering issues inherent in the optimization process of GS-based methods, such as oversaturation. We attribute this to the accumulation of highlights and introduce the Brightness-equalized Generation (BEG) method to effectively alleviate the problem. Experimental results demonstrate that our approach yields higher-quality outcomes with more intricate details than the current state-of-the-art methods.

Chat is not available.