Image-to-3D
Diffusers
Safetensors
SpatialGenDiffusionPipeline

There is no complete text2scene pipeline.

#4
by nikitavovch - opened

You cannot generate just from the text description + GT layout, as it says. You'll always need atleast RGB images to inference the text 2 scene pipeline. I dont get it, why you haven't made all in one script. I mean generation of rgb, depth, normal and etc. images to inference the full text to scene pipeline in one click. To inference your own generation from only a GT layout, first of all you need to run the preprocessing script, then you need to generate rgb's by yourself, with their flux wireframe model, and etc etc etc...

Sign up or log in to comment