MAISI‑3D RFlow pretrained UNet – what should I feed into class_embedding? #2008
Unanswered
arunkumar-kannan
asked this question in
Q&A
Replies: 1 comment
-
|
num_class_embeds: 128 means modality categories. In the current version, we always let the input tensor to be 1. But it is an API you can use if your datasets have several categories |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi team,
Thanks so much for the great work and the excellent codebase on MAISI model.
I’m experimenting with the MAISI‑3D RFlow pretrained UNet that ships with the model‑zoo weights. My goal is plain, unconditional generation (i.e. no ControlNet, no extra conditioning).
While inspecting the accompanying config I saw that the network was trained with
num_class_embeds:128 and therefore the checkpoint contains aclass_embedding.weightof shape [128, 256].Questions
What semantic meaning does this class‑embedding have in MAISI‑3D? (Does each of the 128 indices correspond to a specific dataset, organ, or sampling routine?)
For unconditional inference, what should I pass as class_labels? (Is feeding a tensor of zeros (torch.zeros(batch_size, dtype=torch.long)) acceptable?)
Thanks in advance for clarifying how the
class_embeddingis meant to be used!Beta Was this translation helpful? Give feedback.
All reactions