I am trying to figure out what would be a good architecture for neural network that takes projections (2D images) from different angles and creates volume consisting of 2D slices (CT-like).
So for example:
- Input [180,100,100] -> 180 projections of image 100x100 pixels.
- Output [100,100,100] -> Volume of size 100x100x100 (100 slices of 2D images)
I have ground truth volumes.
I came up with the idea of using ResNet as Encoder. But I'm not really sure how to implement Decoder and what model would be a good choice for this kind of problem. I did think of U-net architecture, but output dimension is different, so I've abandoned this idea.
I am using PyTorch.