Dropping objects from 3cm above helps improve real-world success rates;
it’s an arbitrary choice to help avoid motion planning problems that are out of scope.
We did experiments with the actual predicted poses, and saw that StructDiffusion still achieves the best
performance with a 70.0% success rate (vs. 54.0% for the next best). We will add more analysis to the final paper.