this really shows there should be a 3d mesh object, with all the pitch yaw roll functions of the box, but a "load mesh" feature, i mean, cmon youve made one within the program! and its taking no control from the GPU. ontop of load mesh later on we could give it texture "diffuse" and mapping coords which are also loaded in.
i know 3d isnt like sprites, but the ability to just load meshes like youre doing here would be great and unlock alot of creativity, a simple crossection of the shape could be used for collisions, just like the 3d box