Forgive me if I'm wrong, but in regular collision detection, the computer must loop through all x, and y coords, then if it finds something it reports it. However with 3d its like adding umpteenth loops to all that. So now its like going over x,y, of first z, then, x, y of second etc etc. Its like adding hundreds or thousands of iterations.