From what we have been testing internally, its no where near as bad as you are making it out to be.
That said, the extensive test schedule is strict and requires things done exactly as Apple demand it, but we do a few of our own tests. Its pretty good.