If I'm interpreting the results from these tables correctly, Krait has
2 (!) cycles latency even for simple scalar integer arithmetics. Most of the modern processors (both ARM and Intel) have 1 cycle latency for such instructions. If this is correct, then Krait clearly does not stand a chance against genuine Cortex-A15. Krait is going to perform particularly bad on the code with long dependency chains. And by the way, GCC has some issues related to it:
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=55623