So as for me it looks like compiler issue rather than anything else. In fact, VLIW4 seems to be lite version of VLIW5. AMD just saved some bucks on making smaller cheaper ICs and selling them as "new", "improved" thingies. Sure, they improved TDP. At cost of computations speed . Yet selling cards of same class under same price. Epic marketing win (for AMD).