does GCC 4.6 optimise for FMA4 and XOP.
FMA can fuse a multiply and an add into a single step, and so could give a big speed up to some code.
also wouldn't it be great if GCC had a 'fastest flags that don't break the test-suite' option.
does GCC 4.6 optimise for FMA4 and XOP.
FMA can fuse a multiply and an add into a single step, and so could give a big speed up to some code.
also wouldn't it be great if GCC had a 'fastest flags that don't break the test-suite' option.
If anyone's interested in seeing how Mesa parses shaders, I had ported Aras' Mesa glsl optimizer back to linux in the beginning of the summer. It's not only a syntax check, but also outputting the common optimizations, I found it useful in shader development.
You may need to get the code from the linux merge, I haven't tested if master still runs on linux, and Aras is mostly win/mac.
https://github.com/aras-p/glsl-optimizer
Ummm, if you google for for "-march=bdver1" you'll find all the information you need!Most important piece of info is this: http://goo.gl/LXkBr.
Nope! I'd share a document with you, but I don't want to get my butt kicked, so I found some public info on the home page of the Swiss National Supercomputing Center (they are just getting their Interlagos-based Cray XMT shipped) which covers the most important stuff:
http://user.cscs.ch/news/2011/10/17/...-xop-and-fma4/
Excellent!
Another minor issue, reproducible with both Chromium and Google Chrome version 14.0.835.202, see screenshot: http://dl.dropbox.com/u/239841/openb...hromium_14.png
I remember testing XOP with gcc 4.5 and gcc 4.5 release notes say that's possible:
Support for the XOP, FMA4, and LWP instruction sets for the AMD Orochi processors are now available with the -mxop, -mfma4, and -mlwp options.
IRC is by apppointment only afaik. Have you considered the forum http://vdrift.net/ or the issue tracker https://github.com/VDrift/vdrift/issues ?