I think this is the profiling work Marek mentioned; it may be interesting for you too.
Again, very useful. I hope that with an apitrace/oprofile/CS cross-check I can find some bottlenecks, and maybe play a bit with Tom Stellard's LLVM compiler (it won't be easy, though it should be a good way to get more intimate with the Gallium code).