I think it's very promising. The Xonontic benchmarks are very pleasing.
I am guessing from the benchmarks is that there is still some stuff falling back to software that is killing performance for certain things. With some optimization to applications and filling in some missing pieces in the drivers and we are golden. Once open source gets within about 70-80% of proprietary then I'd call it success.
Hmm, I've got a couple traces from games and my own stuff(20-70fps, 100-400MB). And they take about the same time.
Just did a quick run with vdrift about 2min, 130MB trace. Frame rate without tracing is about 22fps, with tracing 17fps, retracing 15fps(68% of original fps). Are my results atypical?
As I see it, the slowdown would be the same for all benchmarked cards and we are interested in the relative performance only.
Buy lots of RAM and store the apitrace in a Snappy or LZ4-compressed ramdiskThat should provide for a faster load time for the apitrace...
Indeed. The thing about open source drivers is that they can be debugged. It is possible to find out where they are slow, and then further optimise those parts of the code.
After adding HiZ and doing some further chasing down of performance bottlenecks in the open source code, performance can be expected to reach perhaps 80% of the closed binary drivers. Since almost no-one needs 200 fps performance, and the difference between 160 fps and 200 fps is all but imperceptible anyway, the perfromance issue with open source drivers will essentially be solved.
Bridgman, given than GCN moved to hardware scheduling, I assume lacking an advanced compiler in Mesa becomes less of a bottleneck. How would you estimate the effect of that move?
E.g. do you see GCN cards getting to 80% of catalyst, where earlier can get 70% etc?
Yeah, I don't have any real numbers but from a pure shader compiler POV my guess is that half the gap between open source and proprietary driver might go away with GCN.
For compute the impact will probably be even greater (since graphics is naturally short-vector work while compute is naturally scalar). We're also picking up some compiler improvements at the same time by using LLVM, so it could get interesting.
The bigger question is how much of the performance delta today comes from shader compiler rather than things like HyperZ, since the impact of both of them increase with display resolution.
Last edited by bridgman; 03-24-2012 at 11:03 AM.
I also think the opposite will happen with nouveau/kepler, because they removed hw scheduling there. Half the fps on a newer gen card there, on a shader-heavy workload?