AMD Radeon GPU compute update

Memory operations: Vertex buffer read, atomic operations, cacheless write are working, which means that the most problematic issues with compute support
have been taken care of.
It is possible that, there are still problems with thread scheduling, I haven't tested the code with too many threads. But with but relatively small amount of work the code can be made usable, and GPU machine code
can be used.
Next step is some kind of compatibility with GPU binaries compiled from
OpenCL by the AMD compiler.
code repository:

git://github.com/rakadam/HD-Radeon-Compute.git
git://github.com/rakadam/AMD-GPU-Asm-Disasm.git

The code is still a bit buggy (need better makefiles and some refactoring, some libdrm functionality has been duplicated in C++ too)