Created attachment 33479 [details] the test case to reproduce the issue After running the testcase "vsraytrace" the X server craches and the screen remains black. I can not switch back to console. The kernel is not hung, I can soft-shut down the machine. kernel 2.6.32.8 X 1.7.5-1 libdrm 2.4.17-1 xf86-video-intel 2.9.1-2 mesa 7.7-3 (same issue with 7.6.1-1) Everything except for the kernel from official unstable/experimental debian packages.
Comment on attachment 33479 [details] the test case to reproduce the issue the test cases uses shaderutil.c/h from progs/util/ in Mesa sources.
Created attachment 33480 [details] hopefully relevant part of kern.log
Created attachment 33481 [details] log of crashing X server
Created attachment 33482 [details] log from kdm
The issue persists with Debian's libdrm 2.4.18-1.
The problem is still there with the 2.6.32.9 kernel.
If you are here, have a look at https://bugs.freedesktop.org/show_bug.cgi?id=27060 too please.
I have re-run the test case with today's Mesa master 8df65e98998b4c104db30cbba8a38be7eb2a9acd (including the above referred patch) and drm master c1c8bbf80b1f734e23996bf805dc78f32ebaf56f and the X server crash still exists!
I provided the test case that reliably reproduces to problem for me, so would someone from Intel please have a look into this?
(In reply to comment #9) > I provided the test case that reliably reproduces to problem for me, so would > someone from Intel please have a look into this? > Please run the test case with LIBGL_ALWAYS_SOFTWARE=1 ./vsraytrace to what the correct rendering should be.
The trigger seems to be the number of vertices in the pointset. Using 10x10, 50x50 does not trigger the hang, but using 250x250 points does. I'm not certain what the significance of this is yet...
Could be that our hangcheck timer is too aggressive, given that there's no error reported but we get a hangcheck timeout... Tag you're it Chris!
Also it would be very nice, if the X server was able to restart after.
Piglit case glsl-vs-raytrace-bug26691 failed with error message: intel_bufmgr_gem.c:1070: Error setting domain 598: Input/output error intel_bufmgr_gem.c:1247: Error setting memory domains 598 (00000040 00000000): Input/output error . X still alive, not crash. After running this case, all the rest of piglit cases failed.
promoting to P1, as it impacts the rest piglit execution.
Created attachment 35551 [details] [review] Include instdone in hangcheck. This is a patch that I've been using to reduce the number of spurious errors.
Dropping priority, as far as we can tell this a bug in the hang-check spuriously firing for which the attached kernel patch should reduce the error rate.
The hangcheck change is now upstream. [gm45] Running the test case on the old compiler throws an error that it does not handle multiple returns from a function. So it appears that this residual will hopefully be fixed with the glsl2 compiler work, which is being tracked at bug 28748. Marking this bug as closed as the test case is now part of piglit (and the test suite) and is being tracked separately. Thanks for the bug report and the excellent test case!
I can confirm, it is working on GM45 with upstream mesa. Thank you!
Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.