Bug 94691 - [SKL] GPU HANG: ecode 9:0:0x87f93ff9
Summary: [SKL] GPU HANG: ecode 9:0:0x87f93ff9
Status: RESOLVED INVALID
Alias: None
Product: Mesa
Classification: Unclassified
Component: Drivers/DRI/i965 (show other bugs)
Version: git
Hardware: x86-64 (AMD64) Linux (All)
: medium major
Assignee: Intel 3D Bugs Mailing List
QA Contact: Intel 3D Bugs Mailing List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-03-24 23:12 UTC by Markus Schauler
Modified: 2017-02-10 22:38 UTC (History)
2 users (show)

See Also:
i915 platform: SKL
i915 features: GPU hang


Attachments
GPU crashdump (40.82 KB, text/plain)
2016-03-24 23:12 UTC, Markus Schauler
Details

Description Markus Schauler 2016-03-24 23:12:38 UTC
Created attachment 122528 [details]
GPU crashdump

hardware: intel-next kernel (commit f5d413cccefa1f93d64c34f357151d42add63a84
)

on Intel NUC6i5 (skylake) hardware, with newest firmware (DMC fw version: 1.26)

/proc/cpuinfo:

processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 78
model name      : Intel(R) Core(TM) i5-6260U CPU @ 1.80GHz
stepping        : 3
microcode       : 0x6a
cpu MHz         : 499.992
cache size      : 4096 KB
physical id     : 0
siblings        : 4
core id         : 0
cpu cores       : 2
apicid          : 0
initial apicid  : 0
fpu             : yes
fpu_exception   : yes
cpuid level     : 22
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch epb intel_pt tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt xsaveopt xsavec xgetbv1 dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp
bugs            :
bogomips        : 3600.33
clflush size    : 64
cache_alignment : 64
address sizes   : 39 bits physical, 48 bits virtual
power management:

GPU hangs/crashes frequently, journal message is:

Mär 24 23:46:37 linux.suse kernel: [drm] stuck on render ring
Mär 24 23:46:37 linux.suse kernel: [drm] GPU HANG: ecode 9:0:0x87f93ff9, in plasmashell [2427], reason: Engine(s) hung, action: reset
Mär 24 23:46:37 linux.suse kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
Mär 24 23:46:37 linux.suse kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
Mär 24 23:46:37 linux.suse kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
Mär 24 23:46:37 linux.suse kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
Mär 24 23:46:37 linux.suse kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
Mär 24 23:46:37 linux.suse kernel: drm/i915: Resetting chip after gpu hang
Comment 1 yann 2016-09-05 10:33:05 UTC
Assigning to Mesa product.

From this error dump, hung is happening in render ring batch with active head
at 0xfe69135c, with 0x78260000 (3DSTATE_BINDING_TABLE_POINTERS_VS) as IPEHR.

Batch extract (around 0xfe69135c):

0xfe691328:      0x78170009: 3D UNKNOWN: 3d_965 opcode = 0x7817
0xfe69132c:      0x00000000: MI_NOOP
0xfe691330:      0x00000000: MI_NOOP
0xfe691334:      0x00000000: MI_NOOP
0xfe691338:      0x00000000: MI_NOOP
0xfe69133c:      0x00000000: MI_NOOP
0xfe691340:      0x00000000: MI_NOOP
0xfe691344:      0x00000000: MI_NOOP
0xfe691348:      0x00000000: MI_NOOP
0xfe69134c:      0x00000000: MI_NOOP
0xfe691350:      0x00000000: MI_NOOP
0xfe691354:      0x78260000: 3DSTATE_BINDING_TABLE_POINTERS_VS
0xfe691358:      0x00000000:    dword 1
0xfe69135c:      0x782a0000: 3DSTATE_BINDING_TABLE_POINTERS_PS
0xfe691360:      0x00006840:    dword 1
Bad length 9 in 3DSTATE_VS, expected 6-6
0xfe691364:      0x78100007: 3DSTATE_VS
0xfe691368:      0x00000740:    kernel pointer
0xfe69136c:      0x00000000:    SPF=0, VME=0, Sampler Count 0, Binding table count 0
0xfe691370:      0x00000000:    scratch offset
0xfe691374:      0x00000000:    Dispatch GRF start 0, VUE read length 0, VUE read offset 0
0xfe691378:      0x00000000:    Max Threads 1, Vertex Cache enable, VS func disable

Moreover, since your kernel is 4.5.0-3.m1+, you may also consider to upgrade your kernel since there are additional workarounds for skl.
Comment 2 Matt Turner 2016-11-04 00:41:02 UTC
Please test a new version of Mesa (12 or 13) and mark as REOPENED
if you can reproduce and RESOLVED/* if you cannot reproduce.
Comment 3 Annie 2017-02-10 22:38:34 UTC
Dear Reporter,

This Mesa bug has been in the "NEEDINFO" status for over 60 days. I am closing this bug based on lack of response but feel free to reopen if resolution is still needed. Please ensure you're supplying the correct information as requested.

Thank you.


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.