drm/amdgpu: only harvest gcea/mmea error status in arcturus
author Hawking Zhang <Hawking.Zhang@amd.com>
Fri, 16 Apr 2021 09:34:13 +0000 (17:34 +0800)
committer Alex Deucher <alexander.deucher@amd.com>
Wed, 21 Apr 2021 01:35:45 +0000 (21:35 -0400)
commit 53ee6609b42e09f89bf2cdd15a340c236694ecd3
tree c7cf82ceedef56a55b750b0cc1f5b9b6bcf58601
parent 9406d39bb6ef11e8525d7bd9acfcba5708db485b

An SDP RdRspStatus/WrRspStatus error, or the first parity error on
RdRsp data, can cause a system fatal error in arcturus.
The GPU will be frozen in such a case.

The driver needs to harvest this error information before
resetting the GPU. Check the error type to avoid harvesting normal
gcea/mmea information.
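
The check described above can be sketched in standalone C as follows. This is an illustration only, not the code from this commit: the bit positions and widths are hypothetical (the real masks and shifts are defined in gc_9_4_1_sh_mask.h and are not reproduced here), and the names `get_field`, `ea_status_is_fatal` and `harvest_ea_status` are invented for the example. The idea is to report a gcea/mmea status value only when one of the three fatal error fields is non-zero.

```c
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical field layout, for illustration only. The real masks and
 * shifts come from the ASIC register headers and may differ.
 */
#define SDP_RDRSP_STATUS_SHIFT            0
#define SDP_RDRSP_STATUS_MASK             0x0000000FU
#define SDP_WRRSP_STATUS_SHIFT            4
#define SDP_WRRSP_STATUS_MASK             0x000000F0U
#define SDP_RDRSP_DATAPARITY_ERROR_SHIFT  8
#define SDP_RDRSP_DATAPARITY_ERROR_MASK   0x00000100U

/* Extract one bit field from a raw register value. */
static uint32_t get_field(uint32_t reg, uint32_t mask, unsigned int shift)
{
	return (reg & mask) >> shift;
}

/* True only if one of the three fatal error fields is set. */
static bool ea_status_is_fatal(uint32_t reg)
{
	return get_field(reg, SDP_RDRSP_STATUS_MASK, SDP_RDRSP_STATUS_SHIFT) ||
	       get_field(reg, SDP_WRRSP_STATUS_MASK, SDP_WRRSP_STATUS_SHIFT) ||
	       get_field(reg, SDP_RDRSP_DATAPARITY_ERROR_MASK,
			 SDP_RDRSP_DATAPARITY_ERROR_SHIFT);
}

/*
 * Report the status value only when it carries a fatal error, so that
 * normal (non-error) status contents are not harvested.
 * Returns true if the value was reported.
 */
static bool harvest_ea_status(const char *block, uint32_t reg)
{
	if (!ea_status_is_fatal(reg))
		return false;

	printf("%s error status: 0x%08x\n", block, (unsigned int)reg);
	return true;
}
```

With this layout, `harvest_ea_status("gcea", 0x100)` reports the value because the parity error bit is set, while a value with none of the three fields set is skipped.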

Signed-off-by: Hawking Zhang <Hawking.Zhang@amd.com>
Reviewed-by: Stanley Yang <Stanley.Yang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/gfx_v9_4.c
drivers/gpu/drm/amd/amdgpu/mmhub_v9_4.c
drivers/gpu/drm/amd/include/asic_reg/gc/gc_9_4_1_sh_mask.h