OSDN Git Service

habanalabs/gaudi2: reset device upon critical ECC event
authorOfir Bitton <obitton@habana.ai>
Tue, 28 Jun 2022 05:34:28 +0000 (08:34 +0300)
committerOded Gabbay <ogabbay@kernel.org>
Tue, 12 Jul 2022 06:09:28 +0000 (09:09 +0300)
commita85e389a845825a1ed3d26dd95fe24d5ad71531d
tree244559480662fce6d8132120c7e13d5c0f0a637a
parent6b4e8a12b2b9a5385b89d25ac450deddb8ed9a62
habanalabs/gaudi2: reset device upon critical ECC event

Correctable ECC events are not fatal, but as they accumulate, the f/w
can decide that a hard-rest is required. This indication is
propagated to the host using the existing ECC event interface.

Signed-off-by: Ofir Bitton <obitton@habana.ai>
Reviewed-by: Oded Gabbay <ogabbay@kernel.org>
Signed-off-by: Oded Gabbay <ogabbay@kernel.org>
drivers/misc/habanalabs/gaudi2/gaudi2.c
drivers/misc/habanalabs/include/common/cpucp_if.h