OSDN Git Service

mlxsw: core: Add validation of transceiver temperature thresholds
authorVadim Pasternak <vadimp@nvidia.com>
Fri, 8 Jan 2021 14:52:09 +0000 (16:52 +0200)
committerJakub Kicinski <kuba@kernel.org>
Sun, 10 Jan 2021 00:25:10 +0000 (16:25 -0800)
commit57726ebe2733891c9f59105eff028735f73d05fb
treefad00833e299e3aa8fdd94034529329181ee015e
parentb77413446408fdd256599daf00d5be72b5f3e7c6
mlxsw: core: Add validation of transceiver temperature thresholds

Validate thresholds to avoid a single failure due to some transceiver
unreliability. Ignore the last readouts in case warning temperature is
above alarm temperature, since it can cause unexpected thermal
shutdown. Stay with the previous values and refresh threshold within
the next iteration.

This is a rare scenario, but it was observed at a customer site.

Fixes: 6a79507cfe94 ("mlxsw: core: Extend thermal module with per QSFP module thermal zones")
Signed-off-by: Vadim Pasternak <vadimp@nvidia.com>
Reviewed-by: Jiri Pirko <jiri@nvidia.com>
Signed-off-by: Ido Schimmel <idosch@nvidia.com>
Signed-off-by: Jakub Kicinski <kuba@kernel.org>
drivers/net/ethernet/mellanox/mlxsw/core_thermal.c