net/mlx5e: Add recovery flow for tx devlink health reporter for unhealthy PTP SQ
author Rahul Rameshbabu <rrameshbabu@nvidia.com>
Wed, 9 Aug 2023 04:10:21 +0000 (21:10 -0700)
committer Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Tue, 28 Nov 2023 17:14:57 +0000 (17:14 +0000)
commit 647eb36aaeca8ea1ccd5e89674cc5d5080390dee
tree 30091a9bb4270bd425f705dec5d0aac02ff048d9
parent e729382c297e2c492ff2a260aa1f23183eadae2e

[ Upstream commit 53b836a44db4259b94ffcfff321fb3d63f976b76 ]

A new check for the tx devlink health reporter is introduced for
determining when the PTP port timestamping SQ is considered unhealthy. If
there are enough CQEs considered never to be delivered, the space that can
be utilized on the SQ decreases significantly, impacting performance and
usability of the SQ. The health reporter is triggered when the number of
likely never delivered port timestamping CQEs that utilize the space of the
PTP SQ is greater than 93.75% of the total capacity of the SQ. A devlink
health reporter recover method is also provided for this specific TX error
context that restarts the PTP SQ.
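
The 93.75% figure above equals 15/16 of the SQ capacity, so the check can be done with integer arithmetic alone. The following is a minimal standalone sketch of such a check, not the driver's code: the struct `ptp_sq_sketch`, its fields, and both helper names are hypothetical, and the capacity is assumed to be a power of two of at least 16.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical bookkeeping for illustration; not the mlx5e data structures. */
struct ptp_sq_sketch {
	uint32_t capacity;    /* total SQ entries; assumed a power of two >= 16 */
	uint32_t undelivered; /* entries held by CQEs presumed never to arrive */
};

/* 93.75% is 15/16, so the threshold is (capacity / 16) * 15. */
static inline uint32_t ptp_sq_unhealthy_threshold(const struct ptp_sq_sketch *sq)
{
	return (sq->capacity >> 4) * 15;
}

/* Unhealthy only when the count is strictly greater than the threshold. */
static inline bool ptp_sq_is_unhealthy(const struct ptp_sq_sketch *sq)
{
	return sq->undelivered > ptp_sq_unhealthy_threshold(sq);
}
```

Under these assumptions, an SQ with 1024 entries has a threshold of 960: a count of 960 is still treated as healthy, and 961 is the first value that would trigger the reporter.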

Signed-off-by: Rahul Rameshbabu <rrameshbabu@nvidia.com>
Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
Stable-dep-of: 92214be5979c ("net/mlx5e: Update doorbell for port timestamping CQ before the software counter")
Signed-off-by: Sasha Levin <sashal@kernel.org>
Documentation/networking/devlink/mlx5.rst
drivers/net/ethernet/mellanox/mlx5/core/en/health.h
drivers/net/ethernet/mellanox/mlx5/core/en/ptp.c
drivers/net/ethernet/mellanox/mlx5/core/en/ptp.h
drivers/net/ethernet/mellanox/mlx5/core/en/reporter_tx.c