All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH net v4] net/smc: Reset conn->lgr when link group registration fails
@ 2022-01-06  6:44 Wen Gu
  2022-01-06 10:00 ` Karsten Graul
  0 siblings, 1 reply; 3+ messages in thread
From: Wen Gu @ 2022-01-06  6:44 UTC (permalink / raw)
  To: kgraul, davem, kuba; +Cc: linux-s390, netdev, linux-kernel

SMC connections might fail to be registered in a link group due to
unable to find a link to assign to during its creation. As a result,
connection creation will return a failure and most resources related
to the connection won't be applied or initialized, such as
conn->abort_work or conn->lnk.

If smc_conn_free() is invoked later, it will try to access the
resources related to the connection, which wasn't initialized, thus
causing a warning or crash.

This patch tries to fix this by resetting conn->lgr to NULL if an
abnormal exit occurs in smc_lgr_register_conn(), thus avoiding the
access to uninitialized resources in smc_conn_free().

Meanwhile, the new created link group should be terminated if smc
connections can't be registered in it. So smc_lgr_cleanup_early() is
modified to take care of link group only and invoked to terminate
unusable link group by smc_conn_create(). The call to smc_conn_free()
is moved out from smc_lgr_cleanup_early() to smc_conn_abort().

Fixes: 56bc3b2094b4 ("net/smc: assign link to a new connection")
Suggested-by: Karsten Graul <kgraul@linux.ibm.com>
Signed-off-by: Wen Gu <guwen@linux.alibaba.com>
---
v1->v2:
- Reset conn->lgr to NULL in smc_lgr_register_conn().
- Only free new created link group.
v2->v3:
- Using __smc_lgr_terminate() instead of smc_lgr_schedule_free_work()
  for an immediate free.
v3->v4:
- Modify smc_lgr_cleanup_early() and invoke it from smc_conn_create().
---
 net/smc/af_smc.c   |  7 ++++---
 net/smc/smc_core.c | 12 +++++++-----
 net/smc/smc_core.h |  2 +-
 3 files changed, 12 insertions(+), 9 deletions(-)

diff --git a/net/smc/af_smc.c b/net/smc/af_smc.c
index 230072f..f22f3ca 100644
--- a/net/smc/af_smc.c
+++ b/net/smc/af_smc.c
@@ -630,10 +630,11 @@ static int smc_connect_decline_fallback(struct smc_sock *smc, int reason_code,
 
 static void smc_conn_abort(struct smc_sock *smc, int local_first)
 {
+	struct smc_connection *conn = &smc->conn;
+
+	smc_conn_free(conn);
 	if (local_first)
-		smc_lgr_cleanup_early(&smc->conn);
-	else
-		smc_conn_free(&smc->conn);
+		smc_lgr_cleanup_early(conn->lgr);
 }
 
 /* check if there is a rdma device available for this connection. */
diff --git a/net/smc/smc_core.c b/net/smc/smc_core.c
index 412bc85..cd3c3b8 100644
--- a/net/smc/smc_core.c
+++ b/net/smc/smc_core.c
@@ -171,8 +171,10 @@ static int smc_lgr_register_conn(struct smc_connection *conn, bool first)
 
 	if (!conn->lgr->is_smcd) {
 		rc = smcr_lgr_conn_assign_link(conn, first);
-		if (rc)
+		if (rc) {
+			conn->lgr = NULL;
 			return rc;
+		}
 	}
 	/* find a new alert_token_local value not yet used by some connection
 	 * in this link group
@@ -622,15 +624,13 @@ int smcd_nl_get_lgr(struct sk_buff *skb, struct netlink_callback *cb)
 	return skb->len;
 }
 
-void smc_lgr_cleanup_early(struct smc_connection *conn)
+void smc_lgr_cleanup_early(struct smc_link_group *lgr)
 {
-	struct smc_link_group *lgr = conn->lgr;
 	spinlock_t *lgr_lock;
 
 	if (!lgr)
 		return;
 
-	smc_conn_free(conn);
 	smc_lgr_list_head(lgr, &lgr_lock);
 	spin_lock_bh(lgr_lock);
 	/* do not use this link group for new connections */
@@ -1835,8 +1835,10 @@ int smc_conn_create(struct smc_sock *smc, struct smc_init_info *ini)
 		write_lock_bh(&lgr->conns_lock);
 		rc = smc_lgr_register_conn(conn, true);
 		write_unlock_bh(&lgr->conns_lock);
-		if (rc)
+		if (rc) {
+			smc_lgr_cleanup_early(lgr);
 			goto out;
+		}
 	}
 	conn->local_tx_ctrl.common.type = SMC_CDC_MSG_TYPE;
 	conn->local_tx_ctrl.len = SMC_WR_TX_SIZE;
diff --git a/net/smc/smc_core.h b/net/smc/smc_core.h
index d63b082..73d0c35 100644
--- a/net/smc/smc_core.h
+++ b/net/smc/smc_core.h
@@ -468,7 +468,7 @@ static inline void smc_set_pci_values(struct pci_dev *pci_dev,
 struct smc_sock;
 struct smc_clc_msg_accept_confirm;
 
-void smc_lgr_cleanup_early(struct smc_connection *conn);
+void smc_lgr_cleanup_early(struct smc_link_group *lgr);
 void smc_lgr_terminate_sched(struct smc_link_group *lgr);
 void smcr_port_add(struct smc_ib_device *smcibdev, u8 ibport);
 void smcr_port_err(struct smc_ib_device *smcibdev, u8 ibport);
-- 
1.8.3.1


^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH net v4] net/smc: Reset conn->lgr when link group registration fails
  2022-01-06  6:44 [PATCH net v4] net/smc: Reset conn->lgr when link group registration fails Wen Gu
@ 2022-01-06 10:00 ` Karsten Graul
  2022-01-06 11:33   ` Wen Gu
  0 siblings, 1 reply; 3+ messages in thread
From: Karsten Graul @ 2022-01-06 10:00 UTC (permalink / raw)
  To: Wen Gu, davem, kuba; +Cc: linux-s390, netdev, linux-kernel

On 06/01/2022 07:44, Wen Gu wrote:
> @@ -630,10 +630,11 @@ static int smc_connect_decline_fallback(struct smc_sock *smc, int reason_code,
>  
>  static void smc_conn_abort(struct smc_sock *smc, int local_first)
>  {
> +	struct smc_connection *conn = &smc->conn;
> +
> +	smc_conn_free(conn);
>  	if (local_first)
> -		smc_lgr_cleanup_early(&smc->conn);
> -	else
> -		smc_conn_free(&smc->conn);
> +		smc_lgr_cleanup_early(conn->lgr);
>  }

Looks like I missed a prereq patch here, but wo'nt conn->lgr be set to NULL
after smc_conn_free() called smc_lgr_unregister_conn()?

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH net v4] net/smc: Reset conn->lgr when link group registration fails
  2022-01-06 10:00 ` Karsten Graul
@ 2022-01-06 11:33   ` Wen Gu
  0 siblings, 0 replies; 3+ messages in thread
From: Wen Gu @ 2022-01-06 11:33 UTC (permalink / raw)
  To: Karsten Graul, davem, kuba; +Cc: linux-s390, netdev, linux-kernel



On 2022/1/6 6:00 pm, Karsten Graul wrote:

> Looks like I missed a prereq patch here, but wo'nt conn->lgr be set to NULL
> after smc_conn_free() called smc_lgr_unregister_conn()?

Right... I should hold a local copy of lgr in smc_conn_abort().

My another RFC patch removes 'conn->lgr = NULL' from smc_lgr_unregister_conn(),
so I make a mistake here...

I will fix this. Thank you.

Wen Gu

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2022-01-06 11:33 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-01-06  6:44 [PATCH net v4] net/smc: Reset conn->lgr when link group registration fails Wen Gu
2022-01-06 10:00 ` Karsten Graul
2022-01-06 11:33   ` Wen Gu

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.