[rds-devel] what event is delivered when HCA is hot plugged ?

Richard Frank richard.frank at oracle.com
Wed Aug 13 08:02:06 PDT 2008


Posting to rds-dev...

Pradeep, do you also have patches to fix these issues - and can you test 
this solution with hot-plug capability ?

Pradeep wrote:
>
> These are two different issues:
>
> 1. rds_ib_remove_one() is called in case the HCA is going
>   away - due to a PCI fatal error or an admin initiated hotplug
>   remove. rds_ib_remove_one() doesn't seem to cleanup connections
>   that are created on that device. RDS knows the IP addresses that are
>   assigned to the IB device through rds_ibdev->ipaddr_list.
>   So it isn't too difficult to cleanup connections that belong
>   to these IPs.
>
> 2. Currently RDS initiates a reconnect in case of connection
>   failure (an error on the QP). In case of port down, this will
>   happen after the IB transport timeout. I was wondering whether
>   RDS needs to delay it to that time or can it start the reconnect
>   upon a PORT_DOWN event from IB stack. I don't know how OFED stack
>   delivers this event. But I'm sure the HCA driver gets a PORT_DOWN
>   event.
>
> -Pradeep



More information about the rds-devel mailing list