-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[20191130.82] intfmgrd failed to assign IPv4 address to PortChannel #10336
Comments
Error logs
|
Update: We also observed the auto-restart testing failure of
the failure is due to the race condition between processes
|
Update: See the same race condition issue on device After
Syslog messages are as following:
|
From logs I see this is a completely different scenario where the portchannels were not cleaned up on exit
So can we try updating the autorestart testcase to use SIGTERM and see the behavior ? |
@yozhao101 can you please try Judy's suggestion? Thanks. |
This issue still exists and I observe it on all branches (I checked on "master", "202205", "202012"). |
Also observed on 202211. Can be recoverable through a config reload or reboot. |
Observed in 202311 as well. |
Description
Recently we observed the testing of
teamd
auto-restart feature consistently failed onstr-s6000-on-2
in our nightly test. The failure is due to the BGP sessions with 5 T2 upstream VMs were not up.When we did RCA for this BGP session issue, BGP sessions were not up is due to there is no IPv4 address assigned to these 5 PortChannels. We found the root cause is there is a race condition between the processes
intfmgrd
fromSwSS
andteammgrd
fromteamd
. Details are as following:intfmgrd
will assign an IPv4 address to each PortChannel whileteammgrd
will bring up each PortChannel and add interfaces into each PortChannel.If
teammgrd
finishes its job first, thenintfmgrd
can successfully assign IPv4 address to each PortChannel. However, ifintfmgrd
did its job first beforeteammgrd
brings PortChannel up and adds interfaces into it, then there will be no IPv4 address assigned to each PortChannel.We attach the syslog file from device for reference and Bing also shared the error messages from device.
Steps to reproduce the issue:
autorestart
feature was enabled forteamd
on device by running commandshow feature status
.If it was not enabled, we can issue the command
sudo config feature autorestart teamd enabled
to enable it.teamsyncd
orteammgrd
inteamd
container by running thesudo kill -SIGKILL <pid>
command.show ip bgp summary
to see whether BGP sessions with T2 devices are up.ifconfig PortChannel102
to see whether IPv4 address was assigned to that PortChannel.Describe the results you received:
Describe the results you expected:
Output of
show version
:Output of
show techsupport
:Additional information you deem important (e.g. issue happens only occasionally):
syslog.txt
The text was updated successfully, but these errors were encountered: