Skip to content

[BUG] AlmaLinux 8 support #3052

@filipeferreira89

Description

@filipeferreira89

Describe the bug: A clear and concise description of what the bug is.

Even though RHEL 8 and AlmaLinux are almost identical twins, there is no official support yet for AlmaLinux 8, whoever it's already supporting AlmaLinux 9.

We identified a bug with the WALinuxAgent 2.9.1.1 version on AlmaLinux 8 VMs. Sometimes, when we trigger a change on the hostname of the virtual machine with the following command hostnamectl set-hostname testvm000008a, WALinuxAgent triggers a restart of the NetworkManager Service and the eth0 can enter in a failed state and the vm loses any routes.

waagent.log

INFO EnvHandler ExtHandler EnvMonitor: Detected hostname change: testvm000008A -> testvm000008a
INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
INFO EnvHandler ExtHandler Primary interface is [eth0]
INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
INFO EnvHandler ExtHandler Primary interface is [eth0]
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8459] dhcp: init: Using DHCP client 'internal'
Feb  7 18:00:08 testvm000008A waagent[10128]: 2024-02-07T18:00:08.849389Z INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
Feb  7 18:00:08 testvm000008A waagent[10128]: 2024-02-07T18:00:08.849802Z INFO EnvHandler ExtHandler Primary interface is [eth0]
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8459] device (lo): carrier: link connected
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8461] manager: (lo): new Generic device (/org/freedesktop/NetworkManager/Devices/1)
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8468] device (eth0): carrier: link connected
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8473] manager: (eth0): new Ethernet device (/org/freedesktop/NetworkManager/Devices/2)
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8477] manager: (eth0): assume: will attempt to assume matching connection 'System eth0' (5fb06bd0-0bb0-7ffb-45f1-d6edd65f3e03) (indicated)
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8478] device (eth0): state change: unmanaged -> unavailable (reason 'connection-assumed', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8483] device (eth0): state change: unavailable -> disconnected (reason 'connection-assumed', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8490] device (eth0): Activation: starting connection 'System eth0' (5fb06bd0-0bb0-7ffb-45f1-d6edd65f3e03)
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8493] device (eth0): state change: disconnected -> prepare (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8495] device (eth0): state change: prepare -> config (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8497] device (eth0): state change: config -> ip-config (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8500] dhcp4 (eth0): activation: beginning transaction (timeout in 300 seconds)
Feb  7 18:00:08 testvm000008A waagent[10128]: 2024-02-07T18:00:08.856208Z INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
Feb  7 18:00:08 testvm000008A waagent[10128]: 2024-02-07T18:00:08.856608Z INFO EnvHandler ExtHandler Primary interface is [eth0]
Feb  7 18:00:08 testvm000008A dbus-daemon[837]: [system] Activating via systemd: service name='org.freedesktop.resolve1' unit='dbus-org.freedesktop.resolve1.service' requested by ':1.51' (uid=0 pid=35175 comm="/usr/sbin/NetworkManager --no-daemon " label="system_u:system_r:NetworkManager_t:s0")
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8589] dhcp4 (eth0): state changed new lease, address=AAA.BBB.CCC.DDD
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.8593] policy: set 'System eth0' (eth0) as default for IPv4 routing and DNS
Feb  7 18:00:08 testvm000008A kernel: IPv4: martian source AAA.BBB.CCC.DDD from 169.254.169.254, on dev eth0
Feb  7 18:00:08 testvm000008A kernel: ll header: 00000000: 00 0d 3a 29 d6 67 12 34 56 78 9a bc 08 00        ..:).g.4Vx....
Feb  7 18:00:08 testvm000008A kernel: IPv4: martian source AAA.BBB.CCC.DDD from 169.254.169.254, on dev eth0
Feb  7 18:00:08 testvm000008A kernel: ll header: 00000000: 00 0d 3a 29 d6 67 12 34 56 78 9a bc 08 00        ..:).g.4Vx....
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9113] device (eth0): state change: ip-config -> ip-check (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9134] device (eth0): state change: ip-check -> secondaries (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9136] device (eth0): state change: secondaries -> activated (reason 'none', sys-iface-state: 'assume')
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9138] manager: NetworkManager state is now CONNECTED_SITE
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9140] device (eth0): Activation: successful, device activated.
Feb  7 18:00:08 testvm000008A NetworkManager[35175]: <info>  [1707328808.9144] manager: startup complete
Feb  7 18:00:08 testvm000008A systemd[1]: Started Network Manager Wait Online.
Feb  7 18:00:08 testvm000008A systemd[1]: nm-cloud-setup.service: Succeeded.
Feb  7 18:00:08 testvm000008A systemd[1]: Stopped Automatically configure NetworkManager in cloud.
Feb  7 18:00:08 testvm000008A systemd[1]: Starting Automatically configure NetworkManager in cloud...
Feb  7 18:00:08 testvm000008A systemd[1]: iscsi.service: Unit cannot be reloaded because it is inactive.
Feb  7 18:00:16 testvm000008A systemd[1]: nm-cloud-setup.service: Succeeded.
Feb  7 18:00:16 testvm000008A systemd[1]: Started Automatically configure NetworkManager in cloud.
Feb  7 18:00:19 testvm000008A systemd[1]: NetworkManager-dispatcher.service: Succeeded.
Feb  7 18:00:38 testvm000008A systemd[1]: systemd-hostnamed.service: Succeeded.
Feb  7 18:00:42 testvm000008A waagent[10128]: 2024-02-07T18:00:42.974630Z ERROR ExtHandler ExtHandler Error fetching the goal state: [ProtocolError] [Wireserver Exception] [HttpError] [HTTP Failed] GET http://168.63.129.16/machine/ -- IOError [Errno 101] Network is unreachable -- 6 attempts made
2024-02-04T14:56:07.661404Z INFO EnvHandler ExtHandler EnvMonitor: Detected hostname change: testvm000008A -> testvm000008a
2024-02-07T14:56:07.787131Z INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
2024-02-07T14:56:07.787731Z INFO EnvHandler ExtHandler Primary interface is [eth0]
2024-02-07T14:56:07.796409Z INFO EnvHandler ExtHandler Examine /proc/net/route for primary interface
2024-02-07T14:56:07.796754Z INFO EnvHandler ExtHandler Primary interface is [eth0]

Note: Please add some context which would help us understand the problem better

  1. Section of the log where the error occurs.
  2. Serial console output
  3. Steps to reproduce the behavior.

Distro and WALinuxAgent details (please complete the following information):

  • Distro and Version: AlmaLinux 9
  • WALinuxAgent version 2.9.1.1

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions