Problem with dnsmasq getting killed.
Hi,
We have a openstack environment with 1 controller , 1 network node and many computes, there are multiple such environments some running grizzly and some running havana.
We are facing a common problem everywhere, the dnsmasq process dies on it own and we have to restart the dhcp agent or add the network back to the dhcp agent in case it dies.
We have a lease period of 4 hours and running a load of approx. 800+ VMs.
Has someone faced a similar problem if not any suggestions/pointers will really be helpful.
Can you pull the log data from DHCP and Neutron-Server. Do you see any errors when you match up timestamps with when the process fails? Are you using a single subnet for all 800 VMs?
Unfortunately there is no error in the logs....yes, we are using a single subnet which has multiple cidrs.