Finding error using ServerGroups

I am trying to set up an anti-affinity server group and am running into an error.

Basic 3-node system: controller1, compute1, compute2, running Mitaka on Ubuntu 16.04.1.

+----+------------------+-------------+----------+---------+-------+----------------------------+-----------------+
| Id | Binary           | Host        | Zone     | Status  | State | Updated_at                 | Disabled Reason |
+----+------------------+-------------+----------+---------+-------+----------------------------+-----------------+
| 11 | nova-consoleauth | controller1 | internal | enabled | up    | 2016-09-13T18:06:08.000000 | -               |
| 12 | nova-scheduler   | controller1 | internal | enabled | up    | 2016-09-13T18:06:01.000000 | -               |
| 14 | nova-conductor   | controller1 | internal | enabled | up    | 2016-09-13T18:06:09.000000 | -               |
| 19 | nova-compute     | compute1    | nova     | enabled | up    | 2016-09-13T18:06:07.000000 | -               |
| 20 | nova-compute     | compute2    | nova     | enabled | up    | 2016-09-13T18:06:09.000000 | -               |
+----+------------------+-------------+----------+---------+-------+----------------------------+-----------------+

Compute1 & 2 info:

+---------------------------+------------------------------------------+
| Property                  | Value                                    |
+---------------------------+------------------------------------------+
| cpu_info_arch             | x86_64                                   |
| cpu_info_features         | ["pge", "clflush", "sep", "syscall",     |
|                           | "vme", "dtes64", "msr", "vmx", "xtpr",   |
|                           | "cmov", "pbe", "est", "pat", "monitor",  |
|                           | "lm", "tsc", "nx", "fxsr", "tm",         |
|                           | "sse4.1", "pae", "sse4.2", "acpi",       |
|                           | "mmx", "cx8", "mce", "de", "tm2", "ht",  |
|                           | "dca", "lahf_lm", "pdcm", "mca", "apic", |
|                           | "sse", "pse", "ds", "invtsc", "pni",     |
|                           | "rdtscp", "sse2", "ss", "ds_cpl",        |
|                           | "ssse3", "fpu", "cx16", "pse36", "mtrr", |
|                           | "popcnt"]                                |
| cpu_info_model            | Nehalem                                  |
| cpu_info_topology_cells   | 2                                        |
| cpu_info_topology_cores   | 4                                        |
| cpu_info_topology_sockets | 1                                        |
| cpu_info_topology_threads | 2                                        |
| cpu_info_vendor           | Intel                                    |
| current_workload          | 0                                        |
| disk_available_least      | 93                                       |
| free_disk_gb              | 101                                      |
| free_ram_mb               | 31647                                    |
| host_ip                   | 10.0.2.129                               |
| hypervisor_hostname       | compute1                                 |
| hypervisor_type           | QEMU                                     |
| hypervisor_version        | 2005000                                  |
| id                        | 1                                        |
| local_gb                  | 101                                      |
| local_gb_used             | 0                                        |
| memory_mb                 | 32159                                    |
| memory_mb_used            | 512                                      |
| running_vms               | 0                                        |
| service_disabled_reason   | None                                     |
| service_host              | compute1                                 |
| service_id                | 19                                       |
| state                     | up                                       |
| status                    | enabled                                  |
| vcpus                     | 16                                       |
| vcpus_used                | 0                                        |
+---------------------------+------------------------------------------+

+---------------------------+------------------------------------------+
| Property                  | Value                                    |
+---------------------------+------------------------------------------+
| cpu_info_arch             | x86_64                                   |
| cpu_info_features         | ["pge", "clflush", "sep", "syscall",     |
|                           | "vme", "dtes64", "msr", "vmx", "xtpr",   |
|                           | "cmov", "pbe", "est", "pat", "monitor",  |
|                           | "lm", "tsc", "nx", "fxsr", "tm",         |
|                           | "sse4.1", "pae", "sse4.2", "acpi",       |
|                           | "mmx", "cx8", "mce", "de", "tm2", "ht",  |
|                           | "dca", "lahf_lm", "pdcm", "mca", "apic", |
|                           | "sse", "pse", "ds", "invtsc", "pni",     |
|                           | "rdtscp", "sse2", "ss", "ds_cpl",        |
|                           | "ssse3", "fpu", "cx16", "pse36", "mtrr", |
|                           | "popcnt"]                                |
| cpu_info_model            | Nehalem                                  |
| cpu_info_topology_cells   | 2                                        |
| cpu_info_topology_cores   | 4                                        |
| cpu_info_topology_sockets | 1                                        |
| cpu_info_topology_threads | 2                                        |
| cpu_info_vendor           | Intel                                    |
| current_workload          | 0                                        |
| disk_available_least      | 85                                       |
| free_disk_gb              | 93                                       |
| free_ram_mb               | 30623                                    |
| host_ip                   | 10.0.2.130                               |
| hypervisor_hostname       | compute2                                 |
| hypervisor_type           | QEMU                                     |
| hypervisor_version        | 2005000                                  |
| id                        | 2                                        |
| local_gb                  | 101                                      |
| local_gb_used             | 8                                        |
| memory_mb                 | 32159                                    |
| memory_mb_used            | 1536                                     |
| running_vms               | 1                                        |
| service_disabled_reason   | None                                     |
| service_host              | compute2                                 |
| service_id                | 20                                       |
| state                     | up                                       |
| status                    | enabled                                  |
| vcpus                     | 16                                       |
| vcpus_used                | 2                                        |
+---------------------------+------------------------------------------+

Setting up the anti-affinity group is pretty straightforward:

+--------------------------------------+-----------------------+----------------------------------+----------------------------------+--------------------+-------------------------------------------+----------+
| Id                                   | Name                  | Project Id                       | User Id                          | Policies           | Members                                   | Metadata |
+--------------------------------------+-----------------------+----------------------------------+----------------------------------+--------------------+-------------------------------------------+----------+
| 238f300e-a115-410b-b47b-669f9bad6cac | gr-WebServer-Anti-Aff | 726978d9d83b4c959680d964f0e920e3 | eaf9e32b26d24246aa6ffcfb12981726 | [u'anti-affinity'] | [u'b596c343-9348-43c5-b9e9-918acb050f4e'] | {}       |
+--------------------------------------+-----------------------+----------------------------------+----------------------------------+--------------------+-------------------------------------------+----------+

But then it breaks:

+--------------------------------------+---------------+--------+------------+-------------+---------------------------+
| ID                                   | Name          | Status | Task State | Power State | Networks                  |
+--------------------------------------+---------------+--------+------------+-------------+---------------------------+
| 5fc8ea14-85b8-483e-a938-cb0976fe3608 | webserver-ext | ERROR  | -          | NOSTATE     |                           |
| b596c343-9348-43c5-b9e9-918acb050f4e | webserver-int | ACTIVE | -          | Running     | Troy_PP_Grn-net=10.0.32.5 |
+--------------------------------------+---------------+--------+------------+-------------+---------------------------+

Relevant lines from nova scheduler log:

2016-09-13 11:02:46.421 12451 INFO nova.filters [req-ea9e7c8c-7589-4209-be64-c5852950b71e eaf9e32b26d24246aa6ffcfb12981726 726978d9d83b4c959680d964f0e920e3 - - -] Filter ServerGroupAntiAffinityFilter returned 0 hosts
2016-09-13 11:02:46.422 12451 INFO nova.filters [req-ea9e7c8c-7589-4209-be64-c5852950b71e eaf9e32b26d24246aa6ffcfb12981726 726978d9d83b4c959680d964f0e920e3 - - -] Filtering removed all hosts for the request with instance ID '5fc8ea14-85b8-483e-a938-cb0976fe3608'. Filter results: ['RetryFilter: (start: 2, end: 1)', 'AvailabilityZoneFilter: (start: 1, end: 1)', 'RamFilter: (start: 1, end: 1)', 'DiskFilter: (start: 1, end: 1)', 'ComputeFilter: (start: 1, end: 1)', 'ComputeCapabilitiesFilter: (start: 1, end: 1)', 'ImagePropertiesFilter: (start: 1, end: 1)', 'ServerGroupAntiAffinityFilter: (start: 1, end: 0)']
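
To make the filter summary in that second log line easier to read, here is a small Python sketch that lists which filters actually reduced the host count. It is only an illustration, not part of nova: the function names `parse_filter_results` and `filters_that_dropped_hosts` are hypothetical, and the input list is the "Filter results" summary from the scheduler log above.

```python
import re

# "Filter results" entries taken from the scheduler log line above.
FILTER_RESULTS = [
    "RetryFilter: (start: 2, end: 1)",
    "AvailabilityZoneFilter: (start: 1, end: 1)",
    "RamFilter: (start: 1, end: 1)",
    "DiskFilter: (start: 1, end: 1)",
    "ComputeFilter: (start: 1, end: 1)",
    "ComputeCapabilitiesFilter: (start: 1, end: 1)",
    "ImagePropertiesFilter: (start: 1, end: 1)",
    "ServerGroupAntiAffinityFilter: (start: 1, end: 0)",
]

# One entry looks like "<FilterName>: (start: <n>, end: <m>)".
ENTRY_PATTERN = re.compile(
    r"^(?P<name>\w+): \(start: (?P<start>\d+), end: (?P<end>\d+)\)$"
)


def parse_filter_results(entries):
    """Return a list of (filter_name, hosts_before, hosts_after) tuples."""
    parsed = []
    for entry in entries:
        match = ENTRY_PATTERN.match(entry)
        if match is None:
            raise ValueError(f"unrecognised entry: {entry!r}")
        parsed.append(
            (match["name"], int(match["start"]), int(match["end"]))
        )
    return parsed


def filters_that_dropped_hosts(parsed):
    """Return (filter_name, hosts_removed) for filters that removed hosts."""
    return [
        (name, before - after)
        for name, before, after in parsed
        if after < before
    ]


dropped = filters_that_dropped_hosts(parse_filter_results(FILTER_RESULTS))
for name, removed in dropped:
    print(f"{name} removed {removed} host(s)")
```

Run against the summary above, this reports two filters that removed a host each: RetryFilter (2 hosts down to 1) and ServerGroupAntiAffinityFilter (1 host down to 0).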

I started these two instances without the anti-affinity server group and they happily started on compute1 & compute2. (They are the only instances so far, so they split between the two.)

I'm new to OpenStack, so I suspect user error is causing this issue, but if someone could point me in the right direction I'd appreciate it.

Thank you