Having some issues getting Advanced Recovery going. Got two servers at two different locations but they are directly on the Internet using the system firewall. Did the SSH key generation between the servers and verified that I can SSH using the keys and login without a password.
When I try to run the quick configuration wizard all it does it tells me there is an SSH issue. The wiki doesn’t have a lot of FAQ on possible errors/steps. I’m curious if anyone else has had this issue and if there was a work around for it? Or perhaps there is a step not outlined in the wiki that needs to be done somewhere else (like Filestore, etc) though I figured a wizard would be dealing with those things.
I’ve also disabled the firewalls on each server and had the same issue. Each time following the “verify SSH” steps in the wiki SSH is verified at the system level but the module continues to say “SSH issue”
Well I’ve done some more testing and this is what I’ve come up with.
First, I created a Warm Spare backup on the primary server, did all the needed steps. Tried to do a backup, it failed. Couldn’t find /var/spool/asterisk/backup on the secondary server. I looked and sure enough:
OK cool, the directory exists. So now I try to do the backup again. I get another error. While this time the backup was sent to the secondary server the restore process failed to work. So I could to validate things:
-rw-r--r-- 1 root root 717387 Feb 18 16:53 /var/spool/asterisk/backup/20210218-115305-1613667185-15.0.17.17-193674846.tar.gz
Backup exists. So then I changed the restore method from SSH to API, put in all the API details. Run a new backup, the file is transferred by SSH and the API triggers the restore process, it works fine.
So basically, the primary can log into the secondary via SSH keys from both the CLI and via the Warm Spare Backup process. I still am getting SSH errors from Advanced Recovery so I’m starting to point the issue at the module itself since everything else that requires the SSH key based login between two machines works just fine.
Well this is a 30 day trial of Advanced Recovery for another party, I’m just doing the setup. The admin has spoken with their sales rep about the issue and asked for some assistance, the response was that support credits needed to be purchased. They weren’t thrilled with that response which lead to my warm spare testing.
So there is no included support right now. If you still think opening a ticket is going to get something down without support credits, I will tell the admin to open one since all this is under his Sangoma account (deployment IDs, licenses, etc).
So just a little update. A ticket was opened on Thursday about this issue, it took until about 40 minutes ago to get a response that it would be assigned to an engineer. This is after two days of me asking for an update.
However, since there were modules updated in the last few days including another update to Core yesterday (.51) I went into the primary server and updated all the modules that needed to be updated. Now it is important to note, I only did this on the primary server not the secondary server. After the update was completed I tested Advanced Recovery and BOOM! SSH connection is made to the server I kept getting errors that SSH wasn’t setup right. The recovery builds out the primary and the secondary with no issue.
At this point I delete everything in Advanced Recovery on the primary server, this basically resets it to run the configuration wizard again. I jump over the secondary and try to connect back to the primary under Advanced Recovery and bam, “The server does not have SSH access configured properly” so it can’t connect to the system.
So what did I just do, I updated the secondary server and tested again. Lo and behold, I can connect to the primary server via Advanced Recovery so clearly these errors about the other server having SSH misconfigured were false as it only worked when the system issuing the SSH connection via Advanced Recovery was updated to most recent modules released in the last few days.
This was clearly an issue with the modules and something over the releases from the last couple days fixed the issue. Since things like Framework and Core are base modules that almost everything else depends on (perhaps System Admin too) I have to ask, when these modules are updated the QA process does require that all the other modules that depend on them are tested too? Because this seems to be a theme where one module is updated and other things that depend on it aren’t tested and those break.
Notice how that section becomes scrollable because the tooltip display content? Yeah, so when you hover away from the tooltip to scroll down to read the full tooltip the content becomes hidden again. So good job on that, way to make it helpful for users.
@ncorbic You guys need to be better at this. Period.