So, I’ve tried about half a dozen different AMI’s using AWS Linux, RHEL, and Ubuntu. Nothing getting me past the basics of docker-machine as shown below. Following the docker-machine class it appears to try spawning a machine and fails. It catches an exit of the process immediately.
info: Found docker-machine executable
info: Loaded 1 nodes
info: Loaded 2 routes
info: Starting http proxy on 3000
info: Trying to create machine… (1)
warn: Cannot create machine: Error: docker-machine exited with code 1
info: Trying to create machine… (2)
warn: Cannot create machine: Error: docker-machine exited with code 1
info: Trying to create machine… (3)
warn: Cannot create machine: Error: docker-machine exited with code 1
warn: Could not remove docker-machine, it’s likely that the machine was not created, but double-check!
warn: Cannot create node via autoscaling: Cannot create machine (attempted 3 times)
warn: Cannot forward task 5d400299-07f8-4327-8876-d9e22aa8d7f0 to processing node 10.0.0.181:3001: No nodes available (attempted to autoscale but failed). Try again later.
Exit code 1, going by docker convention, suggests an application error. Looking through ClusterODM.log shows nothing of interest. Versioning has been an issue in other places, are there specific versions of supporting packages that could be fouling this up? I’m currently running node 14.13.0, docker 20.10.7, and docker-machine 0.16.2. Spot is currently ‘false’ so no spot complexities. The node configuration starts with a single, locked node.
A few things I’ve considered and tried:
- varying the OS versions
- making sure the user (ec2-user for RHEL and AWS, ubuntu for Ubuntu) has permissions for ec2 and s3.
- Updating and logging into a provided AMI before creating a child AMI so that the cred’s are in .ssh.
- Ensuring that ports are open. I’ve allowed inbound traffic for the primary for 22 and 3000-3001. The private IP’s are allowed 22, 443, 80, and 3000-3001. Are there other ports used by docker-machine?
If I can get past machine creation, then I fully expect AWS Linux to fail since the 19.03.9.sh engineInstallURL doesn’t account for amzn. (BTW - that shell script is awfully similar to the docker one here - https://get.docker.com.) However, that doesn’t explain the success others have reported using an Ubuntu AMI - obviously, the solution doesn’t rest on launching AWS Linux instances only although that would be a reasonable extension of the .sh docker installer.
I saw drivers used in docker machine creation for virtualbox. Do I need that installed? Are there specific OS releases that have worked?
Any insights would be greatly appreciated!