failed to connect to 'ipv4:10.160.113.47:2222': socket error: connection refused

Question

failed to connect to 'ipv4:10.160.113.47:2222': socket error: connection refused

echoyes opened this issue 8 years ago · comments

when I run the distributed mnist_cnn.py and I just followed the comand like this "./start_tf.sh 8 3 mnist_cnn.py", I encountered some errors such as "failed to connect to 'ipv4:10.160.113.47:2222': socket error: connection refused" .
Besides, I am also wondering by using the command "./start_tf.sh 8 3 mnist_cnn.py" how to start remote server process without using ssh or some other protocols.
thanks.

郑泽宇 · Answer 1 · Mon Aug 15 2016 20:48:43 GMT+0800 (China Standard Time)

You should run the start_tf.sh inside the k8s cluster. That means you need to use
kubectl exec -it some-pod bash
to go into the cluster and start the training process. some-pod could be any pod that runs inside the cluster. For example you use the ps-worker pod.

If you want to train models using remote server, TensorFlow uses gRPC by default (and I don't think you can change that without significant code change).

GuiYang · Answer 2 · Sat Apr 22 2017 00:25:57 GMT+0800 (China Standard Time)

您好！
您已经开源了自己的v1.0版本的代码，我想问下，如果自己搭建起来了kubernetes集群后，将您的v1.0版本的代码整合到想有集群中，可行么？

谢谢🙏

caicloud-bot · Answer 3 · Thu Jul 05 2018 19:17:46 GMT+0800 (China Standard Time)

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

/lifecycle stale

caicloud-bot · Answer 4 · Sat Aug 04 2018 20:23:48 GMT+0800 (China Standard Time)

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

caicloud-bot · Answer 5 · Mon Sep 03 2018 21:22:00 GMT+0800 (China Standard Time)

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

/close