Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Answered
Hi, I Have Run K8S_Glue_Example.Py On My On-Prem K8S, And Have Preconfigured Nodeport Services. I Succeeded To Use Clearml Session To Create Pods But The Ssh Tunneling Failed. It Tried To Connect Clusterip Of The Pod And Port 10020 Instead Of Node Ip And

Hi, I have run k8s_glue_example.py on my on-prem K8s, and have preconfigured NodePort services. I succeeded to use ClearML Session to create pods but the SSH tunneling failed. It tried to connect ClusterIP of the pod and port 10020 instead of node IP and NodePort. How should I fix it?
Following is my service yml:

kind: Service
apiVersion: v1
metadata:
  name: clearml-agent-1-nodeprot
  namespace: clearml
spec:
  ports:
  - name: clearml-agent-ssh
    port: 10022
    targetPort: 10022
  type: NodePort    
  selector:
    ai.allegro.agent.serial: pod-1
  
  
Posted 9 months ago
Votes Newest

Answers 4


Hi @<1523701087100473344:profile|SuccessfulKoala55> , Even I run the clearml-session with command line option --remote-ssh-port and --remote-gateway the SSH tunneling still failed.
Following is my complete step:

  • set k8s service with the following yml:
kind: Service
apiVersion: v1
metadata:
  name: clearml-agent-1-nodeprot
  namespace: clearml
spec:
  ports:
  - name: clearml-agent-ssh
    port: 10022
    targetPort: 10022
    nodePort: 31919
  type: NodePort
  selector:
    ai.allegro.agent.serial: pod-1
  1. run python k8s_glue_example.py --queue gpu-1 --ports-mode --template-yaml gpu-1.yml on k8s node.
  2. run clearml-session --docker nvidia/cuda:11.0.3-runtime-ubuntu20.04 --remote-gateway 10.190.253.18 --remote-ssh-port 31919 on my PC. 10.190.253.18 is the node IP that session pod running.
  3. the clearml-session log on my PC:
Remote machine is ready
Setting up connection to remote session
Starting SSH tunnel to root@10.190.253.18, port 31919

SSH tunneling failed, retrying in 3 seconds
Starting SSH tunnel to root@10.190.253.18, port 31919
.......

Could you provide a complete example or tutorial?

  
  
Posted 9 months ago

Hi @<1589065012880609280:profile|PunySquid51> , clearml-session uses port 10022 by default. you can use the --remote-ssh-port command line option to specify a different port to use

  
  
Posted 9 months ago

I also tried to run ssh -P 31919 root@10.190.253.18 , but got error message: ssh: connect to host 0.0.124.175 port 22: Unknown error .

  
  
Posted 9 months ago

@<1523701087100473344:profile|SuccessfulKoala55> I solved it by other settings, thanks.

  
  
Posted 9 months ago
496 Views
4 Answers
9 months ago
9 months ago
Tags