Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
StoutElephant16
Moderator
5 Questions, 32 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

32 × Eureka!
0 Votes
16 Answers
893 Views
0 Votes 16 Answers 893 Views
Hi All 🙂 I am trying to run a Hyperparameter Optimization Task, where the controller task is submitted to the services queue (and picked up by the the defau...
one year ago
0 Votes
10 Answers
935 Views
0 Votes 10 Answers 935 Views
Hi All 🙂 I am self hosting my ClearML Server on an EC2 instance on AWS. As far as I understood based on None " By default, the open source ClearML Server ru...
one year ago
0 Votes
7 Answers
993 Views
0 Votes 7 Answers 993 Views
Hey all 🙂 I'm having trouble using the clearml-agent command. I am executing an experiment from a code repository and I am using a requirements.txt file to ...
2 years ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
Hi everybody, I found that it is possible to schedule experiments using the TaskScheduler class in python https://clear.ml/docs/latest/docs/references/sdk/sc...
2 years ago
0 Votes
30 Answers
994 Views
0 Votes 30 Answers 994 Views
Hey hey, I having trouble with ClearML and ALBs in the AWS. Could someone help me? 🙂 I am currently trying to deploy ClearML in the AWS. The Basic Infrastru...
2 years ago
0 Hey All

Hey SuccessfulKoala55 . I use my own custom Daemon that in turn runs clearml-agent execute for some complicated reasons (other correlated processes) I want to be able to fetch and execute only certain task id, instead of pulling one from the queue.

2 years ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

Ok, I think that's been very helpful 🙂 I'll experiment a little, now that I know a Health Check that must work. I'll write here if I find something! Thanks a lot for the awesome support!

2 years ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

And it's still unhealthy. I am starting to suspect that somehow the Autoscaling Part in between the ALB and the ClearML server could be causing the problem.

2 years ago
0 Hi All

UPDATE: Now the agent-services is working 🙂 I was able to solve it by providing CLEARML_API_HOST: ${CLEARML_API_HOST:- None } in my docker-compose instead of CLEARML_API_HOST: None , where the environment variable CLEARML_API_HOST was set as my public api address. So in other words, the traffic is going through the internet, back to the server (same machine) and now it seems to be working. Thanks @<1593051292383580160:...

one year ago
0 Hi All

@<1593051292383580160:profile|SoreSparrow36> thanks a lot, I'll try it out 😉 Did I get it right? You have the public DNSs for CLEARML_WEB_HOST and CLEARML_FILES_HOST (both without http:// or https://)?

one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

Currently I'm "cheating" and counting a 405 as the success code for the healthcheck.

2 years ago
0 Hi All

@<1523701087100473344:profile|SuccessfulKoala55> but the problem still persists. Any other ideas?

one year ago
0 Hi All

After about 8hrs running I finally got clearml_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the ClearML API server None ?

one year ago
0 Hi All

In my environment I have defined CLEARML_API_HOST (hard coded in docker-compose), CLEARML_WEB_HOST , CLEARML_FILES_HOST , CLEARML_API_ACCESS_KEY , CLEARML_API_SECRET_KEY , CLEARML_AGENT_GIT_USER and CLEARML_AGENT_GIT_PASS .

one year ago
0 Hey All

Ok thanks a lot for the Info! For now (as a simple error handling): is there any way I can tell the ClearML Server that the experiment should be cancelled using the shell?

2 years ago
0 Hi All

UPDATE: setting SHUTDOWN_IF_NO_ACCESS_KEY: 1 allowed me to see the agent-services container, and then a docker inspect clearml-agent-services showed me that the environment variables needed for the agent in the docker-compose.yml were empty. So the problem was in my bootstrap script.

Because SHUTDOWN_IF_NO_ACCESS_KEY was set to 0 before, the container would disappear 🙂

Thanks a lot for helping me figure this out!

one year ago
0 Hi All

Good idea!
So, my api server is CLEARML_API_HOST= None and I ran telnet apiserver 8008 and received:

Trying 172.18.0.6...
Connected to apiserver.
Escape character is '^]'.

It seems the container is able to resolve the address and connect.

one year ago
0 Hi All

Hi @<1523701087100473344:profile|SuccessfulKoala55> Thanks! it seems the container is able to download packages, I attached the full log here 😉

one year ago
0 Hi All

I have this block in my docker compose:

  agent-services:
    networks:
      - backend
    container_name: clearml-agent-services
    image: allegroai/clearml-agent-services:latest
    deploy:
      restart_policy:
        condition: on-failure
    privileged: true
    environment:
      <....>
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock
      - /opt/clearml/agent:/root/.clearml
    depends_on:
      - apiserver
    entrypoint: >
      bash -c "curl --retry 10 --retr...
one year ago
0 Hi All

Thanks a lot! Yes, I don't see such a worker in the UI. docker ps returns the containers below. I suppose the clearml-apiserver is the relevant one.
image

one year ago
0 Hi All

Yes thanks a lot 🙂 This already helped me a lot 😉 I'll investigate!

one year ago
0 Hi All

Here's my docker-compose, maybe I'm missing something 😄 And thanks again for the support 😉

one year ago
0 Hi All

Hi @<1593051292383580160:profile|SoreSparrow36> , thanks a lot! I ran docker network connect backend clearml-agent-services and got the response:
Error response from daemon: endpoint with name clearml-agent-services already exists in network clearml_backend
It was expected because my docker-compose had the entry

  agent-services:
    networks:
      - backend 

I can also resolve and curl None from the clearml-agent-services container.

I managed...

one year ago
0 Hi All

I left the environment variables out to keep things short, but there is one SHUTDOWN_IF_NO_ACCESS_KEY: 1 . Maybe some authentication is failing and the container is stopping.

one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

JuicyFox94 I think I found the problem. To my absolute shame, the security group of the ALB had no Outbound rules, i.e. no traffic was allowed out of the ALB 🙈 . Now I can access the ClearML Webserver!

2 years ago
2 years ago
0 Hey All

Fantastic thanks a lot 🙂

2 years ago
0 Hi All

Currently I have the environment variable CLEARML_API_HOST= None set and CLEARML_HOST_IP is empty. I assume that the latter is not needed when the CLEARML_API_HOST is defined.

one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

And I could access the web server even if the health check was failing. So that was not a problem in the end.

2 years ago
Show more results compactanswers