Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
StoutElephant16
Moderator
5 Questions, 32 Answers
  Active since 10 January 2023
  Last activity 9 months ago

Reputation

0

Badges 1

32 × Eureka!
0 Votes
16 Answers
434 Views
0 Votes 16 Answers 434 Views
Hi All 🙂 I am trying to run a Hyperparameter Optimization Task, where the controller task is submitted to the services queue (and picked up by the the defau...
9 months ago
0 Votes
2 Answers
656 Views
0 Votes 2 Answers 656 Views
Hi everybody, I found that it is possible to schedule experiments using the TaskScheduler class in python https://clear.ml/docs/latest/docs/references/sdk/sc...
one year ago
0 Votes
10 Answers
463 Views
0 Votes 10 Answers 463 Views
Hi All 🙂 I am self hosting my ClearML Server on an EC2 instance on AWS. As far as I understood based on None " By default, the open source ClearML Server ru...
9 months ago
0 Votes
30 Answers
542 Views
0 Votes 30 Answers 542 Views
Hey hey, I having trouble with ClearML and ALBs in the AWS. Could someone help me? 🙂 I am currently trying to deploy ClearML in the AWS. The Basic Infrastru...
one year ago
0 Votes
7 Answers
545 Views
0 Votes 7 Answers 545 Views
Hey all 🙂 I'm having trouble using the clearml-agent command. I am executing an experiment from a code repository and I am using a requirements.txt file to ...
one year ago
0 Hi All

UPDATE: Now the agent-services is working 🙂 I was able to solve it by providing CLEARML_API_HOST: ${CLEARML_API_HOST:- None } in my docker-compose instead of CLEARML_API_HOST: None , where the environment variable CLEARML_API_HOST was set as my public api address. So in other words, the traffic is going through the internet, back to the server (same machine) and now it seems to be working. Thanks @<1593051292383580160:...

9 months ago
0 Hi All

@<1593051292383580160:profile|SoreSparrow36> thanks a lot, I'll try it out 😉 Did I get it right? You have the public DNSs for CLEARML_WEB_HOST and CLEARML_FILES_HOST (both without http:// or https://)?

9 months ago
0 Hi All

After about 8hrs running I finally got clearml_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the ClearML API server None ?

9 months ago
0 Hi All

UPDATE: setting SHUTDOWN_IF_NO_ACCESS_KEY: 1 allowed me to see the agent-services container, and then a docker inspect clearml-agent-services showed me that the environment variables needed for the agent in the docker-compose.yml were empty. So the problem was in my bootstrap script.

Because SHUTDOWN_IF_NO_ACCESS_KEY was set to 0 before, the container would disappear 🙂

Thanks a lot for helping me figure this out!

9 months ago
0 Hi All

@<1523701087100473344:profile|SuccessfulKoala55> but the problem still persists. Any other ideas?

9 months ago
0 Hi All

Currently I have the environment variable CLEARML_API_HOST= None set and CLEARML_HOST_IP is empty. I assume that the latter is not needed when the CLEARML_API_HOST is defined.

9 months ago
0 Hi All

In my environment I have defined CLEARML_API_HOST (hard coded in docker-compose), CLEARML_WEB_HOST , CLEARML_FILES_HOST , CLEARML_API_ACCESS_KEY , CLEARML_API_SECRET_KEY , CLEARML_AGENT_GIT_USER and CLEARML_AGENT_GIT_PASS .

9 months ago
0 Hi All

Hi @<1523701087100473344:profile|SuccessfulKoala55> Thanks! it seems the container is able to download packages, I attached the full log here 😉

9 months ago
0 Hi All

Good idea!
So, my api server is CLEARML_API_HOST= None and I ran telnet apiserver 8008 and received:

Trying 172.18.0.6...
Connected to apiserver.
Escape character is '^]'.

It seems the container is able to resolve the address and connect.

9 months ago
0 Hi All

Hi @<1593051292383580160:profile|SoreSparrow36> , thanks a lot! I ran docker network connect backend clearml-agent-services and got the response:
Error response from daemon: endpoint with name clearml-agent-services already exists in network clearml_backend
It was expected because my docker-compose had the entry

  agent-services:
    networks:
      - backend 

I can also resolve and curl None from the clearml-agent-services container.

I managed...

9 months ago
0 Hi All

Here's my docker-compose, maybe I'm missing something 😄 And thanks again for the support 😉

9 months ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

JuicyFox94 I think I found the problem. To my absolute shame, the security group of the ALB had no Outbound rules, i.e. no traffic was allowed out of the ALB 🙈 . Now I can access the ClearML Webserver!

one year ago
0 Hi All

Thanks a lot! Yes, I don't see such a worker in the UI. docker ps returns the containers below. I suppose the clearml-apiserver is the relevant one.
image

9 months ago
0 Hi All

Yes thanks a lot 🙂 This already helped me a lot 😉 I'll investigate!

9 months ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

And I could access the web server even if the health check was failing. So that was not a problem in the end.

one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

Ok, I think that's been very helpful 🙂 I'll experiment a little, now that I know a Health Check that must work. I'll write here if I find something! Thanks a lot for the awesome support!

one year ago
0 Hi All

I left the environment variables out to keep things short, but there is one SHUTDOWN_IF_NO_ACCESS_KEY: 1 . Maybe some authentication is failing and the container is stopping.

9 months ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

Currently I'm "cheating" and counting a 405 as the success code for the healthcheck.

one year ago
0 Hi All

I have this block in my docker compose:

  agent-services:
    networks:
      - backend
    container_name: clearml-agent-services
    image: allegroai/clearml-agent-services:latest
    deploy:
      restart_policy:
        condition: on-failure
    privileged: true
    environment:
      <....>
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock
      - /opt/clearml/agent:/root/.clearml
    depends_on:
      - apiserver
    entrypoint: >
      bash -c "curl --retry 10 --retr...
9 months ago
one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

But I still have one thing I'd like to fix: the health check for the file server on port 8081 gives me unhealthy for path "/". Is there a valid path you know I can use there for health checks? A curl gives me

one year ago
0 Hey Hey, I Having Trouble With Clearml And Albs In The Aws. Could Someone Help Me?

And it's still unhealthy. I am starting to suspect that somehow the Autoscaling Part in between the ALB and the ClearML server could be causing the problem.

one year ago
0 Hey All

Hey SuccessfulKoala55 . I use my own custom Daemon that in turn runs clearml-agent execute for some complicated reasons (other correlated processes) I want to be able to fetch and execute only certain task id, instead of pulling one from the queue.

one year ago
Show more results compactanswers