Answered

Hi, I am having a problem with ClearML running on our private server.
This error occurred on an older version of ClearML and the server. Now, after an update and a purge of the whole old database with docker down -v, the error persists and I have no idea how to fix it. ClearML and the server are both on the latest version as of 31 May 2022.

  
  
Posted 2 years ago

Answers 19


Are you running your dockers on Linux or Windows?

  
  
Posted 2 years ago

The status of the index events-training_stats_scalar-d1bd92a3b039400cbafc60a7a5b1e52b is red, meaning that the data for this index got corrupted. Since there are no replicas, the only feasible option is to delete this index. All the training scalar events for the old tasks would then be lost, but newly created tasks should start working fine.
curl -XDELETE
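Before deleting anything, it can help to confirm which indices are actually red. A minimal sketch, assuming the default Elasticsearch `_cat/indices` output, where the first column is the health and the third is the index name. `red_indices` is a hypothetical helper name, and the host and port in the usage comment are assumptions that depend on where the command is run:

```shell
# Hypothetical helper: print the names of indices whose health is "red".
# Reads `_cat/indices`-style output on stdin
# (default columns: health status index uuid pri rep ...).
red_indices() {
  awk '$1 == "red" { print $3 }'
}

# Usage (assumed host and port; adjust to your deployment):
#   curl -s "http://localhost:9200/_cat/indices" | red_indices
```

Only the indices printed here would be candidates for deletion; green and yellow indices are left alone.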

  
  
Posted 2 years ago

SweetBadger76 sorry to tag you, but I don't know where to find the logs. Are there elasticsearch logs on the server where I installed the ClearML server?

  
  
Posted 2 years ago

AppetizingMouse58 everything is on Linux. Our idea was to run docker on the same server to initiate tasks from the UI, but it was taking too much time, so we gave up and still run "python train.py experiment=myexpname"

  
  
Posted 2 years ago

Glad to hear that it helped:)

  
  
Posted 2 years ago

Ok, I see. Then you can enter the apiserver container:
sudo docker exec -it clearml-apiserver /bin/bash
And run the following commands inside the container:
curl -XGET curl -XGET

  
  
Posted 2 years ago

Do you think manually deleting the folder /opt/clearml/data/ would solve this problem the same way?

  
  
Posted 2 years ago

Do you mean the "search_phase_execution" error? Yes, stopping the containers, deleting the data folder and starting the containers again would bring you to a "clean install" state. But then you would lose all your data, not only the task scalar results.

  
  
Posted 2 years ago

AppetizingMouse58 Thanks for the answer, sending the logs

  
  
Posted 2 years ago

I have a firewall installed on the server, and not all ports are open

  
  
Posted 2 years ago

Hi MortifiedDove27 , you can run the following commands on the clearml server host to get the docker logs for the apiserver and elasticsearch:
sudo docker logs clearml-apiserver > apiserver.logs 2>&1
sudo docker logs clearml-elastic > elastic.logs 2>&1

  
  
Posted 2 years ago

can you please provide the apiserver log and the elasticsearch log?

  
  
Posted 2 years ago

It seems that elasticsearch is failing on any search request. Can you please run the following commands and share the results?
curl -XGET curl -XGET
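One quick way to summarise whether Elasticsearch is healthy is to reduce its health report to a single word. A rough sketch, assuming the JSON returned by the Elasticsearch `_cluster/health` endpoint, which carries a `status` field of green, yellow or red. `cluster_status` is a hypothetical helper name, and the URL in the usage comment is an assumption, not necessarily the address used in this thread:

```shell
# Hypothetical helper: extract the "status" value from
# `_cluster/health` JSON read on stdin, without requiring jq.
cluster_status() {
  sed -n 's/.*"status"[[:space:]]*:[[:space:]]*"\([a-z]*\)".*/\1/p'
}

# Usage (assumed host and port; adjust to your deployment):
#   curl -s "http://localhost:9200/_cluster/health" | cluster_status
```

If the request itself fails, for example with a connection error, the helper prints nothing, which is a hint to check the host name and port first.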

  
  
Posted 2 years ago

curl: (7) Failed to connect to localhost port 9200: Connection refused

  
  
Posted 2 years ago

doesn't fit in 1 message in slack

  
  
Posted 2 years ago

Thank you very much, it worked! I hope I will never see this kind of bug again. I will be happy to give more feedback if you would like to find the root cause.

  
  
Posted 2 years ago

Hi Igor
we are working on your issue and will update you asap

  
  
Posted 2 years ago

Hi David, where can I get these logs?

  
  
Posted 2 years ago

Hey Igor
I am not the expert on this topic. Someone who knows it better will get back to you straight after his meeting. 🙂

  
  
Posted 2 years ago