Answered
I Have A Question About Remote Dataset Get/Download.

I have a question about remote dataset get/download.
(Server A) Deploy the ClearML server.
(Server B) Run clearml-init and set the API credentials from server A's ClearML web UI.
(Server B) Dataset.get() works for data that was created and uploaded from server B, but not for data that was created and uploaded from a different server. (Dataset.get() works across servers when executed on server A.)
I think ClearML on server B tries to access the file server at localhost:8081 unless server A's file server address (e.g. example.com:8081) is set in clearml.conf.
I'll attach the Python error log in the thread. Thanks.

  
  
Posted 2 years ago

Answers 12


The dataset file URL is set on upload and stored on the server, so you can't have different machines using a different server address for the same server.
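Because the URL is fixed at upload time, one quick sanity check is to look at where a stored URL points. The following is a minimal sketch using only the Python standard library; the helper name `is_loopback_url` and the sample URLs are hypothetical and not part of ClearML.

```python
from urllib.parse import urlparse

# Hostnames that only ever resolve to the machine doing the lookup.
LOOPBACK_HOSTS = {"localhost", "127.0.0.1", "::1"}


def is_loopback_url(url: str) -> bool:
    """Return True if the URL's host is a loopback address (hypothetical helper)."""
    host = urlparse(url).hostname or ""
    return host.lower() in LOOPBACK_HOSTS


# Made-up URLs for illustration only.
stored_url = "http://localhost:8081/test_project/some_dataset/artifacts/state/state.json"
public_url = "http://example.com:8081/test_project/some_dataset/artifacts/state/state.json"

print(is_loopback_url(stored_url))
print(is_loopback_url(public_url))
```

A URL that passes this check can only be downloaded from the machine that hosts the file server, which would match the "Connection refused" errors seen from another machine.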

  
  
Posted 2 years ago

>>> d = Dataset.get(dataset_name="Anonymous task (user@beryl 2022-03-23 04:05:19)", dataset_project="test_project").get_local_copy()
Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e4b490>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
Retrying (Retry(total=1, connect=1, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e4b6d0>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
Retrying (Retry(total=0, connect=0, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e4b910>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
2022-03-23 14:25:51,073 - clearml.storage - ERROR - Could not download , err: HTTPConnectionPool(host='localhost', port=8081): Max retries exceeded with url: /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c01f7b070>: Failed to establish a new connection: [Errno 111] Connection refused'))
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/user/.conda/envs/test_project/lib/python3.8/site-packages/clearml/datasets/dataset.py", line 968, in get
raise ValueError('Could not load Dataset id={} state'.format(task.id))
ValueError: Could not load Dataset id=c05641c2e1c74389b471fbc9110c302d state

>>> d = Dataset.get(dataset_name="Anonymous task (user@beryl 2022-03-23 04:05:19)", dataset_project="test_project").get_mutable_local_copy()
Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e63790>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
Retrying (Retry(total=1, connect=1, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e639d0>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
Retrying (Retry(total=0, connect=0, read=5, redirect=5, status=None)) after connection broken by 'NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c23e63c10>: Failed to establish a new connection: [Errno 111] Connection refused')': /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json
2022-03-23 14:27:03,162 - clearml.storage - ERROR - Could not download , err: HTTPConnectionPool(host='localhost', port=8081): Max retries exceeded with url: /test_project/Anonymous%20task%20%28user%2540beryl%202022-03-23%2004%253A05%253A19%29.c05641c2e1c74389b471fbc9110c302d/artifacts/state/state.json (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f0c25ee7f70>: Failed to establish a new connection: [Errno 111] Connection refused'))
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/home/user/.conda/envs/test_project/lib/python3.8/site-packages/clearml/datasets/dataset.py", line 968, in get
raise ValueError('Could not load Dataset id={} state'.format(task.id))
ValueError: Could not load Dataset id=c05641c2e1c74389b471fbc9110c302d state

  
  
Posted 2 years ago

Hi MagnificentWorm7 ,

but not working with created and uploaded from different server

What is the clearml.conf configuration on the other server?

  
  
Posted 2 years ago

So for some reason it tries to use "localhost" instead of the actual address. Do you have any other config file there (perhaps an old trains.conf file), or maybe some ClearML-related environment variables?
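A rough way to run that check is sketched below: it lists environment variables with a ClearML or legacy Trains prefix and looks for config files in the home directory. The `CLEARML_`/`TRAINS_` prefixes and the `~/clearml.conf` and `~/trains.conf` locations are assumed to be the usual defaults; the helper names are hypothetical.

```python
import os
from pathlib import Path

# Assumed defaults: variable prefixes and config file names used by ClearML / legacy Trains.
PREFIXES = ("CLEARML_", "TRAINS_")
CONFIG_NAMES = ("clearml.conf", "trains.conf")


def find_override_vars(environ):
    """Return the environment variables whose names start with a known prefix."""
    return {name: value for name, value in environ.items() if name.startswith(PREFIXES)}


def find_config_files(directory):
    """Return the candidate config files that actually exist in the given directory."""
    base = Path(directory)
    return [base / name for name in CONFIG_NAMES if (base / name).is_file()]


# Print only variable names, since the values may contain credentials.
for name in sorted(find_override_vars(os.environ)):
    print("environment variable set:", name)

for path in find_config_files(Path.home()):
    print("config file found:", path)
```

If this prints a leftover config file or a host-related variable, that may explain why a different address than expected is picked up.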

  
  
Posted 2 years ago

But on which machine? And what's the clearml.conf configuration there (just to make sure the generated file is OK)?

  
  
Posted 2 years ago

SuccessfulKoala55 yep

  
  
Posted 2 years ago

is this log from running on machine "B"?

  
  
Posted 2 years ago

SuccessfulKoala55 Thanks a lot. Problem solved. Have a good day!

  
  
Posted 2 years ago

SuccessfulKoala55 The file was created by running clearml-init on the CLI.

  
  
Posted 2 years ago

Oh, wait - scratch that - is it possible that the dataset was uploaded from server A where you used localhost:8081 as the address?

  
  
Posted 2 years ago

SuccessfulKoala55

The dataset file URL is set on upload and stored on the server

This might be the reason. I think the server address on machine A is set to "localhost:port".
So if I change "localhost" to "<server IP>" on server A and re-upload the dataset, will it be accessible remotely?

  
  
Posted 2 years ago

Yeah, sure

  
  
Posted 2 years ago