we will probably end up pulling the images from docker.io and pushing those to our container registry
you are correct and thank you for the reply @<1523701070390366208:profile|CostlyOstrich36>
going forward, I assume the clearml-server open-source releases will continue to be released on Docker Hub
Hey @<1688125253085040640:profile|DepravedCrow61>
should I open an issue to follow this up?
we are seeing this bug in almost every task
hey @<1688125253085040640:profile|DepravedCrow61>
I can confirm upgrading to 2.1 did the trick!
(closed the issue)
Thanks!
in the UI I also see the display name, so I pulled all the users' info and matched names to ids
correct, but!
I wrote a script that pulls tasks and limits them to a given user
so I would need users to know their own id in advance
thanks @<1523701070390366208:profile|CostlyOstrich36>
I've done this successfully using the API already
as for the sdk option - in which format should I provide the list of tasks/projects to the sdk?
for only_fields=["id", "name", "created", "status_changed", "status", "user"]:
output example
{'id': '02a3f5929cf246138994c9243a692219', 'name': 'docfm_v7_safe_32gpu80g_11Jan24_4w', 'created': datetime.datetime(2024, 1, 11, 9, 54, 33, 406000, tzinfo=tzutc()), 'status_changed': dateti...
yep, that was my approach, with no luck so far
hopefully someone from the ClearML dev team can give their input on this
I have another instance with clearml-server 1.7 and I got same behavior
am I missing anything? I was under the assumption that jobs with the same project/task names should be overwritten and not duplicated
the same basic job does not get overwritten; a new one is created every time
I manage clearml-server for many users
I wrote a script that pulls tasks (for review, pending deletion) and limits them to a specific user (I saw that the REST API can filter only by user id)
so the user will need to give the script their own user id in advance
I guess my options are:
- maintain a static list of users (which I get from the users.get_all endpoint)
- add another query to my script that matches user name to user id, before running tasks.get_all for the specific user id
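The second option above could look roughly like the sketch below. It only covers the matching step: it assumes the user records pulled from the server are dicts with "id" and "name" keys (an assumption, not checked against the API reference), and both helper names are hypothetical.

```python
from typing import Dict, Iterable, List, Mapping


def build_name_to_ids(users: Iterable[Mapping[str, str]]) -> Dict[str, List[str]]:
    """Map each display name to every user id that carries it (names can repeat)."""
    name_to_ids: Dict[str, List[str]] = {}
    for user in users:
        name = user.get("name")
        user_id = user.get("id")
        if not name or not user_id:
            continue  # skip incomplete records
        name_to_ids.setdefault(name, []).append(user_id)
    return name_to_ids


def resolve_user_id(name: str, name_to_ids: Mapping[str, List[str]]) -> str:
    """Return the single id for a display name, or fail loudly if it is ambiguous."""
    ids = name_to_ids.get(name, [])
    if len(ids) != 1:
        raise LookupError(f"expected exactly one user named {name!r}, found {len(ids)}")
    return ids[0]
```

The resolved id can then be used as the user filter in the task query, so people only need to pass their display name to the script. Failing on duplicate names is a deliberate choice here, since the script is used to pick tasks for deletion.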
I'm using rpm based machine, but I get your direction
put the cert in the right place for python to look for it automatically
can I assume if it works smoothly with requests or urllib3 it will work for the ClearML API?
to be honest, the use case is mostly convenience
when people train ~5000+ experiments, all saved in a few sub folders with long strings as experiment names
before publishing a paper, for example, we want to move or copy a small number of successful trainings to a separate location and share them with other colleagues/management
I'd guess the alternatives are:
- changing the name of the successful training under the existing sub folder
- using move instead of clone
anything else?
in case this helps someone else: I did not have root access to the training machine to add the cert to the store
you can point your python to your own CA using:
export CURL_CA_BUNDLE=/path/to/CA.pem
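For anyone wondering why the environment variable works: the requests library, when verify is left at its default and trust_env is enabled, looks at REQUESTS_CA_BUNDLE first and then CURL_CA_BUNDLE. The sketch below mirrors that lookup order in a small helper (resolve_verify is a hypothetical name, not part of requests or ClearML) so the effective value can be checked before running a job. Whether every ClearML code path goes through this lookup is an assumption.

```python
from typing import Mapping, Union


def resolve_verify(env: Mapping[str, str], default: Union[bool, str] = True) -> Union[bool, str]:
    """Return the value that would be used for verify=, given an environment mapping.

    Mirrors the order requests uses: REQUESTS_CA_BUNDLE, then CURL_CA_BUNDLE,
    then the default (True means the bundled/system CA store).
    """
    return env.get("REQUESTS_CA_BUNDLE") or env.get("CURL_CA_BUNDLE") or default
```

Calling resolve_verify(os.environ) in the shell where the job runs shows which CA file, if any, would be picked up.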
@<1523701435869433856:profile|SmugDolphin23> working! here is what I have on Fedora/RHEL
- copy certs to /etc/pki/ca-trust/source/anchors/
- run update-ca-trust
yep, again, most jobs work; the issue is when a job tries to upload artifacts to the fileserver
offset = 0
limit = 500
all_data = []
while True:
    params = {
        'offset': offset,
        'limit': limit
    }
    response = requests.get(url, headers=headers, params=params, verify=False)
    data = response.json()
    projects = data['data']['projects']
    print(f"pulled {len(projects)} projects.")
    if len(projects) == 0:
        print("no project found - exiting ...")
        break
    all_data.extend(projects)
    offset += limit
print(f"pulled {le...
ohhh severe error here 🙂
I mixed this up with another API I worked on and did not read carefully enough to find the right flag
simply adding page to the body did the trick
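For reference, a page-based loop along the lines of the fix above might look like this sketch. The HTTP call is left to a caller-supplied function so the loop itself is easy to test; fetch_all and fetch_page are hypothetical names, and the page_size parameter name is an assumption (the thread only confirms page).

```python
from typing import Callable, Dict, List


def fetch_all(
    fetch_page: Callable[[int, int], List[Dict]],
    page_size: int = 500,
    max_pages: int = 10_000,
) -> List[Dict]:
    """Collect items page by page until a short or empty page comes back."""
    items: List[Dict] = []
    for page in range(max_pages):
        batch = fetch_page(page, page_size)
        items.extend(batch)
        if len(batch) < page_size:
            return items  # last page reached
    # Guard against a server that ignores the page parameter and never runs dry.
    raise RuntimeError(f"gave up after {max_pages} pages")
```

Here fetch_page(page, page_size) would send the request with the page number in the body and return the list under data.projects. The max_pages guard is there because the original symptom was a loop that kept receiving the same 500 items forever.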
thanks again @<1724235687256920064:profile|LonelyFly9>
I think this is the right approach, let me have a deeper look
thanks @<1724235687256920064:profile|LonelyFly9>
OK I got everything to work
I think this script can be useful to other people and will be happy to share
@<1523701070390366208:profile|CostlyOstrich36> is there a repo I can fork and contribute to?
the way I do pagination is wrong
@<1523701070390366208:profile|CostlyOstrich36> might have a champion's tip to throw in here 🙂
I know the 500 limit and using it
but my while loop keeps pulling the same 500 ... and runs endlessly
just confirming this with the user and will share it over here
I do recall that in the past the latest version caused this, and downgrading to a prior version fixed the issue
let me get the info and will post back here
10x @<1523701087100473344:profile|SuccessfulKoala55>
@<1523701435869433856:profile|SmugDolphin23> thanks for the good pointers
it did not work on the first attempt - requests did not validate the certs correctly
I have added this:
token_req = requests.get(api_server + "/auth.login", verify="<my_org_CA>", auth=(access_key, secret_key))
print(token_req)
I got back
<Response [200]>
which I believe is good right?
when adding token = token_req.json()["data"]["token"]
I got errors from json decoder, which I believe is exp...
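A 200 status alone does not guarantee the body is JSON, so it can help to decode defensively before indexing into data.token. The sketch below is one way to do that; extract_token is a hypothetical helper, and the idea that a non-JSON body means something other than the API answered (for example a web page) is an assumption, not a diagnosis of this case.

```python
import json
from typing import Optional


def extract_token(status_code: int, body: str) -> Optional[str]:
    """Return data.token from a login response body, or None if it is not there."""
    if status_code != 200:
        return None
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return None  # body was not JSON at all
    if not isinstance(payload, dict):
        return None
    data = payload.get("data")
    if not isinstance(data, dict):
        return None
    return data.get("token")
```

With the request above, this would be called as extract_token(token_req.status_code, token_req.text); printing token_req.text[:200] when it returns None shows what the server actually sent back.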
VivaciousPenguin66 your docs were helpful, I got SSL running, but my question remains
have you kept the needed http services accessible and only run the authentication via https?
api_server: "http://<my-clearml-server>:8008"
web_server: " "
files_server: "http://<my-clearml-server>:8081"
my current state is that the webserver is accessible via both http and https, on 8080 & 443
ok, hopefully someone will share some thoughts and how it went 🙂
the application is functional on localhost for sure