Reputation
Badges 1
49 × Eureka!This is clearml-agent helm chart values.yaml file i used to install
@<1523701087100473344:profile|SuccessfulKoala55> what is task log? you mean the pod log provisioned by clearml-agent? do you want me to show them?
I run clearml-agent manually in gpu available pod using command clearml-agent daemon --queue shelley
and this doesnât show gpu usage same with when i run task remotely
and here is the log
agent.worker_id =
agent.worker_name = shelley-gpu-pod
agent.force_git_ssh_protocol = false
agent.python_binary =
agent.package_manager.type = pip
agent.package_manager.pip_version.0 = <20.2 ; python_version < â3.10â
agent.package_manager.pip_version.1 = <22.3 ; python_ver...
The clearml server I installed is a self-hosted server, and developers log in using a fixed ID and password for authentication. Thatâs it!
Futhermore, to access ssh/vscode/jupyterlab directly without ssh tunneling, I modified the clearml-session script, and once I upload this script to the DevOps project in draft status, developers clone it to their own project. Then, they enqueue and wait for the command and URL to access ssh/vscode/jupyterlab, which will be displayed.
Thanks! also logs too?
pod log is too long. would it be ok if i upload pod log file here??
@<1523701087100473344:profile|SuccessfulKoala55> Okay..but how can i specify agentâs verison in helm chart?
then, is there any way to get embed code from scalars?
It seems that there is no way to add environments, so i customized charts and using it on my own.
Are there other people experiencing the same issue as me?
@<1523701087100473344:profile|SuccessfulKoala55> I realized that this is not an issue with the cloud or on-premise environment. itâs working well on gke but not working on eks. here is the log when i run âclearml-agent daemon --queue ~â command on eks
root@shelley-gpu-pod:/# clearml-agent daemon --queue shelley3
/usr/local/lib/python3.8/dist-packages/requests/init.py:109: RequestsDependencyWarning: urllib3 (2.0.1) or chardet (None)/charset_normalizer (3.1.0) doesnât match a supported ve...
Hi @<1523701205467926528:profile|AgitatedDove14>
The server is already self hosted. I realized i canât create a report using clearml sdk. so i think i need to find other ways
i fount the solution!! i added configuration to helmâs values.yaml below.
additionalConfigs:
# services.conf: |
# tasks {
# non_responsive_tasks_watchdog {
# # In-progress tasks that havenât been updated for at least âvalueâ seconds will be stopped by the watchdog
# threshold_sec: 21000
# # Watchdog will sleep for this number of seconds after each cycle
# watch_interval_sec: 900
# }
# }
apiserver.co...
It also shows on project detail page.
because clearml-agnet is not installed in my gke cluster
root@shelley-gpu-pod:/# clearml-agent daemon --queue shelley2 --foreground
/usr/local/lib/python3.8/dist-packages/requests/init.py:109: RequestsDependencyWarning: urllib3 (2.0.2) or chardet (None)/charset_normalizer (3.1.0) doesnât match a supported version!
warnings.warn(
Using environment access key CLEARML_API_ACCESS_KEY=ââ
Using environment secret key CLEARML_API_SECRET_KEY=********
Current configuration (clearml_agent v1.5.2, location: None):
agent.worker_id ...
Hi again đ @<1523701087100473344:profile|SuccessfulKoala55> sure!

@<1523701070390366208:profile|CostlyOstrich36> Hello. Oh, sorry for the lack of explanation.when i execute the command âclearml-session ~â, jupyter url format is â None :{local_jupyter_port}/?token={jupyter_token}â and vs code url format is just â None :{local_vscode_port}â like the pic i attached here. I wonder why vs code url doesnât have token.
