ThoughtfulOctopus83

7 Questions, 29 Answers

Active since 29 April 2023

Last activity one year ago

Reputation

Badges 1

28 × Eureka!

Questions 7
Answers 29

0 Votes

2 Answers

1K Views

0 Votes 2 Answers 1K Views

I Am Getting Error When Cloning Tfs Repository With Error . We User Https To Clone And Have No Possibility Yet With Ssh. "Check If Remote-Worker Has Valid Credentials [See Worker Configuration File]." I Have Added Below Config In /Home/Ubuntu-User/Clearm

i am getting error when cloning tfs repository with error . We user https to clone and have no possibility yet with ssh. "Check if remote-worker has valid cr...

mlops

one year ago

0 Votes

6 Answers

1K Views

0 Votes 6 Answers 1K Views

Need Guide To Install Clearml Server In Complete Disconnected Environment From Internet On Kubernetes Using Helm Chart. We Have Private Registry Harbour Which I Can Use For This. Please Let Me Know What Can Be Done.

Need guide to install ClearML server in complete disconnected environment from internet on kubernetes using helm chart. We have private registry Harbour whic...

kubernetes

one year ago

0 Votes

27 Answers

981 Views

0 Votes 27 Answers 981 Views

Clearml Agent Can Work On Gpu Machine In No Internet Environment Where We Have Proxy For Pip Packages And Ubuntu Updates ? If Yes, How To Configure These Proxy In Agent Settings. I See At Launch Of Script Saying "Hello" It Install Many Packages Befor

ClearML agent can work on GPU machine in no internet environment where we have proxy for pip packages and ubuntu updates ? If yes, How to configure these pro...

mlops

one year ago

0 Votes

3 Answers

1K Views

0 Votes 3 Answers 1K Views

Where I Can Find Logs Of Clearml Server Which Should Go Into Elk (Elastic Search ) For Security.

where I can find logs of clearML server which should go into ELK (elastic search ) for security.

clearml

one year ago

0 Votes

4 Answers

1K Views

0 Votes 4 Answers 1K Views

Is It Possible To Serve Model With Frontend Html Page To Allow Input To Be Entered. Something Like Image Upload To Predict Number On It For Minst Dataset

is it possible to serve model with frontend HTML page to allow input to be entered. Something like image upload to predict number on it for minst dataset ?

clearml

one year ago

0 Votes

3 Answers

895 Views

0 Votes 3 Answers 895 Views

@<1523701827080556544:profile|JuicyFox94> We are getting SSL error on clearML inference serving pod which I deployed using ClearML serving Helm chat repo. We...

clearml

one year ago

0 Votes

13 Answers

1K Views

0 Votes 13 Answers 1K Views

Need Help ! I Am Able To Train Models From Our Local Machines And Log Everything On The Clearml Server Without Any Issues, The Same Training Gets Stuck When I Use Remote Training. The Logs Do Not Provide Any Useful Information, And The Last Line In The L

Need help ! I am able to train models from our local machines and log everything on the ClearML server without any issues, the same training gets stuck when ...

mlops

one year ago

0 We'Ve Identified A Regression Caused By The Release Of Urllib3 V2, Clearml Sdk 1.10.4Rc1 Is Out And Fixes It, And We'Ll Release The Official Clearml Sdk V1.10.4 Next Week.

@<1523701087100473344:profile|SuccessfulKoala55> Is it a fix for below error which we are getting with new version of clearml server "Starting Task Execution:

Traceback (most recent call last):
File "/home/admin/.clearml/venvs-builds/3.10/lib/python3.10/site-packages/clearml/utilities/requests_toolbelt/_compat.py", line 48, in <module>
from requests.packages.urllib3.contrib import appengine as gaecontrib
ImportError: cannot import name 'appengine' from 'requests.packages.urllib3.contr...

one year ago

0 Need Guide To Install Clearml Server In Complete Disconnected Environment From Internet On Kubernetes Using Helm Chart. We Have Private Registry Harbour Which I Can Use For This. Please Let Me Know What Can Be Done.

@<1523701087100473344:profile|SuccessfulKoala55> Do you think below YML is okay “apiserver:
image:
registry: " harbor.example.com/projectname "
repository: "allegroai/clearml"
pullPolicy: IfNotPresent
tag: "1.10.0-357"
service:
type: ClusterIP
ingress:
enabled: true
hostName: " api.clearml.example.com "
fileserver:
image:
registry: " [harbor.example.com/projectname](http://harbo...

one year ago

0 ..

@<1523701827080556544:profile|JuicyFox94> tlsSecretName for clearML web server , api server and File server all ?? In serving YAML ? I am getting error on pod clearml-serving-inference-6bdb9c757d-ww4vx"for /auth.login"

one year ago

0 ..

@<1523701827080556544:profile|JuicyFox94> I see an bug is open for the same on clearML page None

one year ago

Yes, Just want to know where to provide private registry name when deploying this helm chart for clearML server as well as for its dependent chart like elastic search , mango DB.

one year ago

0 Is It Possible To Serve Model With Frontend Html Page To Allow Input To Be Entered. Something Like Image Upload To Predict Number On It For Minst Dataset

@<1537605940121964544:profile|EnthusiasticShrimp49> Yes

one year ago

0 Clearml Agent Can Work On Gpu Machine In No Internet Environment Where We Have Proxy For Pip Packages And Ubuntu Updates ? If Yes, How To Configure These Proxy In Agent Settings. I See At Launch Of Script Saying "Hello" It Install Many Packages Befor

Script i am running is hello.py with code "from clearml import Task

task = Task.init(project_name="mlops", task_name="Say Hellow")
task.execute_remotely(queue_name="P2000")
print("Hello")" console output " clearml-session --jupyter-lab true --queue P2000 --base-task-id=515159dab92d4baabcb6b3647263a144
clearml-session - CLI for launching JupyterLab / VSCode on a remote machine
Verifying credentials
Use previous queue (resource) 'P2000' [Y]/n? Y

Interactive session config:
{
"base_task_...

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Thanks a lot , it worked !!! However i am getting Error when i open ClearML web application - Fetch tag failed "Error 0 : You can't write against a read only replica." DO you now if this is known issue and fix available for it.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Agent is running outside Kubernetes on a standalone VM running Ubuntu 22.04

one year ago

0 I Am Getting Error When Cloning Tfs Repository With Error . We User Https To Clone And Have No Possibility Yet With Ssh. "Check If Remote-Worker Has Valid Credentials [See Worker Configuration File]." I Have Added Below Config In /Home/Ubuntu-User/Clearm

@<1523701205467926528:profile|AgitatedDove14> Thanks for quick replay. You are correct, issue resolved after removing https.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Yes, I am able to create Clearml task and perform training from same machine. only when i start clearml-session this error coming. Do i need to specia config in clearml.conf file for clearml session to work ? Just to add However when i run this command , it work and execute task but do not give any interative jupyter or code url.
clearml-session --jupyter-lab true --queue P2000 --base-task-id=515159dab92d4baabcb6b3647263a144 , it run the task...

one year ago

0 Is It Possible To Serve Model With Frontend Html Page To Allow Input To Be Entered. Something Like Image Upload To Predict Number On It For Minst Dataset

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Thanks .. I will try it and let you know. I have one more question . I have installed latest version of clearML server and now I see issue with Urllib3 V2 which will fix next week with new releases. How can I install old version with helm chart which is stable and working ?

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> When I use docker I see it go out for NVIDIA , Ubuntu and pip package. I can fix pip via above but what about other NVIDIA and Ubuntu ?

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Yes, We have Load balancer which provide IP to ClearML Server and it is working for all operation like normal task creation , remote training and all but only clearml-session is not working.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> It was blocked on Load balancer and after allowing traffic , it is working. Thanks a lot !!

one year ago

0 Need Help ! I Am Able To Train Models From Our Local Machines And Log Everything On The Clearml Server Without Any Issues, The Same Training Gets Stuck When I Use Remote Training. The Logs Do Not Provide Any Useful Information, And The Last Line In The L

@<1523701087100473344:profile|SuccessfulKoala55> Any idea why it is going to internet only when I run training with PyTorch framework download.PyTorch.org

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Thanks a lot !!! Its fixed after i redeployed container. Could you please help me to fix clearml-session, I am running command clearml-session --jupyter-lab but getting blow error "Launch interactive session [Y]/n? Y
Removing stale interactive sessions
Creating new session
Retrying (Retry(total=237, connect=240, read=237, redirect=240, status=240)) after connection broken by
'ProtocolError('Connection aborted.', ConnectionResetError(10054, 'A...

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> As I mentioned earlier, If I do not specify —base-task-Id than error is as below @Jake command clearml-session --jupyter-lab but getting blow error "Launch interactive session [Y]/n? Y
Removing stale interactive sessions
Creating new session
Retrying (Retry(total=237, connect=240, read=237, redirect=240, status=240)) after connection broken by
'ProtocolError('Connection aborted.', ConnectionResetError(10054, 'An existing connection w...

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> It’s hosted on kubernetes and behind the ingress controller. I use helm char provided on clearML page with ingress set as true. I can access web UI from browser and currently it is on http only.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> It’s on prem server and remote agent . Both remote agent and my machine are in same network and I can ssh agent from my machine. Do we needs to be open others than SSH to make jupyterlab working from my computer to agent or agent to ClearML server ?

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> When I add extra index url , it gives error for certificate and I am not sure where to configure all these settings in agent settings

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> How can I install latest one. Do you have link to refer ?

one year ago

0 Where I Can Find Logs Of Clearml Server Which Should Go Into Elk (Elastic Search ) For Security.

@<1523701070390366208:profile|CostlyOstrich36> I am looking for pod logs and api server logs

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Sorry for delay reply , i have attached the logs and issue is only happening when do ML training with PyTorch. Training with other framework is working fine like tensor flow and sklearn.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> Yes, this is end of logs and nothing happening after it. i am using this command clearml-agent daemon --detached --gpu 0 --queue A40 to launch the agent.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> it works once i allow traffic to download.PyTorch.org from proxy. 🙂

one year ago

Yes , machine is connected to on prem ClearML server.

one year ago

@<1523701087100473344:profile|SuccessfulKoala55> after enabling debug mode below are logs , just to let you know this agent do not have internet and pip packages are installed vis proxy which i can working but for pytorch it seems to going to internet "DEBUG:urllib3.connectionpool: http://api.clearml.domain.com:80 "GET /v2.5/tasks.started HTTP/1.1" 200 353
Executing task id [d3807deae2644e00824e774ff8997eaa]:
repository =
branch =
version_num =
tag =
dock...

one year ago