Setting max_workers to 1 prevents the error (but, I assume, it may come at the cost of slower sequential uploads).
This seems like a question for Google Cloud Storage; maybe we should open an issue there, since their backend does the rate limiting
My main concern now is that this may happen within a pipeline leading to unreliable data handling.
I'm assuming the pipeline code will have max_workers, but maybe we could have a configuration value so that we can set it across all workers, wdyt?
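For context, a minimal sketch of the kind of call being discussed (assuming the upload is Dataset.upload() and that your clearml version exposes the max_workers argument; paths and names are placeholders):
from clearml import Dataset

ds = Dataset.create(dataset_project='examples', dataset_name='my_dataset')
ds.add_files('/path/to/local/data')
# max_workers=1 forces sequential uploads, avoiding the rate-limit error
# at the cost of slower uploads
ds.upload(max_workers=1)
ds.finalize()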
If
...
but the debug samples and monitored performance metric show a different count
Hmm, could you expand on what you are getting, and what you are expecting to get?
Hi HappyLion37
It seems that you are "reusing" the Tasks, which means the second time you open them you are essentially resetting the old run and starting all over.
Try to do:
task1 = Task.init('examples', 'step one', reuse_last_task_id=False)
print('do stuff')
task1.close()
task2 = Task.init('examples', 'step two', reuse_last_task_id=False)
print('do some more stuff')
task2.close()
there is almost zero overhead if your docker container already has everything (including the agent) preinstalled and you set it with CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1
it then should basically just run the code.
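In docker mode, one way to make sure that variable reaches the agent running inside the container (a sketch, assuming the standard clearml.conf on the agent machine) is via the extra docker arguments:
agent {
    # inject the env var into every container the agent spins up
    extra_docker_arguments: ["-e", "CLEARML_AGENT_SKIP_PYTHON_ENV_INSTALL=1"]
}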
Can clearml-serving do helm install or upgrade?
Not sure I follow, how would a helm chart install be part of the ML running? I mean clearml-serving is installed via helm chart, but this is a "one time" setup, i.e. you install clearml-serving and then you can send models to be served there via CLI / python. This is not a "deployed per model" scenario, but a single deployment for multiple models, dynamically loaded.
SmallBluewhale13 in your code, what are you getting when you print the version?
from clearml import __version__
print(__version__)
I’m not sure if https will work because I want to use ssh keys for creds.
BTW: I was not aware GitHub provides a PyPI-like artifactory, do they?
Regarding SSH keys, they are passed from the host machine (i.e. in venv mode it will use the SSH keys of the user running the agent, and in docker mode they are automatically mapped into the container)
HugeArcticwolf77 you can add --services-mode to the agent, and it will basically keep on spinning Tasks in parallel (unfortunately the open source version does not include a way to limit it to a maximum number of concurrent Tasks)
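For example, something along these lines (the queue name here is just a placeholder):
clearml-agent daemon --services-mode --queue services --docker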
Hi MysteriousBee56 ,
Yes, this is a permissions issue: the docker creates all folders as root (since it is the root user running inside the docker). Then when you execute in venv mode, you are running it from your own user, which obviously cannot change root-created folders.
and the agent's default runtime mode is docker, correct?
Actually the default is venv mode; to run in docker mode add --docker to the command line
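For example (the image name here is only an illustration, any image the agent machine can pull should work):
clearml-agent daemon --queue default --docker nvidia/cuda:11.8.0-runtime-ubuntu22.04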
So I could install all my system dependencies in my own docker image?
Correct, inside the docker it will inherit all the preinstalled packages, but it will also install any missing ones (based on the Task requirements, i.e. the "installed packages" section)
Also, what is the purpose of the aws block in the clearml.c...
Is there any documentation on versioning for Datasets?
You mean how to select the version name ?
Hi EnviousStarfish54
I think this is what you are after
task.connect_configuration(my_dict_here, name='my_section_name')
BTW:
if you do task.connect(a_flat_dict, name='new section') you will have the key/value in a section name called "new section"
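Putting the two together, a minimal sketch (project/task names and values here are placeholders):
from clearml import Task

task = Task.init(project_name='examples', task_name='config sections')
# configuration object stored under its own named configuration section
task.connect_configuration({'lr': 0.001, 'batch_size': 32}, name='my_section_name')
# flat dict connected as hyperparameters under the "new section" section
params = task.connect({'epochs': 10, 'optimizer': 'adam'}, name='new section')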
Sadly no 😞
(I mean you could quickly write a reader for TB and report it, but it is not built into the SDK)
Are you running the agent in docker mode? or venv mode ?
Can you manually ssh on port 10022 to the remote agent's machine?
ssh -p 10022 root@agent_ip_here
MinuteGiraffe30 if you are running the following command while your current directory is where you code is, what are you getting?
$ git ls-remote --get-url origin
Hi @<1631102016807768064:profile|ZanySealion18>
ClearML doesn't pick up model checkpoints automatically.
What's the framework you are using?
BTW:
Task.add_requirements("requirements.txt")
if you want to specify just your requirements.txt, do not use add_requirements; use:
Task.force_requirements_env_freeze(requirements_file="requirements.txt")
(add_requirements with a file name does the same thing, but this is more readable)
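A minimal sketch of where that call goes (it has to run before Task.init(); the project/task names are placeholders):
from clearml import Task

# log only what is listed in requirements.txt instead of a full pip freeze
Task.force_requirements_env_freeze(requirements_file='requirements.txt')
task = Task.init(project_name='examples', task_name='pinned requirements')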
AdventurousButterfly15
Despite having manually installed this torch version, during task execution the agent still tries to install it somehow and fails:
Are you running the agent in venv mode? or docker mode?
Notice that in docker mode it inherits the python packages from the container, and adds/reinstalls missing packages. In venv mode it creates a new clean venv (there is no way to inherit a venv, a venv can only inherit from system-wide installed packages)
The idea is that you cannot e...
Yes, experiments are standalone as they do not have to have any connecting thread.
When would you say a new "run" vs a new "experiment" ? when you change a parameter ? change data ? change code ?
If you want to "bucket them" use projects 🙂 it is probably the easiest now that we have support for nested projects.
Hi @<1523701304709353472:profile|OddShrimp85>
You mean something like clearml-serving ?
None
Can you copy the "Installed Packages" here, and point to the package causing the issue?
Hmm that is a good question, are you mounting the clearml.conf somehow ?
How can I find the queue name?
You can generate as many as you like; the default one is called "default", but you can add new queues in the UI (go to the Workers & Queues page, then Queues, and click "+ New Queue")
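Once the queue exists you can point an agent at it and enqueue Tasks to it, for example (the queue name here is just a placeholder):
clearml-agent daemon --queue my_new_queue
and from code:
task.execute_remotely(queue_name='my_new_queue')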
MysteriousBee56 that is very strange, but it definitely explains it, kudos on debugging it !!!
FYI all the git pulls are cached even in docker mode so there is no "tax" to pay for pulling the sub-modules (only the first time of course)
Hi ClumsyElephant70
What's the clearml version you are using?
(The first error is a byproduct of a python process.Event being created before the forkserver is created, some internal python issue. I thought it was solved, let me take a look at the code you attached)
AstonishingSeaturtle47 yes it does. But I have to ask, how come you have submodules for which one has credentials for the master repo but not for the sub ones? Also, it sounds like a good solution would be for the trains-agent to try to pull the sub-modules, and if it cannot, just print a warning and continue. What do you think?