TimelyPenguin76

Administrator

0 Questions, 711 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Answers 711

0 How Is The New Gui "Setup Shell Script" Option Supposed To Be Used? I Would Like To Add An Apt Install Command To One Of My Tasks, And From Reading The Related Github Issues I Presumed I Could Just Type Those In To This New Textbox In The Tast Exectution-

You can try set_base_docker :

t = Task.init(project_name="examples", task_name="set docker parames") t.set_base_docker( docker_cmd="nvidia/cuda:11.1", docker_arguments="-e ENV=1", docker_setup_bash_script=['apt update', 'apt-get install -y gcc'] )

3 years ago

0 Hi Everybody. I Have Setup Clearml With Some Agents That Execute Experiments In Docker Mode. This Is Working Perfectly. I Am Now Also Trying To Use The Pycharm Plugin To Debug An Issue. Basically I Am Confused If "Remote Debugging" Should Work / Kick In

But, if you like, you can connect a remote interpreter and debug with PyCharm, locally, without clearml-agent

3 years ago

0 Hi Guys! What Can Be Wrong Here And How To Fix It?

Hi ItchyHippopotamus18 , can you try with
torch.save(model_jit, os.path.join(checkpoint_path, f'{epoch_num}_{round(acc_full, 4)}.pt'))?

4 years ago

0 Hi. I'D Like To Try The Gcp Autoscaler.

Hi PanickyMoth78 , thanks for the logs, I think I know the issue, i’m trying to reproduce it my side, keeping you updated about it

2 years ago

0 Looks Like The Website Is Down. When I Try To Create Task, I Get The Following Error

Thanks ImpressionableAlligator9 and MagnificentWorm7 for reporting this, I will double check it

2 years ago

Hi ThickDove42 ,

The SETUP SHELL SCRIPT is the bash script to run at the beginning of the docker before launching the Task itself.

You can just try edit it, for example:

apt update apt-get install -y gcc

3 years ago

0 Should Calling

Hi MammothGoat53 ,

which clearml version are you using? I run the same and all worked as expected (I changed the project_name and the task_name to be 4 chars length)

2 years ago

0 Hi Again, I Am Trying To Make The Aws Autoscaler Work With Ec2 Instances, But It Fails To Setup The Agent In The Machine: The Logs Of The User-Data Script Show That It Fails Updating The Machine (See Below)

agree

E: Could not get lock /var/lib/apt/lists/lock - open (11: Resource temporarily unavailable)Another process is using the lock, can you specify the ami (and region) so I can try to reproduce it?

3 years ago

0 Another Problem Is That When Using Mp.Spawn To Init Distributed Training In Pytorch,

Hi PompousHawk82 . Are you running in parallel the several instances of the same code on the same task?

3 years ago

0 Hi, Could You Provide An Example Of Setting Up Sdk.Storage.Direct_Access?

Hi VictoriousPenguin97

sdk.storage.direct_access is part of the extended support in the paid version.
But I think its not required since ClearML will simply try to access the path directly as it is, and you don’t need to configure it.

3 years ago

0 Hi, I Know There Is Cleanup Service To Delete Archived Experiments Periodically. For The Next Version, Can You Edit Ui So That We Can Delete Experiments That We Want To Delete. Because, I Want To Save Some Of Old Experiments In The Archive Folder.

Hi MysteriousBee56 ,

The https://github.com/allegroai/trains/blob/master/examples/services/cleanup/cleanup_service.py is an example how you can add services to manage your experiments.
You can change the criteria for fetching the tasks in this script (in the https://github.com/allegroai/trains/blob/master/examples/services/cleanup/cleanup_service.py#L72 call) to something like a specific tag you can add to the experiments ( delete tag?, you can add tag to multi tasks) and it should...

4 years ago

0 There Is Some Specificity With The Way We Setup Our Environment At My Company That Prevents Me From Using The Full Features Of

and using https://github.com/allegroai/clearml-agent/blob/21c4857795e6392a848b296ceb5480aca5f98e4b/docs/clearml.conf#L140 for running scripts at docker startup

3 years ago

0 Hi, I Have Started To Receive The Following Error Message:

Hi ArrogantBlackbird16 ,

How do you generate and run your tasks? Do you use the same flow as in the https://clear.ml/docs/latest/docs/fundamentals/agents_and_queues#agent-and-queue-workflow ? Some other automation?

3 years ago

0 I’M Running The Clearml Agent In Standalone Mode But It Still Installs The Virtual Env, Is That The Expected Behavior?

Hi LazyTurkey38 ,

Yes, it will create a virtual env for the task

3 years ago

0 I Am Using

You can add a limitation to the query page size:
task_filter = {"page_size": <your-limit>, "page": 0}
what do you think?

3 years ago

0 Hi, How Can I Get Programmatically A Task'S Logged Stderr Stream?

Hi ImmensePenguin78 ,

You can get all the console outputs using task.get_reported_console_output() . can this do the trick?

3 years ago

0 Can I Upload Files That Are Created On The Remote Machine? So Far It Just Get Stuck (The Files Are Small 10 Kb Or So) I Have A Script That Creates A Folder With Some Files, Some Of Them I Want To Keep, So I Tried Uploading To Storage, No Luck.

Hi GleamingGiraffe20 ,

Without adding Task.init , i’m getting some OSError: [Errno 9] Bad file descriptor error, do you get those too?

Do you run your script from CLI or IDE (pycharm maybe?)?

4 years ago

0 Hi, I'M Trying Out The

which docker image do you use? can you try pulling the image manually?

3 years ago

0 Hi Allegroal Team, I Recently Saw Your Project And Tried It. This Is A Great Tool! I Have A Question To Ask:

FierceFly22 like Elior wrote, you can use Task.execute_remotely , just need to supply the queue name 🙂

4 years ago

0 Hi, I'M Trying To Run A Pipeline Object Using Clearml Agent On Mac Os And It Fails. I See In The Error I Get That It Doesn'T Succeed To Install The Relevant Packages. When I Run Each Script Separately, It Works Well; And When I Run The Pipeline In Linux,

Hi ShinyLobster84 , where do you usually install XXXXX package from? or some artifactory?

3 years ago

0 Hi, Thank You For This Great Tool. We Have An Issue When Resuming A Task When A Machine Is Being Preempted. In Some Cases (Not Always), The Scalars Are Not Logged Properly Anymore When Resuming The Task. The State Still Is Changed To Resume, But The Scala

Hi DefiantShark80 ,

task.report_scalar() # does not always workwhat do you mean? report_scalar not sending the info or raising an error?

3 years ago

0 Hi, Every One, What Is The Algorithm To Schedule A Task After Pulling It From A Queue In Agent? I Am Using Clearml Open Source. Thanks In Advance.

Hi FloppyDeer99 ,

It depends on you setup:
if you have on prem machines, you can start more than one clearml-agent on the machine with the resources and assign for example each gpu on the machine to a https://clear.ml/docs/latest/docs/clearml_agent#docker-mode . You can have the same for cloud machine, and if you are using the AWS you can run the https://clear.ml/docs/latest/docs/guides/services/aws_autoscaler/ as a service. K8S: there is a great example for k8s glue https://github.com/...

3 years ago

0 Hi, Every One, What Is The Algorithm To Schedule A Task After Pulling It From A Queue In Agent? I Am Using Clearml Open Source. Thanks In Advance.

Sure, with clearml and clearml-agent you get autoscaling for your machines (with monitoring) and automation for your tasks that will handle all for you (docker images, manage the credentials …).
There are much more parts in the system, so maybe you can share a use case so I can help you with it?

3 years ago

0 Hey! Probably Missed Something, But I Recently Upgraded To 1.1.1, And I'Ve Just Noticed That Png Artifacts Are Not Displayed In The Preview. Is This Intentional? Edit: Ah, It Has To Be A

Hi UnevenDolphin73 , Do you get any preview run this example - https://github.com/allegroai/clearml/blob/master/examples/reporting/artifacts.py ?

3 years ago

0 Wondering Why My Plots Are Not In Plot Section But Under Debug Section

PompousParrot44 since imshow display data as an image, we currently log it into the debug samples section as this is the section natively used to display and interact with images.

Do you think it should be under plots?

4 years ago

0 Hi All, I Am Trying To Execute Somewhat Custom Hpo Scheme With Clearml. I Would Want That A Single Running Python Script Will Be Able To Sample The Optimizer, Init A Task And Report The Result Multiple Times. I Didn'T Find Anything Similar In The Docs Or

Checking that for you

3 years ago

0 I Started Two Pipelines (Using Aws Autoscaler In App.Clear.Ml ). The Pipelines Ran Concurrently, Using The Same Pipeline Code. Both Failed In The Same Component Half-Way Though The Pipeline Run With:

Hey PanickyMoth78 ,

Regarding
clearml.utilities.locks.exceptions.LockException: [Errno 11] Resource temporarily unavailableI read a bit https://bugs.python.org/issue43743 , can you try the suggested workaround (just for the check)?

adding

import shutil shutil._USE_CP_SENDFILE = Falseon top?

2 years ago

0 There Is Some Specificity With The Way We Setup Our Environment At My Company That Prevents Me From Using The Full Features Of

Hi LazyTurkey38 ,

Basically, when the agent clones the repo and switches into the dev, I want it to run a script that is in the repo that:
Installs some pip dependencies (I kind of want these regardless of any extra deps specific to the task).

what about adding all your packages to the task? can this help? you can do it with Task.add_requirements(package_name, version)

Generate some

py

files/symlinks. This is needed when you use custom C extensions / ANTLR ...

3 years ago

0 There Is Some Specificity With The Way We Setup Our Environment At My Company That Prevents Me From Using The Full Features Of

So just after the clone, before creating the env?

3 years ago

0 Hi, Every One, What Is The Algorithm To Schedule A Task After Pulling It From A Queue In Agent? I Am Using Clearml Open Source. Thanks In Advance.

Hi FloppyDeer99 ,

In other words, docs introduce that ClearML Open Source supports orchestration, how can I found the relating codes?

You can find many examples https://clear.ml/docs/latest/docs/getting_started/mlops/mlops_first_steps/ , if you have a specific use case you want to check, please share and I can send an example of it.

And what the role of clearml-agent in orchestration, a combination of kube-scheduler and kubelet?

ClearML agent is an ML-Ops tool for users to r...

3 years ago

Show more results