Hi StrongHorse8,
Yes, each ClearML agent can listen to a different queue and use a specific GPU. You can view all the use cases and examples in this link: https://clear.ml/docs/latest/docs/clearml_agent/#allocating-resources
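For example, something like this should start two agents, each bound to its own queue and GPU (the queue names and GPU indices here are just placeholders):
```
clearml-agent daemon --detached --queue single_gpu_queue --gpus 0
clearml-agent daemon --detached --queue dual_gpu_queue --gpus 1,2
```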
Hi GleamingGiraffe20 , still getting those errors?
Not the parameters, but maybe this can help: https://clear.ml/docs/latest/docs/clearml_data
Can you send me the logs with and without? (you can send the logs in DM if you prefer)
Hi DeliciousBluewhale87
So now you don’t have any failures, but a GPU usage issue? How about running the ClearML agent in docker mode? You can choose an NVIDIA docker image, and all the CUDA installation and configuration will be part of the image.
What do you think?
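For example, something in this spirit should run the agent in docker mode (the image name here is just an example, pick whatever CUDA image fits your setup):
```
clearml-agent daemon --queue default --docker nvidia/cuda:10.2-cudnn7-runtime-ubuntu18.04
```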
👍 great, so if you have an image with the clearml agent installed, it should solve it 😀
Hi WickedBee96 ,
Are you running a standalone script or some code part of a git repository?
NonchalantDeer14 thanks for the logs, do you maybe have some toy example I can run to reproduce this issue on my side?
For the 'TRAINS Monitor: Could not detect iteration reporting, falling back to iterations as seconds-from-start' message - iteration reporting is automatically detected if you are using TensorBoard, matplotlib, or explicitly with trains.Logger.
Assuming there were no reports, the monitoring falls back to reporting every 30 seconds.
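For reference, a minimal sketch of explicit reporting (project/task names and values are placeholders) - reporting a scalar with an explicit iteration number is enough for the monitor to pick up the iterations:
```
from trains import Task

task = Task.init(project_name="examples", task_name="explicit iteration reporting")
logger = task.get_logger()
for iteration in range(100):
    # reporting with an explicit iteration lets the monitor detect iteration progress
    logger.report_scalar(title="loss", series="train", value=1.0 / (iteration + 1), iteration=iteration)
```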
Thanks for the examples, will try to reproduce it now.
Hi PanickyMoth78, thanks for the logs. I think I know the issue, I’m trying to reproduce it on my side and I’ll keep you updated about it.
can you build your own docker image with clearml-agent installed in it?
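Something like this minimal Dockerfile sketch should do (the base image is just an example, use whatever CUDA/python base you need):
```
FROM nvidia/cuda:10.2-cudnn7-runtime-ubuntu18.04
RUN apt-get update && apt-get install -y python3 python3-pip git && rm -rf /var/lib/apt/lists/*
RUN python3 -m pip install --no-cache-dir clearml-agent
```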
Hi HealthyStarfish45
If you are running the task via docker, we don’t auto-detect the image and docker command, but you have more than one way to set those:
- You can set the docker manually, like you suggested.
- You can configure the docker image + commands in your ~/trains.conf (on the machine running the agent): https://github.com/allegroai/trains-agent/blob/master/docs/trains.conf#L130 (see the sketch below)
- You can start the agent with the image you want to run with.
- You can change the base docker image...
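For the trains.conf option, a minimal sketch of the relevant section (the image name is just an example):
```
agent {
    default_docker {
        # default docker image to use when running in docker mode
        image: "nvidia/cuda:10.2-cudnn7-runtime-ubuntu18.04"
        # optional arguments to pass to the docker run command
        # arguments: ["--ipc=host"]
    }
}
```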
Try to clone the task (right click on the task and choose “clone”) and you will get a new task in draft mode that you can configure ( https://clear.ml/docs/latest/docs/getting_started/mlops/mlops_first_steps#clone-an-experiment )
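You can also do the same from code, roughly like this (the task ID and queue name are placeholders):
```
from clearml import Task

# clone an existing task and enqueue the draft copy for an agent to run
template = Task.get_task(task_id="<source task id>")
cloned = Task.clone(source_task=template, name="cloned experiment")
Task.enqueue(cloned, queue_name="default")
```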
I will check the AWS token. Just to verify, did you import the StorageManager after the os.environ calls?
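i.e. something in this spirit (the env var names here are just an example for AWS credentials):
```
import os

# set the credentials before importing StorageManager
os.environ["AWS_ACCESS_KEY_ID"] = "<your key>"
os.environ["AWS_SECRET_ACCESS_KEY"] = "<your secret>"

from clearml import StorageManager  # imported only after the environment is set
```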
Do you inherit from SearchStrategy in your implementation (you can read about it here: https://allegro.ai/docs/automation_optimization_searchstrategy.html#automation-optimization-searchstrategy )? If not, can you share how you implemented it?
About the docstring, thanks 🙂 we will update it with the exceptions.
Hi MotionlessCoral18, can you check the configuration you added under your profile? Does the bucket entry contain the HOST (ENDPOINT) section?
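Something in this spirit (bucket name, endpoint and keys are placeholders):
```
sdk {
    aws {
        s3 {
            credentials: [
                {
                    bucket: "my-bucket"
                    host: "my-endpoint.example.com:9000"
                    key: "<access key>"
                    secret: "<secret key>"
                }
            ]
        }
    }
}
```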
When you are not using the StorageManager, you don’t get the OSError: [Errno 9] Bad file descriptor errors?
HelpfulHare30 are you running it from a repository?
How do you load the file? Can you find this file manually?
Hi LazyFish41 , You can specify the pip version in the agent’s configuration file: https://github.com/allegroai/clearml-agent/blob/master/docs/clearml.conf#L57
The ClearML agent will install that pip version for you.
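e.g. in the agent section of the conf file (the version spec here is just an example):
```
agent {
    package_manager {
        # specify the pip version to use, e.g. pin below 20.2
        pip_version: "<20.2"
    }
}
```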
So something like https://github.com/allegroai/clearml/blob/master/examples/reporting/pandas_reporting.py#L28 but with a multi-index?
Was wondering if there are plans to add better support for it
Not currently. Can you open an issue at https://github.com/allegroai/clearml/issues/new so we do not forget to add it?
Hi LazyFish41 ,
You can use agent.docker_init_bash_script to execute any command at the startup of the docker container, so you can use it to install the python version you want.
You can set the python version to use when creating the virtual environment and launching the experiment with agent.python_binary (see the sketch below).
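A sketch of how these two could look in the conf file (python 3.8 here is just an example):
```
agent {
    # bash commands executed at the startup of any docker container
    docker_init_bash_script: [
        "apt-get update",
        "apt-get install -y python3.8 python3.8-distutils"
    ]
    # python binary used when creating the virtual environment
    python_binary: "/usr/bin/python3.8"
}
```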
Hi TenseOstrich47 ,
Try using AWS credentials with the region too: https://github.com/allegroai/clearml/blob/master/docs/clearml.conf#L88
credentials: [
    # specifies key/secret credentials to use when handling s3 urls (read or write)
    {
        bucket: "my-bucket-name"
        key: "my-access-key"
        secret: "my-secret-key"
        region: "my-region"
    },
]