Hi @<1559711623147425792:profile|PlainPelican41> , you can re-run an existing pipeline using different parameters from the UI. Otherwise, you need to create new pipelines with new code 🙂
I guess that's a good point, but it's really only applicable if your training is CPU intensive. If your training is GPU intensive, most of the load goes to the GPU, so running on a VM (EC2 instances, for example) shouldn't make much of a difference - but this is worth testing.
I found this article talking about performance
https://blog.equinix.com/blog/2022/01/04/3-reasons-why-you-should-consider-running-containers-on-bare-metal/
But it doesn't really say what the difference in performance is...
@<1556812486840160256:profile|SuccessfulRaven86> , I think this is because you don't have the proper permissions 🙂
Hi @<1523701260895653888:profile|QuaintJellyfish58> , can you please provide a standalone snippet that reproduces this?
from src.net import Classifier
ModuleNotFoundError: No module named 'src'
Hi @<1632913959445073920:profile|IratePigeon23> , please look at the following thread - None
That's a nice example of using the API. Once you've handled the login issues, you can use the web UI as a reference for the API (open dev tools with F12 to see what the UI sends to the backend).
Let me know if this helps 🙂
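For example, here's a minimal sketch using the Python APIClient (assuming your credentials are already configured in clearml.conf; the filters are just placeholders mirroring what you'd see in dev tools):

from clearml.backend_api.session.client import APIClient

client = APIClient()
# Same kind of call the UI makes when listing experiments
tasks = client.tasks.get_all(status=["completed"], page=0, page_size=10)
for t in tasks:
    print(t.id, t.name, t.status)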
Hi AbruptWorm50 ,
After cloning the experiment you can actually edit the installed packages and specify which package version you want.
You can also do this via code using this method:
https://clear.ml/docs/latest/docs/references/sdk/task#taskadd_requirements
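For example, a quick sketch (the package name, version and project/task names are just placeholders):

from clearml import Task

# add_requirements() must be called before Task.init() for it to take effect
Task.add_requirements("tensorflow", "2.4.0")
task = Task.init(project_name="my_project", task_name="my_task")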
Hi @<1570583237065969664:profile|AdorableCrocodile14> , is it possible you have some models inside?
Pending means it is enqueued. Check which queue it belongs to by looking at the info tab after clicking on the task :)
With the autoscaler it's also easier to configure a large variety of different compute resources. Although if you're only interested in p4-equivalent instances and need them available quickly on demand, I can understand the issue.
Hi @<1632913939241111552:profile|HighRaccoon77> , the most 'basic' solution would be adding a piece of code at the end of your script to shut down the machine, but obviously that would be unpleasant when running locally without Task.execute_remotely()
- None
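Something along these lines might work (just a sketch; it assumes the agent runs with permission to shut the machine down, and the queue and project/task names are placeholders):

import subprocess
from clearml import Task

task = Task.init(project_name="my_project", task_name="my_task")
# Everything below this call runs on the agent machine, not locally
task.execute_remotely(queue_name="default")

# ... training code ...

# Only power off when we're actually on the remote machine
if not Task.running_locally():
    subprocess.call(["sudo", "shutdown", "-h", "now"])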
Are you specifically using Sagemaker? Do you have any api interface you could work with to manipulate shutdown of machines?
I guess you could probably introduce some code into the clearml agent as a configuration in clearml.conf
or even as a flag in the CLI that would send a shutdown command to the machine once the agent finishes running a job
Maybe even make a PR out of it if you want 🙂
How are you launching the agents?
BTW, considering the lower costs of EC2, you could always use longer timeout times for the autoscaler to ensure better availability of machines
Keeping machines up for a longer time at a fairly low cost (especially if you're using spot instances)
Any specific reason not to use the autoscaler? I would imagine it would be even more cost effective
And you use the agent to set up the environment for the experiment to run?
What version of clearml / clearml-agent are you using? Are you running in docker mode? Can you add your agent command here?
Can you compare the installed packages between the original experiment to the cloned one? Do you see anything special or different between the two?
VexedCat68 Hi 🙂
Please try with pip install clearml==1.1.4rc0
Hi @<1543766544847212544:profile|SorePelican79> , ClearML can certainly do that. For this you have the Datasets feature.
None
This will allow you to version and track your data super easily 🙂
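In code it looks roughly like this (a sketch; the dataset/project names and paths are placeholders):

from clearml import Dataset

# Create a new dataset version and upload the files
ds = Dataset.create(dataset_name="my_dataset", dataset_project="my_project")
ds.add_files(path="data/")
ds.upload()
ds.finalize()

# Later, fetch a local copy from anywhere
local_path = Dataset.get(dataset_name="my_dataset", dataset_project="my_project").get_local_copy()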
Why go into the environment variable and not just state it directly?
from clearml import Task

task = Task.init(
    project_name="my_project",
    task_name="my_task",
    output_uri="..."  # your storage URI goes here
)
Hi @<1523721697604145152:profile|YummyWhale40> , what if you specify the output_uri through the code in Task.init()?
Hi @<1688721797135994880:profile|ThoughtfulPeacock83> , can you add a standalone script that reproduces this?
That's a good question. If you're not running in docker mode, the agent machine that runs the experiment needs to have CUDA/cuDNN installed. If you're running in docker mode, you need to select a docker image that already has them installed 🙂
I think the serving engine ip depends on how you set it up
JitteryCoyote63 , heya, yes it is :)
You can save the entire folder as an artifact.
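For example, a rough sketch (names and paths are placeholders):

from clearml import Task

task = Task.init(project_name="my_project", task_name="my_task")
# Passing a folder path packs the whole directory and uploads it as a single artifact
task.upload_artifact(name="my_folder", artifact_object="path/to/folder")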
UnevenDolphin73 , that's an interesting case. I'll see if I can reproduce it as well. Can you please clarify step 4 a bit? And on step 5, what is "holding" it from spinning down?
Hi @<1523701083040387072:profile|UnevenDolphin73> , looping in @<1523701435869433856:profile|SmugDolphin23> & @<1523701087100473344:profile|SuccessfulKoala55> for visibility 🙂