The problem is that, due to tight security on this k8s cluster, the k8s pod cannot reach the public file server URL associated with the dataset.
Understood, that makes sense. If that's the case, then the path_substitution feature is exactly what you are looking for.
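For reference, a rough sketch of the clearml.conf entry (the two prefixes are placeholders for your actual URL and mount):
sdk {
    storage {
        path_substitution = [
            {
                # the URL prefix as registered on the dataset/task
                registered_prefix = "https://files.public-server.example/datasets"
                # a prefix the pod can actually reach (internal mirror / mount)
                local_prefix = "file:///mnt/shared/datasets"
            }
        ]
    }
}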
Hi GrievingTurkey78
Can you test with the latest clearml-agent RC? (I remember a fix just for that)
pip install clearml-agent==1.2.0rc0
Right! I just noticed that! This is odd... and yes, it definitely has something to do with the multi-pipeline being executed on the agent. I think I know what to look for ...
(just making sure (again), running_locally produced exactly what we were expecting, is that correct?)
No worries, I'll see if I can replicate it anyhow
Hi PompousBeetle71
Try this one, let me know if it helped:
import logging
logging.getLogger('trains.frameworks').setLevel(logging.ERROR)
What's the trains-server version ?
Yes! Thanks so much for the quick turnaround
My pleasure 🙂
BTW: did you see this (it seems like the same bug?!)
https://github.com/allegroai/clearml-helm-charts/blob/0871e7383130411694482468c228c987b0f47753/charts/clearml-agent/templates/agentk8sglue-configmap.yaml#L14
So if I do this in my local repo, will it mess up my git state, or should I do it in a fresh directory?
It will install everything fresh into the target folder (including venv and code + uncommitted changes)
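For reference, a minimal sketch of that flow (the task ID and target folder are placeholders):
clearml-agent build --id aabbcc112233 --target /path/to/fresh_folder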
The file is never touched; nowhere in the process is that file deleted.
It should never have gotten there, as this is not the git repo folder, it's one level above...
Check the examples on the github page, I think this is what you are looking for 🙂
https://github.com/allegroai/trains-agent#running-the-trains-agent
- Yes, Task.init should be called in each subprocess (because torch forks them before they are patched); see the sketch after this list
- I think the main issue is that we patch argparse on the subprocess (this is assuming you did not manually parse non-argv arguments)
- If you can create a mock test I think we can work around the issue, as long as the way you spin it up is the standard PyTorch distributed way
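A minimal sketch of the first point (project/task names are placeholders, and I'm assuming spawn-style workers):
import torch.multiprocessing as mp
from clearml import Task

def worker(rank):
    # each spawned subprocess calls Task.init itself, since the workers
    # are created before ClearML's framework patching kicks in
    Task.init(project_name='examples', task_name='distributed-train')
    # ... training code for this rank ...

if __name__ == '__main__':
    mp.spawn(worker, nprocs=2)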
CooperativeFox72
Could you try to run the docker container, and then inside the docker try to do:
su root
whoami
WackyRabbit7
Long story short, yes, only by name (hashing might be too slow on large files)
The easiest solution: if the hash is incorrect, delete the local copy it returns and ask again; it will re-download it.
I'm not sure if the hashing is exposed, but if it is not, we can add it.
What do you think?
But this will require some code changes...
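Something along these lines (a rough sketch; remote_url and expected_md5 are placeholders, and md5 is just an example hash):
import hashlib
import os

from clearml import StorageManager

local_path = StorageManager.get_local_copy(remote_url=remote_url)
with open(local_path, 'rb') as f:
    if hashlib.md5(f.read()).hexdigest() != expected_md5:
        os.remove(local_path)  # drop the stale cached copy
        # asking again triggers a fresh download into the cache
        local_path = StorageManager.get_local_copy(remote_url=remote_url)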
In the main pipeline I want to work with the secondary pipeline and other functions decorated with PipelineDecorator. Does ClearML allow this? I have not been able to get it to work.
Usually when we think about pipelines of pipelines, the nested pipeline is just another Task you are running in the DAG (where the target queue is the services queue).
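Roughly along these lines (the controller task ID and queue name are placeholders):
from clearml import Task
from clearml.automation.controller import PipelineDecorator

@PipelineDecorator.component(return_values=['inner_task_id'])
def run_inner_pipeline():
    # the nested pipeline is just another Task in the DAG:
    # clone its controller and enqueue it on the services queue
    inner = Task.clone(source_task='inner_pipeline_controller_id')
    Task.enqueue(inner, queue_name='services')
    inner.wait_for_status()  # block this step until the nested run finishes
    return inner.id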
When you say nested pipelines with decorators, what exactly do you have in mind ?
HealthyStarfish45 you mean as in a REST API ?
but I cannot compare between them
I think we noticed it, and this will be fixed in the next server update (again, some plotly.js issue there)
You can try calling:
task._update_repository()
I'm still trying to figure out how to reproduce it...
Specifically for model files, if you set Task.init(..., output_uri=True) it will automatically upload any saved model to the files server (you can also point it to any object storage / shared folder).
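For example (project/task names are placeholders):
from clearml import Task

# output_uri=True uploads saved models to the default files server;
# an object-storage URI such as 's3://my-bucket/models' works as well
task = Task.init(project_name='examples', task_name='train', output_uri=True)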
What's the framework you are using ?
Merged, is it working for you now?
Are you sure it's the trains-server and not the trains package (i.e. the backend)?
Hi EmbarrassedSpider34
clearml-init will try to create ~/clearml.conf
I'm assuming that when you execute under root it is resolved to /root/clearml.conf
That said, you might be able to override it with:
sudo CLEARML_CONFIG_FILE=$HOME/clearml.conf clearml-init
Thanks @<1523701868901961728:profile|ReassuredTiger98>
From the log this is what conda is installing, it should have worked
/tmp/conda_env1991w09m.yml:
channels:
- defaults
- conda-forge
- pytorch
dependencies:
- blas~=1.0
- bzip2~=1.0.8
- ca-certificates~=2020.10.14
- certifi~=2020.6.20
- cloudpickle~=1.6.0
- cudatoolkit~=11.1.1
- cycler~=0.10.0
- cytoolz~=0.11.0
- dask-core~=2021.2.0
- decorator~=4.4.2
- ffmpeg~=4.3
- freetype~=2.10.4
- gmp~=6.2.1
- gnutls~=3.6.13
- imageio~=2.9.0
-...
Correct, you can pass it as keys on the task_filter argument, e.g.:
Task.get_tasks(..., task_filter={'status': ['failed']})
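A fuller runnable version (the project name is a placeholder):
from clearml import Task

failed_tasks = Task.get_tasks(
    project_name='examples',
    task_filter={'status': ['failed']},
)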