GrievingTurkey78

34 Questions, 125 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

119 × Eureka!

Answers 125

0 Hi, I Was Getting A Really Weird Error Due To Mismatch On The Versions Between The Installed Libraries In My Environment And The Ones Ran In The Node (I Manually Changed The Installed Packages And Everything Worked). How Can I Force Trains To Use Exactly

Pigar is capturing different versions that the ones I have installed on my local machine (not a problem except for one). I just want to force the version of that package in a way that I don’t have to manually change it from the UI for every experiment.

4 years ago

0 Hi! Is There Something Happening With The

This works:
filepath = self.log_dir + os.sep + "checkpoint" self.callbacks.append( ModelCheckpoint( filepath, monitor="val_loss", mode="min", save_best_only=True, save_weights_only=True, ) )And this doesn’t:
` filepath = self.log_dir + os.sep + "checkpoint.hdf5"
self.callbacks.append(
ModelCheckpoint(
filepath,
...

3 years ago

0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

Yes! I think thats what I will do 👌 Let me know if there is a way to contribute a mode to keep logging off. We just don’t want to pollute the server when debugging.

3 years ago

0 Hi! Is There Something Happening With The

AgitatedDove14 Thanks! Im trying to figure out how to create a minimum working example! I am also working with Hydra so that may be a thing. The extension is whats causing it to fail (haven’t figured out why).

3 years ago

0 Hi! If I Have A Pipeline On Gitlab That Uses Clearml For Some Tests Is There Some Way To Setup The Credentials So That It Doesn’T Fail?

I need to fetch a dataset for some simple tests but since it doesn’t have credentials to the self-hosted server it wont find the dataset

3 years ago

0 Hi

I configured a firewall rule that opened the ports for the instance (not 100% sure if this is the right way) using network tags. Yes, the whole screen is black and no trains logo show up: Safari can’t open the page because the server where this page is located isn’t responding.

4 years ago

0 Hi! What Would Be The Way For Manually Uploading A Model? I Have Intermediate

Thanks AgitatedDove14 !

3 years ago

0 I Am Trying To Upgrade From Clearml Server 0.16 To The Newest Version But I Am Getting Some Errors When Spinning Up The New Containers:

Yes AgitatedDove14 ! I’ll PM you

3 years ago

0 Hi! I Am Currently Using Hydra+Clearml And Wanted To Know If There Are Still Some Updates Coming. At The Moment, If I Change The Defaults Hydra Uses From The

Sure! I enqueue the experiment from my local machine:
python -m src.train model=my_model loss=my_loss dataset=my_dataset

Then I go to the server and run the experiment and create a copy to run with a new model. On the copy, I go to the script path and modify it to be:
-m src.train model=my_other_model loss=my_loss dataset=my_dataset

The new experiment, even though the script path has my_new_model default, starts training using my_model .

I can also see ...

3 years ago

0 Hi! I Am Saving Some Intermediate

Hi CostlyOstrich36 ! The message is the following:
clearml.model - INFO - Selected model id: 27c1a1700b0b4e25a4344dc4ef9868faThey are not models, those are intermediate tensors I am caching to make training faster. I don't need to log them.

3 years ago

0 Hi! I Have The Previous Trains Server Configured With Multiple Experiments; I Created It Using The Gcloud Images Provided. If I Want To Update The Server To The Newest Clearml Version Should I Follow These Steps

Thanks AgitatedDove14

3 years ago

0 Hi

Also, should I allow 8080 , 8008 , and 8081 on ingress and egress on GCP or is only egress enough?

4 years ago

0 Hi! I Changed From Trains To Clearml And Ran Some Experiments Using Keras But It Seems The Metrics Are Not Being Tracked Automagically, Has Anyone Ran Into The Same Issue? I Can Even See The Metrics On The Progress Bar During The Fit Process.

Thanks TimelyPenguin76 , the example works fine! I’ll debug further on my side!

3 years ago

0 I Am Also Experiencing A Weird Behaviour When Running A Script Using The Module Flag. For Example I Run:

Yes, everything is that way (work dir and args are ok) except the script path . It shows -m module arg1 arg2 .

4 years ago

0 Hi! I Am Currently Using Hydra+Clearml And Wanted To Know If There Are Still Some Updates Coming. At The Moment, If I Change The Defaults Hydra Uses From The

Side note: When running src.train as a module the server gets the command as src and has to be modified to be src.train

3 years ago

0 Hi! I Am Trying To Download Data From Gs Using

Thanks AgitatedDove14 !

4 years ago

0 Hi! I Am Trying To Run Some Experiments On An Agent I Have Configured To Use The Requirements.Txt The Problem Is It Only Shows Cython On The List Of Installed Packages. It Crashes Due To Missing Packages.

` [package_manager.force_repo_requirements_txt=true] Skipping requirements, using repository "requirements.txt"
Using base prefix '/opt/conda'
New python executable in /home/ramon/.clearml/venvs-builds/3.7/bin/python3.7
Also creating executable in /home/ramon/.clearml/venvs-builds/3.7/bin/python
Installing setuptools, pip, wheel...
2021-06-10 09:57:56
done.
Collecting pip<20.2
Using cached pip-20.1.1-py2.py3-none-any.whl (1.5 MB)
Installing collected packages: pip
Attempting uninstall: p...

3 years ago

0 Hi! I Am Getting The Following Error On An Agent:

With pip I get the first error I showed, I tried conda and it starts running but at some point crashes with:
clearml_agent: ERROR: 'NoneType' object has no attribute 'lower'

2 years ago

0 Hi! I Am Using The Modelcheckpoint Callback From Tensorflow To Save The Best Model. When The Experiment Finishes If I Go On The Server To Experiment > Artifacts > Output Model I Can See The Model And Subsequently By Clicking On It The Weights. How Can I

I get the URL to the checkpoint/weights can I use this to download the weights?

3 years ago

0 Hi All! Is There A Way For Trains To Recognize The Cli Arguments When Using

Yes, exactly! Unfortunately I am not so familiar with the internals of the library but I could take a look and figure that out.

4 years ago

0 Hi! Is There Something Happening With The

Thanks AgitatedDove14 ! seems to be subclassed model + extension

3 years ago

0 I Am Also Experiencing A Weird Behaviour When Running A Script Using The Module Flag. For Example I Run:

I’ll show you what I have through PM!

4 years ago

0 Hi! Is There A Way To Run A Task Without Reporting To The Server? For Example If I Want To Debug A Script By Running It Locally Without It Appearing On The Server

AgitatedDove14 task.set_archived(True) + the cleanup service should do it 👌 If we run in debug mode the experiment goes directly to the archive and gets cleaned and we don’t pollute the main experiment page.

3 years ago

0 Hi! I Am Getting The Following Error On An Agent:

I have the agent configured to force install requirements.txt

2 years ago

0 Hi, Is There A Way To Force The Requirements.Txt? I Have A Package I Installed Directly From Github But The Version Is Always Wrong. Any Other Way To Do This?

Yes Martin! I have a package installed from github but its using the pypi version

3 years ago

0 Hi! I Am Currently Using Hydra+Clearml And Wanted To Know If There Are Still Some Updates Coming. At The Moment, If I Change The Defaults Hydra Uses From The

AgitatedDove14 Thanks! I’ll give it a try! Makes sense 👌

3 years ago

0 Hi! I Am Having Some Problems With A Loss After A Good Amount Of Training, What Would Be The Best Way To Log A Value To Have A Better Idea Of What Is Happening?

Awesome AgitatedDove14 Thanks a lot 🙌

2 years ago

0 Hi! Is There Something Happening With The

Any idea why this could happen?

3 years ago

0 Hi

SuccessfulKoala55 on both 8080 and 8008 I get: Safari can’t open the page http://<External IP>:80XX because Safari can’t establish a secure connection to the server http://<External IP>:80XX .

4 years ago

On the server through the command line?

3 years ago

Show more results