Reputation
Badges 1
119 × Eureka!Iβll show you what I have through PM!
AgitatedDove14 task.set_archived(True)
+ the cleanup service should do it π If we run in debug mode the experiment goes directly to the archive and gets cleaned and we donβt pollute the main experiment page.
I have the agent configured to force install requirements.txt
Yes Martin! I have a package installed from github but its using the pypi version
AgitatedDove14 Thanks! Iβll give it a try! Makes sense π
Awesome AgitatedDove14 Thanks a lot π
SuccessfulKoala55 on both 8080
and 8008
I get: Safari canβt open the page http://<External IP>:80XX
because Safari canβt establish a secure connection to the server http://<External IP>:80XX
.
On the server through the command line?
So I would have to disconnect pytorch? And then upload the model at the end
AgitatedDove14 Well I have a loss function which is something like:class MyLoss(...): def forward(...): weights = self.compute_weights(...) return (weights * (target-preds)).mean()
There seems to be a problem on certain batch when computing the weights. What would be the best way to log the batch that causes the problem, along with the weights being computed.
Thats really cool! But I would still prefer avoid using pip_freeze, is there a way?
Makes sense! Then where would I have to add output_uri
to save the weights?
Yes AgitatedDove14 , I added git user name and password on the trains.conf file. On the results tab of the UI the logs clone command shows the SSH
command instead of the HTTPS
:Repository cloning failed: Command ['clone',
mailto:'git@gitlab.com : ...
TimelyPenguin76 I found out its just one package that is causing the error ( cloudpickle
breaks everything). Is there a way to use Pigar but force a single package to have a version?
Sure, Iβll share It through a private message!
It works perfectly! AgitatedDove14 There is something weird on my side π’
Hey AgitatedDove14 does this work for you?
` from argparse import ArgumentParser
from tensorflow.keras import utils as np_utils
from tensorflow.keras.datasets import mnist
from tensorflow.keras.layers import Dense
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import ModelCheckpoint
import tensorflow as tf
from clearml import Task
class Linear(tf.keras.Model):
def init(self, in_shape=(784,), num_classes=10):
super().init()
self.l...
Works like a charm π thanks!
It is the latest RC, I get the following:
` Executing Conda: /opt/conda/bin/conda install -p /home/ramon/.clearml/venvs-builds/3.8 -c pytorch -c conda-forge -c defaults 'pip<20.2' --quiet --json
Pass
Trying pip install: /home/ramon/.clearml/venvs-builds/3.8/task_repository/my-rep.git/requirements.txt
Executing Conda: /opt/conda/bin/conda install -p /home/ramon/.clearml/venvs-builds/3.8 -c pytorch -c conda-forge -c defaults numpy==1.20.3 --quiet --json
Pass
Warning, could not locate PyTorch to...
Is this caused by running the script with the arguments?
AgitatedDove14 update here! Something like this should work:from trains import StorageManager from trains.storage.helper import StorageHelper bucket = 'gs://bucket' helper = StorageHelper.get(bucket) remote_files = helper.list('folder') for f in remote_files: StorageManager.get_local_copy(bucket + "/" + f)
the *
gives []
results since one the list
method startswith
is used which uses it as a string and not as a wildcard
For option 2 do I have to configure it on all agents or on the server?
AgitatedDove14 Downloading a dataset would not be possible using this right? I want to be able to access the data just avoid reporting the experiment results