
the output above is what the agent has, it seems... obviously on my machine I have it installed
Oh I get it, I thought it was only a UI issue... but it actually doesn't send it O_O
no this is from the task execution that failed
# Python 3.8.10 (default, Jun 2 2021, 10:49:15) [GCC 9.4.0]
clearml == 1.0.5
hyperopt == 0.2.5
matplotlib == 3.4.3
numpy == 1.21.2
pandas == 1.3.2
plotly == 5.3.0
python_dateutil == 2.8.2
scikit_learn == 0.24.2
statsmodels == 0.12.2
tqdm == 4.62.2
Detailed import analysis
**************************
IMPORT PACKAGE clearml
tasks/data_projection.py: 9
tasks/hp_optimization.py: 6
tasks/hpo_n_best_evaluation.py: 6
tasks/pipelines/monthly_predictions.py: 4
IMPORT PACKAGE hypero...
why not use my user and group?
yeah but I see it gets enqueued to the default queue
which I don't know what it is connected to
If I execute this task using python .....py
will it execute on the machine I ran it from?
and also in the extra_vm_bash_script variables, I have them under export TRAINS_API_ACCESS_KEY and export TRAINS_API_SECRET_KEY
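For reference, a rough sketch of what that script content could look like when built from Python. The helper name `build_extra_vm_bash_script` is made up for illustration and is not part of trains/clearml; only the two environment variable names come from the message above.

```python
import shlex


def build_extra_vm_bash_script(access_key, secret_key):
    """Return bash lines that export the API credentials on the new VM.

    Hypothetical helper: it only renders text meant to be pasted into the
    extra_vm_bash_script setting, it does not talk to any server.
    """
    lines = [
        # shlex.quote keeps keys with spaces or special characters intact
        "export TRAINS_API_ACCESS_KEY={}".format(shlex.quote(access_key)),
        "export TRAINS_API_SECRET_KEY={}".format(shlex.quote(secret_key)),
    ]
    return "\n".join(lines) + "\n"
```

The returned string is plain bash, so it can be checked by eye before it goes into the autoscaler configuration.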
Now I see the watermarks are 2gb
TimelyPenguin76
alabaster==0.7.12
appdirs==1.4.4
apturl==0.5.2
attrs==21.2.0
Babel==2.9.1
bcrypt==3.1.7
blinker==1.4
Brlapi==0.7.0
cachetools==4.0.0
certifi==2019.11.28
chardet==3.0.4
chrome-gnome-shell==0.0.0
clearml==1.0.5
click==8.0.1
cloud-sptheme==1.10.1.post20200504175005
cloudpickle==1.6.0
colorama==0.4.3
command-not-found==0.3
I mean if I continue and build on the example in the docs, what will happen if the training
task is completed, and then I get it and log to it? Will it be defined as running again?
BTW, is the `if not cached_file: return cached_file` legit or a bug?
This error just keeps coming back... I already made the watermarks like 0.5gb
I only found Project ID, which I'm not sure what this refers to - I have the project name
which permissions should it have? I would like to avoid full EC2 access if possible, and only choose the necessary permissions
I have them in two different places, once under Hyperparameters -> General
Sorry I meant this link
https://azuremarketplace.microsoft.com/en-us/marketplace/apps/apps-4-rent.clearml-on-centos8
AgitatedDove14 please help
So could you re-explain, assuming my pipeline object is created by pipeline = PipelineController(...)?
the Task object has a method called Task.execute_remotely
Look it up here:
https://allegro.ai/docs/task.html#trains.task.Task.execute_remotely
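A minimal sketch of how that call is usually wired in, assuming the clearml package (the link above documents the same method under the older trains name) and a queue named "default". The function name, project name and task name are placeholders, not something from this thread.

```python
def run_on_agent(queue_name="default"):
    """Register the task locally, then hand it over to an agent queue."""
    # Imported lazily so this file can be loaded without clearml installed.
    from clearml import Task

    # Placeholder project/task names, chosen only for this sketch.
    task = Task.init(project_name="examples", task_name="remote run")

    # Run locally: the task is enqueued on `queue_name` and, because
    # exit_process=True, the local process exits right here.
    # Run by an agent: the call does nothing and execution continues below.
    task.execute_remotely(queue_name=queue_name, exit_process=True)

    # Only reached when the script is being executed by the agent.
    return task
```

So with this in place, running the script with plain python should only register and enqueue the task; the actual work happens on whichever machine has an agent listening on that queue.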
I set it to true and restarted my agent
logger.report_table(title="Inference Data", series="Inference Values", iteration=0, table_plot=inference_table)
I'll check if this works tomorrow