You do not need the cudatoolkit package, it is automatically installed if the agent uses conda as the package manager. See your clearml.conf for the exact configuration you are running:
https://github.com/allegroai/clearml-agent/blob/a56343ffc717c7ca45774b94f38bd83fe3ce1d1e/docs/clearml.conf#L79
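Roughly, the relevant bit of clearml.conf looks like this (just a sketch, your file may differ):
agent {
    package_manager {
        # set to "conda" and the agent will resolve cudatoolkit for you
        type: conda
    }
}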
link with "localhost" in it Oo
Hmm, I think this is the main issue: for some reason the dataset's default upload destination is "localhost". What do you have configured in your clearml.conf under files server?
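For reference, this is roughly the entry in question (the address below is just a placeholder for your own server):
api {
    # should point at your actual files server, not localhost
    files_server: http://<your-server-ip>:8081
}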
Hi @<1547028052028952576:profile|ExuberantBat52>
from clearml import Task

task = Task.get_task(...)
print(task.data)
wdyt?
odd message though ... it should have said something about boto3
Hmm, what's the OS and Python version?
Is this simple example working for you?
Hi UpsetTurkey67
repository discovery stores the GitHub repo in the form:
...
while for others
git@github.com:...
Yes that depends on how they locally cloned the repo (via SSH or user/pass/token)
Interestingly, in the former case the ssh config is ignored and cloning the repository breaks on the worker
If you have passed a git user/pass to the agent it should use them, not SSH. How did you configure the agent?
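Something like this in clearml.conf (the values are placeholders):
agent {
    git_user: "my-git-user"
    git_pass: "my-access-token"
    force_git_ssh_protocol: false
}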
btw: what's the OS and python version?
Yes, the left side is the location of the file on the host machine, the right side is the location of the file inside the docker. In our case it is the same location
This will mount the trains-agent machine's hosts file into the docker
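For illustration (assuming the standard /etc/hosts path), that mapping is the usual docker volume syntax, which could for example be passed through the agent's extra_docker_arguments:
agent {
    # host path on the left, container path on the right
    extra_docker_arguments: ["-v", "/etc/hosts:/etc/hosts"]
}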
I am struggling with configuring ssh authentication in docker mode
GentleSwallow91 Basically the agent will automatically mount the .ssh into the container, just make sure you set the following in the clearml.conf:
force_git_ssh_protocol: true
https://github.com/allegroai/clearml-agent/blob/178af0dee84e22becb9eec8f81f343b9f2022630/docs/clearml.conf#L30
Hi @<1539055479878062080:profile|FranticLobster21>
Like this?
https://github.com/allegroai/clearml/blob/4ebe714165cfdacdcc48b8cf6cc5bddb3c15a89f[…]ation/hyper-parameter-optimization/hyper_parameter_optimizer.py
BTW: UnevenDolphin73 you should never actually do "task = clearml.Task.get_task(clearml.config.get_remote_task_id())"
You should just do "Task.init()", it will automatically take the "get_remote_task_id" and do all sorts of internal setups; you will end up with the same object, but in an ordered fashion
Yes, even without any arguments given to Task.init(), it has everything from the server
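A minimal sketch of the recommended pattern (project/task names are only illustrative):
from clearml import Task

# Locally this creates a new Task; when executed by an agent it re-attaches
# to the existing remote Task instead of creating a new one.
task = Task.init(project_name="examples", task_name="my experiment")
print(task.id)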
JitteryCoyote63 I think I found the bug in clearml-task
it adds it at the end instead of before everything else
Or you can do:
param = {'key': 123}
task.connect(param)
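Putting it together, a minimal runnable sketch (names are illustrative):
from clearml import Task

task = Task.init(project_name="examples", task_name="connect example")
param = {'key': 123}
task.connect(param)  # logged as hyperparameters; values can be overridden on remote runs
print(param['key'])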
Hi @<1523715429694967808:profile|ThickCrow29> , thank you for pinging!
We fixed the issue (hopefully), can you verify with the latest RC, 1.14.0rc0?
@<1566959357147484160:profile|LazyCat94>
I found the issue: the import of clearml should be before anything else, this way it patches the ArgumentParser before it is used
from clearml import Task
Move it to the first line, everything should work 🙂
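i.e. something along these lines (the argument itself is just an example):
from clearml import Task  # must come before anything else so the ArgumentParser gets patched
import argparse

task = Task.init(project_name="examples", task_name="argparse example")

parser = argparse.ArgumentParser()
parser.add_argument('--lr', type=float, default=0.001)
args = parser.parse_args()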
mostly by using Task.create instead of Task.init.
UnevenDolphin73, now I'm confused, Task.create is not meant to be used as a replacement for Task.init; it is so you can manually create an additional Task (not the current process Task). How are you using it?
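To illustrate the difference (names are placeholders):
from clearml import Task

# Task.init attaches the *current* process to a Task (creating one if needed)
current_task = Task.init(project_name="examples", task_name="current run")

# Task.create only registers an additional, separate Task entry;
# it does not capture the current process
extra_task = Task.create(project_name="examples", task_name="another task")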
Regarding the second - I'm not doing anything per se. I'm running in offline mode and I'm trying to create a dataset, and this is the error I get...
I think the main thing we need to...
FYI all the git pulls are cached even in docker mode so there is no "tax" to pay for pulling the sub-modules (only the first time of course)
Actually I saw that the
RuntimeError: context has already been set
appears when the task is initialised outside
if name == "main":
Is this when you execute the code locally, or when the agent runs it?
Also, what's the OS of your machine / agent?
Yes, actually ensuring pip is there cannot be skipped (I think in the past it caused too many issues, hence the version limit etc.)
Are you saying it takes a lot of time when running? How long is the actual process that the Task is running (just to normalize times here)?
I think the easiest way is to add another glue instance and connect it with CPU pods and the services queue. I have to admit that it has been a while since I looked at the chart, but there should be a way to do that.
ElegantCoyote26 It means we need to have a keras logger that logs everything to trains, then we need to hook it automatically.
Do you feel like PR-ing the logger (the hooking I can take care of 🙂 )?
Hi FierceFly22
Hi, does anyone know where trains stores tensorboard data
TensorBoard data is stored wherever you point your file-writer to 🙂
What trains does is: while TensorBoard writes its own data to disk, it takes the data (in-flight) and sends it to the trains-server. The trains-server puts everything in the DB, so later everything is viewable & searchable.
Basically you don't need to store your TB files after your experiment is done, you have all the data in the trains-s...
Here you go 🙂
(using trains_agent for easier access to all the data)
from trains_agent import APIClient

client = APIClient()
log_events = client.events.get_scalar_metric_data(task='11223344aabbcc', metric='valid_average_dice_epoch')
print(log_events)
and this
server_info['url'] = f"http://{server_info['hostname']}:{server_info['port']}/{server_info['base_url']}/"
Do you have to have a value there?
Hi ReassuredTiger98
I think DefiantCrab67 solved it 🙂
https://clearml.slack.com/archives/CTK20V944/p1617746462341100?thread_ts=1617703517.320700&cid=CTK20V944