BattyLion34

7 Questions, 47 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

45 × Eureka!

Questions 7
Answers 47

0 Votes

6 Answers

2K Views

0 Votes 6 Answers 2K Views

Hi There. I'M Following The Training Instructions For Testing Clearml Agent (

Hi there. I'm following the training instructions for testing clearml agent ( https://allegro.ai/clearml/docs/docs/tutorials/tutorial_tuning_exp.html#step-3-...

mlops

4 years ago

0 Votes

5 Answers

2K Views

0 Votes 5 Answers 2K Views

Hi Again. While Configuring Clearml-Agent, I'Ve Got Broken Ui Page. After Completely Reinstalling Docker Image For Docker Desktop For Windows, To Run Clearml-Server On My Workstation, I Tried To Set It Up Using "Clearml-Init", But Can'T Set Credentials. I

Hi again. While configuring clearml-agent, I've got broken UI page. After completely reinstalling docker image for Docker Desktop for Windows, to run clearml...

clearml

4 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

Hi All, Sorry For Simple Question, But Is It Possible To Directly Specify The Gitlab Repository To Save Experiment Statistics And Artifacts In Config File (I Have Two Gitlab Accounts And A Github Account)?

Hi all, sorry for simple question, but is it possible to directly specify the gitlab repository to save experiment statistics and artifacts in config file (I...

clearml

4 years ago

0 Votes

28 Answers

2K Views

0 Votes 28 Answers 2K Views

Hi. After Upgrading Clearml To Latest Version, Got This Error From My Pipeline (Windows10, Configured And Running Tensorflowod For Tf 2.3.):

Hi. After upgrading ClearML to latest version, got this error from my pipeline (Windows10, configured and running TensorflowOD for TF 2.3.): File "C:\Users\S...

clearml

4 years ago

0 Votes

31 Answers

144K Views

0 Votes 31 Answers 144K Views

Hi Everyone. I Have An Issue With The Simple Pipeline - It Runs Two Similar Nn Training Steps (Tf2.3, Windows10, Python 3.7) With Only Difference Is A Batch Size. I'M Running First Separately Each Step To Have Them In Clearml Project Page. Then I Run Pipe

Hi everyone. I have an issue with the simple pipeline - it runs two similar nn training steps (tf2.3, windows10, python 3.7) with only difference is a batch ...

clearml

4 years ago

0 Votes

3 Answers

2K Views

0 Votes 3 Answers 2K Views

Hi All! Not Sure This Is Appropriate Place To Ask For Help - Direct Me To The Write Place, If So. But I Need Help With Running Training Process On My Single Workstation (Trying To Run Few Experiments Consequently On Tensorflow Object Detection). The Issue

Hi all! Not sure this is appropriate place to ask for help - direct me to the write place, if so. But I need help with running training process on my single ...

clearml

4 years ago

0 Votes

33 Answers

138K Views

0 Votes 33 Answers 138K Views

Hi Guys, I Have Many Questions To Ask, Sorry If This Questions Were Posted Already - If The Answer Exist, Please, Point Me To It. Thank You For Your Help. I'M Training Object Detection Model Using Tf 2.3 Object Detection Api And Use Clearml On Local Serve

Hi guys, I have many questions to ask, sorry if this questions were posted already - if the answer exist, please, point me to it. Thank you for your help. I'...

clearml

4 years ago

0 Hi Everyone. I Have An Issue With The Simple Pipeline - It Runs Two Similar Nn Training Steps (Tf2.3, Windows10, Python 3.7) With Only Difference Is A Batch Size. I'M Running First Separately Each Step To Have Them In Clearml Project Page. Then I Run Pipe

AgitatedDove14 Yes, the difference in installed packages is large - the training stage, which runs ok has all the following:

4 years ago

No another agent running

4 years ago

0 Hi All, Sorry For Simple Question, But Is It Possible To Directly Specify The Gitlab Repository To Save Experiment Statistics And Artifacts In Config File (I Have Two Gitlab Accounts And A Github Account)?

Okay. I see, I didn't understand clearly the structure and logic behind ClearML. I though that exernal git repository should be set up to keep logs, stats, etc. So, all these are kept on the ClearML host, correct? However, if I want to keep logs on outer repo, is it possible to config ClearML to keep all these files there?

4 years ago

0 Hi Again. While Configuring Clearml-Agent, I'Ve Got Broken Ui Page. After Completely Reinstalling Docker Image For Docker Desktop For Windows, To Run Clearml-Server On My Workstation, I Tried To Set It Up Using "Clearml-Init", But Can'T Set Credentials. I

https://clearml.slack.com/archives/CTK20V944/p1610481348165400?thread_ts=1610476184.162600&cid=CTK20V944
Indeed, that was a cookie issue. After deleting cookies, everything works fine. Thanks. Interesting enough, I had this issue both on Chrome and FF.

4 years ago

Exactely.

4 years ago

Thanks. Not yet, but will watch, by all means.

4 years ago

AgitatedDove14 It works!!! Thanks a lot!

4 years ago

Completed task:

4 years ago

AgitatedDove14 According to the logs (up to traceback message), the only difference between those two tasks is task id name

4 years ago

0 Hi Guys, I Have Many Questions To Ask, Sorry If This Questions Were Posted Already - If The Answer Exist, Please, Point Me To It. Thank You For Your Help. I'M Training Object Detection Model Using Tf 2.3 Object Detection Api And Use Clearml On Local Serve

4 years ago

Well, I'm pretty sure that nntraining is executed in the same queue for these two cases:

4 years ago

Yes, exactly.

4 years ago

Ok, ran (just used point instead of comma in print statement - comment if someone reading this will run this code). Attached to this message.

4 years ago

Here's also the log of failed pipeline - maybe it may give a clue.

4 years ago

Failed task:

4 years ago

AgitatedDove14 Looks like that. First, I've created a toy task running in "services" queue (you didn't tell that but I guess you assumed). I haven't found how to specify the queue to run in code ( Task.equeue(task, queue_name='services') returned an error), so I ran toy.py first in "default" queue, aborted toy.py, started nntraining in "default" queue. Then I reset toy.py and enqueued it to "services" queue. Toy.py failed shortly. I've also reset both toy.py and nntraining and enqueue...

4 years ago

These libraries are absent in the option, which fails. The only libraries of that option (all are present in correct-working option) are:
absl_py==0.9.0
boto3==1.16.6
clearml==0.17.4
joblib==0.17.0
matplotlib==3.3.1
numpy==1.18.4
scikit_learn==0.23.2
tensorflow_gpu==2.2.0
watchdog==0.10.3

4 years ago

Exactly! To be more specified - the same base_task_id fails, if the pipeline is cloned and started from UI. I've checked the queues for failed and completed tasks - they are the same (default, gpu-all).

4 years ago

AgitatedDove14 Yes, that's what I have - for me it's weird, too.

4 years ago

0 Hi There. I'M Following The Training Instructions For Testing Clearml Agent (

TimelyPenguin76 Yes, that's a new file - I haven't added it to repository yet. What I see for original taks "uncommitted changes" - "no changes logged".

4 years ago

astunparse==1.6.3
attrs==20.3.0
botocore==1.19.63
cachetools==4.2.1
certifi==2020.12.5
chardet==4.0.0
cycler==0.10.0
Cython==0.29.21
furl==2.1.0
future==0.18.2
gast==0.3.3
google-auth==1.25.0
google-auth-oauthlib==0.4.2
google-pasta==0.2.0
grpcio==1.35.0
h5py==2.10.0
humanfriendly==9.1
idna==2.10
importlib-metadata==3.4.0
jmespath==0.10.0
jsonschema==3.2.0
Keras-Preprocessing==1.1.2
kiwisolver==1.3.1
Markdown==3.3.3
oauthlib==3.1.0
opt-einsum==3.3.0
orderedmultidict==1.0.1
pathlib2==2.3.5
pat...

4 years ago

No, I have only two agents pulling from different queue:

4 years ago

0 Hi. After Upgrading Clearml To Latest Version, Got This Error From My Pipeline (Windows10, Configured And Running Tensorflowod For Tf 2.3.):

AgitatedDove14
No, I meant different thing. It's not easy to explain, sorry. Let me try. Say, I have a project in folder "d:\object_detection". There I have a script, which converts annotations from labelme format to coco format. This script name is convert_test.py and it runs a process, registered under the same name in clearml. This script, being run separately from command prompt creates new file in project folder - test.json . I delete this file, synch local and remote repos, both...

4 years ago

Ок, thanks.

4 years ago

0 Hi There. I'M Following The Training Instructions For Testing Clearml Agent (

Yes, this works, thank you!

4 years ago

0 Hi. After Upgrading Clearml To Latest Version, Got This Error From My Pipeline (Windows10, Configured And Running Tensorflowod For Tf 2.3.):

AgitatedDove14
No, I do not use --docker flag for clearml agent In Windows setting system_site_packages to true allowed all stages in pipeline to start - but doesn't work in Lunux. I've deleted tfrecords from master branch and commit the removal, and set the folder for tfrecords to be ignored in .gitignore. Trying to find, which changes are considered to be uncommited. By cache files I mean the files in folder C:\Users\Super.clearml\vcs-cache - based on error message, cle...

4 years ago

AgitatedDove14 How can the first process corrupt the second and why doesn't this occur if I run pipeline from command line? Just to be precise - I run all the processes as administrator. However, I've tested running the pipeline from command line in non-administrator mode, it works fine.

4 years ago

Show more results