do you have any idea why cleanup task keeps failing then (it used to work before the update)
nope, the old cleanup task fails with trains_agent: ERROR: Could not find task id=e7725856e9a04271aab846d77d6f7d66 (for host: )
Exception: 'Tasks' object has no attribute 'id'
weirdly enough, curl http://apiserver:8008 from inside the container works
well okay, it's probably not that weird considering that worker just runs the container
tags are somewhat fine for this, I guess, but there will be too many of them eventually, and they don't reflect the sequential nature of the experiments
nope, that's the point: quite often we run experiments separately, but they are related to each other. currently there's no way to see that one experiment is using a checkpoint from a previous experiment, since we have to manually insert the S3 link as a hyperparameter. it would be useful to see these connections. maybe instead of grouping we could see which experiments are using the artifacts of this experiment
that's right, I have 4 GPUs and 4 workers. but what if I want to run two jobs simultaneously on the same GPU?
parents and children. maybe tags, maybe separate tab or section, idk. I wonder if anyone else is interested in this functionality, for us this is a very common case
we're using os.getenv in the script to get a value for these secrets
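for context, it's basically this pattern (variable names here are made up, ours are project-specific):

```python
import os

def get_secret(name: str, default=None):
    # Look up a secret from the environment; fail loudly if it's missing
    # and no fallback was provided.
    value = os.getenv(name, default)
    if value is None:
        raise RuntimeError(f"missing required secret: {name}")
    return value

# Dummy value so the snippet runs standalone; in practice the agent/container
# is expected to have the real variable set.
os.environ.setdefault("MY_API_TOKEN", "dummy-token")
token = get_secret("MY_API_TOKEN")
```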
we do log a lot of different metrics, maybe that's part of the problem
some of the "tasks.get_all_ex" POST requests fail, as far as I can see
it’s a pretty standard pytorch train/eval loop, using pytorch dataloader and https://docs.monai.io/en/stable/_modules/monai/data/dataset.html
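roughly this shape (minimal sketch, not our actual code: the MONAI dataset is swapped for a TensorDataset with dummy shapes so it's self-contained):

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Dummy data standing in for the MONAI dataset; shapes are hypothetical.
xs = torch.randn(32, 8)
ys = torch.randn(32, 1)
loader = DataLoader(TensorDataset(xs, ys), batch_size=8, shuffle=True)

model = nn.Linear(8, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.MSELoss()

# Train phase
model.train()
for xb, yb in loader:
    opt.zero_grad()
    loss = loss_fn(model(xb), yb)
    loss.backward()
    opt.step()

# Eval phase
model.eval()
with torch.no_grad():
    eval_loss = sum(loss_fn(model(xb), yb).item() for xb, yb in loader) / len(loader)
```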