PompousBeetle71

11 Questions, 49 Answers

Active since 10 January 2023

Last activity 2 years ago

Reputation

Badges 1

49 × Eureka!

Questions 11
Answers 49

0 Votes

6 Answers

2K Views

0 Votes 6 Answers 2K Views

I Wanted To Suggest Something. We'Re Creating A Lot Of Projects And It Starts Getting A Bit Difficult To Navigate Through Them. I Think An Option To Have A Hierarchy In The Projects Can Be Very Useful.

I wanted to suggest something. We're creating a lot of projects and it starts getting a bit difficult to navigate through them. I think an option to have a h...

clearml

5 years ago

0 Votes

11 Answers

2K Views

0 Votes 11 Answers 2K Views

Hi Everyone, Additional Arguments To The Script Execution, Is It Possible? How Can It Be Done? So At The Moment When My Script Is Being Executed The

Hi everyone, Additional arguments to the script execution, is it possible? how can it be done? So at the moment when my script is being executed the sys.argv...

clearml

5 years ago

0 Votes

9 Answers

2K Views

0 Votes 9 Answers 2K Views

Hi There, I'M Training A Pytorch Model And Save It Every Epoch. It Seems Like The Model Wights Are Overridden And I Can'T Choose The Best Model After The Experiment Ends. This Feature Is Missing Or I'M Not Using The Library Well?

Hi there, I'm training a pytorch model and save it every epoch. It seems like the model wights are overridden and I can't choose the best model after the exp...

pytorch

5 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

Hi, I'M Having Some Issues That I Can'T Seem To Find Where The Problem Is Or How To Solve It. I'M Running Some Code On The Worker When I'M Trying To Download One Of The Artifacts That Can Be Found In The Input Model Task I'M Getting:

Hi, I'm having some issues that I can't seem to find where the problem is or how to solve it. I'm running some code on the worker when I'm trying to download...

clearml

5 years ago

0 Votes

13 Answers

2K Views

0 Votes 13 Answers 2K Views

Hi, I'Ve Recently Upgraded To 0.15.1 From 0.14.2, And For Some Reason A Code That Previously Worked In Which I'M Getting The Tags Of A Model Using

Hi, I've recently upgraded to 0.15.1 from 0.14.2, and for some reason a code that previously worked in which I'm getting the tags of a model using InputModel...

clearml

5 years ago

0 Votes

7 Answers

2K Views

0 Votes 7 Answers 2K Views

Hi All, After Solving My Multiprocessing Issue I'Ve Found The Following Issue: I Have A Machine With 2 Gpus. Starting An Agent There Specifying

Hi all, After solving my multiprocessing issue I've found the following issue: I have a machine with 2 GPUs. Starting an agent there specifying --gpus all di...

mlops

5 years ago

0 Votes

24 Answers

2K Views

0 Votes 24 Answers 2K Views

Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

Hi there, I've encountered a problematic behavior in python. When defining an argument a default value of None connecting to trains server then copy the task...

clearml

5 years ago

0 Votes

8 Answers

2K Views

0 Votes 8 Answers 2K Views

Hi, Another Question. I Tried To Not

Hi, another question. I tried to not auto_connect_arg_parser but in that case I need to do the Task.init before I'm parsing the arguments. I don't want that ...

mlops

5 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

Hi, I'Ve Found A Possible Bug. I'M Cloning/Running A Project Without Any Input Model. Which Is As Expected. But, After I Code Actually Start Running An Input Model Shows Up In The So When I Reset The Experiment I Need To Manually Remove The Input Model Na

Hi, I've found a possible bug. I'm cloning/running a project without any input model. Which is as expected. But, after I code actually start running an input...

clearml

5 years ago

0 Votes

16 Answers

2K Views

0 Votes 16 Answers 2K Views

Hi, I'M Getting A Lot Of The Following Logs

Hi, I'm getting a lot of the following logs trains.frameworks - WARNING - Could not retrieve model location, skipping auto model logging I'm fine with it and...

clearml

5 years ago

0 Votes

2 Answers

2K Views

0 Votes 2 Answers 2K Views

Hi There, I'Ve Been Trying To Work With Trains And I Wanted To Save A Folder As The Model Like When Using The "Transformers" Library. They Have This "Save_Pretrained" Method To Their Models. It Saves The Pytorch Model And You Detect It Well, But Only That

Hi there, I've been trying to work with trains and I wanted to save a folder as the model like when using the "transformers" library. They have this "save_pr...

pytorch

5 years ago

0 Hi, I'M Getting A Lot Of The Following Logs

AgitatedDove14
These were the loggers names I can see locally running the code, it might differ running remotely.
['trains.utilities.pyhocon.config_parser', 'trains.utilities.pyhocon', 'trains.utilities', 'trains', 'trains.config', 'trains.storage', 'trains.metrics', 'trains.Repository Detection']
regarding repreduce it, have a long data processing after initializing the task and before setting the input model/output model.

5 years ago

0 Hi There, I'M Training A Pytorch Model And Save It Every Epoch. It Seems Like The Model Wights Are Overridden And I Can'T Choose The Best Model After The Experiment Ends. This Feature Is Missing Or I'M Not Using The Library Well?

AgitatedDove14 Yes.

5 years ago

0 Hi, I'Ve Recently Upgraded To 0.15.1 From 0.14.2, And For Some Reason A Code That Previously Worked In Which I'M Getting The Tags Of A Model Using

TimelyPenguin76 yes, both 0.15.1

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

the trains version is still 0.14 it will take time to switch it

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

I thought to change to connected ditionary though.

5 years ago

0 Hi Everyone, Additional Arguments To The Script Execution, Is It Possible? How Can It Be Done? So At The Moment When My Script Is Being Executed The

AgitatedDove14 I'm using both argpraser and sys.argv to start different processes that each of them will interact with a single GPU. So each process have a specific argument with a different value to differentiate between them. (only the main interact with trains). At the moment I encounter issues with getting the arguments from the processes I spawn. I'm explicitly calling python my_script.py --args... and each process knows to interact with the other. It's a bit complicated to explai...

5 years ago

0 Hi Everyone, Additional Arguments To The Script Execution, Is It Possible? How Can It Be Done? So At The Moment When My Script Is Being Executed The

I created a wrapper to work like executing python -m torch.distributed.launch --nproc_per_node 2 ./my_script.py but from my script. I do call trains.init in the subprocesses, I the actually difference between the subproceses supposed to be, in terms or arguments, local_rank that's all.It may be possible and that I'm not distributing the model between the GPUs in an optimal way or at least in a way that matches your framework.
If you have an example it would be great.

5 years ago

0 Hi Everyone, Additional Arguments To The Script Execution, Is It Possible? How Can It Be Done? So At The Moment When My Script Is Being Executed The

AgitatedDove14 Hi, So I solve that by passing to the created processes the arguments injected into the argprase as part of the commandline. The examples helped.

5 years ago

0 Hi, I'M Having Some Issues That I Can'T Seem To Find Where The Problem Is Or How To Solve It. I'M Running Some Code On The Worker When I'M Trying To Download One Of The Artifacts That Can Be Found In The Input Model Task I'M Getting:

AgitatedDove14 yes, it's the same.

5 years ago

AgitatedDove14 Well, after starting a new project it works. I guess it's a bug.

5 years ago

AgitatedDove14 Yes, I can. I didn't delete the previous project yet.

5 years ago

SteadyFox10 ModelCheckpoint is not for pytorch I think, couldn't find anything like it.

5 years ago

0 Hi, I'Ve Recently Upgraded To 0.15.1 From 0.14.2, And For Some Reason A Code That Previously Worked In Which I'M Getting The Tags Of A Model Using

TimelyPenguin76 I see it in the web-app under the model.

5 years ago

0 Hi All, After Solving My Multiprocessing Issue I'Ve Found The Following Issue: I Have A Machine With 2 Gpus. Starting An Agent There Specifying

AgitatedDove14 yes, you're right. it was 10.2 or 10.1 if I recall.

5 years ago

0 Hi All, After Solving My Multiprocessing Issue I'Ve Found The Following Issue: I Have A Machine With 2 Gpus. Starting An Agent There Specifying

AgitatedDove14 I'm using that code in the meanwhile
` ### This script checks the number of GPUs, create a list like 0,1,2...

Then adds '--gpus' before that list of GPUs

NUM_GPUS=nvidia-smi -L | wc -l
NUM_GPUS=$(($NUM_GPUS-1))
OUT=()
if [ $NUM_GPUS -ge 0 ]
then
for i in $(seq 0 $NUM_GPUS); do OUT+=( "$i" ); done
echo ${OUT[*]// /|} | tr ' ' ',' | awk '{print "--gpus "$1}'
else
echo ""
fi `

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

AgitatedDove14 no, there's no reason in my case to pass an empty string. that's why I removed the type=str part.

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

the version of the agent (the worker that received the job was 0.14.1)
the one that created the template was 0.14.2

5 years ago

0 I Wanted To Suggest Something. We'Re Creating A Lot Of Projects And It Starts Getting A Bit Difficult To Navigate Through Them. I Think An Option To Have A Hierarchy In The Projects Can Be Very Useful.

AgitatedDove14 Good to know! 🙂
I think it's good the way you described it (the second option).
let's call it an applicative project which has experiments and an abstract/parent project, or some other name that group applicative projects.

5 years ago

0 Hi, I'Ve Found A Possible Bug. I'M Cloning/Running A Project Without Any Input Model. Which Is As Expected. But, After I Code Actually Start Running An Input Model Shows Up In The So When I Reset The Experiment I Need To Manually Remove The Input Model Na

AgitatedDove14 Thanks Martin, I know that. I just say it's a bug.

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

AgitatedDove14 v0.14

5 years ago

0 Hi Everyone, Additional Arguments To The Script Execution, Is It Possible? How Can It Be Done? So At The Moment When My Script Is Being Executed The

AgitatedDove14 thanks, I'll check it out.

5 years ago

0 Hi, I'M Getting A Lot Of The Following Logs

AgitatedDove14 It didn't help 😕

5 years ago

0 Hi, I'M Getting A Lot Of The Following Logs

AgitatedDove14 thanks, I'll try

5 years ago

0 Hi, I'M Getting A Lot Of The Following Logs

I use torch and yes, I use save so your code will catch it.

5 years ago

0 Hi, I'Ve Recently Upgraded To 0.15.1 From 0.14.2, And For Some Reason A Code That Previously Worked In Which I'M Getting The Tags Of A Model Using

AgitatedDove14 You were right. I can get them as system tags.
I've wrote a class that wraps an training session and interaction with trains as upon loading/saving the experiment I need more than just the 'model.bin'
So I use these tags to match a specific aux files that were saved with their model.

5 years ago

0 Hi, Another Question. I Tried To Not

awesome 🙂

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

yes, it was.

5 years ago

0 Hi There, I'Ve Encountered A Problematic Behavior In Python. When Defining An Argument A Default Value Of

yes, there's a use for empty strings, for example in text generation you may generate the next word given some prefix, the prefix may be an empty string.

5 years ago

0 Hi, I'Ve Recently Upgraded To 0.15.1 From 0.14.2, And For Some Reason A Code That Previously Worked In Which I'M Getting The Tags Of A Model Using

TimelyPenguin76 the tags names are 'Epoch 1', 'Step 5705'
the return value of the InputModel(<Put a string copy from the UI with the tag id>).tags is an empty array.

5 years ago

SteadyFox10 AgitatedDove14 Thanks, I really did change the name.

5 years ago

Show more results