SourSwallow36

4 Questions, 21 Answers

Active since 10 January 2023

Last activity one year ago

Reputation

Badges 1

21 × Eureka!

Questions 4
Answers 21

0 Votes

7 Answers

982 Views

0 Votes 7 Answers 982 Views

Hi There! I'M Trying To Understand How The

Hi there! I'm trying to understand how the trains-server works. So I have a few questions. 1. For a linux deployment, we need to use docker-compose which run...

clearml

4 years ago

0 Votes

16 Answers

1K Views

0 Votes 16 Answers 1K Views

Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

Hello, I'm running trains-server launched on EC2 using the AMI provided by allegroai, however the web app doesn't seem to be working properly. It takes a rea...

clearml

4 years ago

0 Votes

15 Answers

1K Views

0 Votes 15 Answers 1K Views

Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

Hi, In my setup I run multiple experiments in parallel from the same script. I understand that there can only be one execution Task in a script. I would like...

clearml

4 years ago

0 Votes

4 Answers

1K Views

0 Votes 4 Answers 1K Views

Also, Small Question On Logging Inference Data: I Ran An Experiment To Train A Model. Now I Want To Run Inference Using That Model And Log Inference Metrics To The Same Experiment Which Has Training Details. So Overall There Is Just One Experiment Which

Also, small question on logging inference data: I ran an experiment to train a model. Now I want to run inference using that model and log inference metrics ...

clearml

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

I just tried the .16.1 and am seeing the same behavior.

Here is the AMI id: allegroai-trains-server-0.16.1-320-273-c5c210e4-5094-4eb9-a613-a32c0378de31-ami-06f5e9f4dfa499dca.4

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

How did you figure out that there was no communication between the server and the web-app?

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

how are you thinking of running those HP tests?

I'm not sure if I completely understand the question. Here is what I do presently. This maybe achieved more efficiently in trains (that's why I'm trying to move to trains).

Example:
I have a set of 10 user defined HPs. I have a scheduler that runs them independently in parallel. Once the training is complete, I run inference on the test set for these experiments. The data for both training and inference is logged under the respective exp...

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

Great, yes that makes sense.

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

Ok, cool. Thanks. This clears up things. I need to read more about the trains agent then. I have another question, I'll post it as a separate thread.

4 years ago

0 Also, Small Question On Logging Inference Data: I Ran An Experiment To Train A Model. Now I Want To Run Inference Using That Model And Log Inference Metrics To The Same Experiment Which Has Training Details. So Overall There Is Just One Experiment Which

And yes, I'm logging different metrics

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

Mostly they are a set of user defined hyper-parameters. I've been reading about hyper-param optimization since posting this. It seems like I would have to use hyper-param opt to achieve that.

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

Yes every run is log as a new experiment (with it's own set of HP). Do notice that the execution itself is done by the "trains-agent". Meaning the HP process creates experiments with new set of HP an dputs them into the execution queue, then

trains-agent

pulls them from the queue and starts executing them. You can have multiple

trains-agent

on as many machines as you like with specific GPUs etc. each one will pull a single experiment and execute it, once...

4 years ago

0 Hi, In My Setup I Run Multiple Experiments In Parallel From The Same Script. I Understand That There Can Only Be One Execution

For HPO (hyper-param opt), are all experiments which are part of the optimization process logged? I understand the HPO process takes a base experiment and runs subsequent experiments with the new HPs. Are these experiments logged too (with the train-valid curves, etc)?

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

That was it! I had not added 8008.

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

I understand regarding not opening up the ports for entire world. I'm just testing the setup 🙂

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

Here is when I try to load the profile page

4 years ago

try:

Task.init('examples', 'training', continue_last_task='<previous_task_id_here>')
Just tried this and it works. Thanks! Really appreciate the great response!

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

Oh, I've opened 8080, 8081 in my security group and NOT 8008.

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

Here is the network tab when trying to load the projects page

4 years ago

'by the same name' you mean names of the metrics and not the experiment name, right?

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

Got it. Thanks for the help and explanation!

4 years ago

0 Hello, I'M Running Trains-Server Launched On Ec2 Using The Ami Provided By Allegroai, However The Web App Doesn'T Seem To Be Working Properly.

AMI : allegroai-trains-server-0.15.1-366-248-c5c210e4-5094-4eb9-a613-a32c0378de31-ami-0bc20623da659a8cd.4

Hmm, I'm using 0.15.1 which I guess is an old version. I just created a new one with 0.16.1 and will test it out

4 years ago

0 Hi There! I'M Trying To Understand How The

Thanks, I appreciate the answer!

So not the latter. I can always log metrics during training and visualize them.

I'm thinking of a few plots in my current in-house tooling which are slightly different than the standard charts we look at. For example a custom parallel coordinate chart that can use aggregations, categorical variables, etc.
To move over to trains, I'd like to have all these custom plots in my dashboard. I haven't tried to do them in trains yet (I'm just starting).

So my quest...

4 years ago

0 Hi There! I'M Trying To Understand How The

I agree it would be better to have it fully configurable. But if every marginal feature adds complexity, we might have to think how applicable that is to the general use cases. I'm thinking of examples in my domain which might not be useful in other domains. Maybe if that becomes an issue, there could be a domain specific feature base?!

I haven't fully compared all the things that I am currently doing with the in-house tool and what we can do with trains. I think I will have more concrete id...

4 years ago

0 Hi There! I'M Trying To Understand How The

Hi AgitatedDove14
Thanks for the quick response.
I will try that out. Great! This is a great tool and I will start contributing. Why use both and not just one of them? What does one offer that the other doesn't?
Also, I would like to add some other plots to the dashboard. I see the plotting is done using Plotly Javascript. I'm a Python developer and don't know much Javascript. Do you have any suggestions on how to go about that or I should just get going with Javascript?

4 years ago