Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DefeatedCrab47
Moderator
15 Questions, 42 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

41 × Eureka!
0 Votes
5 Answers
624 Views
0 Votes 5 Answers 624 Views
3 years ago
0 Votes
7 Answers
561 Views
0 Votes 7 Answers 561 Views
Hello, I'm trying to run the docker-compose version of the trains-server, but with the command sudo docker-compose -f /opt/trains/docker-compose.yml up --bui...
3 years ago
0 Votes
10 Answers
571 Views
0 Votes 10 Answers 571 Views
Hello! We are trying to upgrade from Trains server 15.1 to 16.1 using Docker, but are running into a permission error: trains-elastic | "stacktrace": ["org.e...
3 years ago
0 Votes
2 Answers
493 Views
0 Votes 2 Answers 493 Views
3 years ago
0 Votes
18 Answers
551 Views
0 Votes 18 Answers 551 Views
PyTorch Lightning question about logging a figure. I have the following code: # turn confusion matrix into a figure (Tensor cannot be logged as a scalar) fig...
3 years ago
0 Votes
3 Answers
514 Views
0 Votes 3 Answers 514 Views
Where did the TrainsLogger go in PyTorch Lightning (Bolts)? First it was moved from Lightning ( from pytorch_lightning.loggers import TrainsLogger ) to Bolts...
3 years ago
0 Votes
7 Answers
545 Views
0 Votes 7 Answers 545 Views
The links to PyTorch Lightning are broken in the https://allegro.ai/docs/integrations/integration_pytorch_lightening/ . Both "Use the PyTorch Lightning https...
3 years ago
0 Votes
3 Answers
577 Views
0 Votes 3 Answers 577 Views
In your https://allegro.ai/blog/allegro-trains-v-0-15-release/ , the link to an example of Hyper-Parameter Optimizer ( https://github.com/allegroai/trains/bl...
3 years ago
0 Votes
9 Answers
604 Views
0 Votes 9 Answers 604 Views
3 years ago
0 Votes
7 Answers
512 Views
0 Votes 7 Answers 512 Views
3 years ago
0 Votes
4 Answers
583 Views
0 Votes 4 Answers 583 Views
3 years ago
0 Votes
1 Answers
523 Views
0 Votes 1 Answers 523 Views
Just a pull request for a small bug fix found in 2 examples: https://github.com/allegroai/trains/pull/148
3 years ago
0 Votes
2 Answers
534 Views
0 Votes 2 Answers 534 Views
Port remapping of the webserver is not supported (documentation only mentions 8080 , 8081 and 8008 need to be available)? On our server we have JupyterHub ru...
3 years ago
0 Votes
11 Answers
643 Views
0 Votes 11 Answers 643 Views
I want to upgrade to the latest TRAINS 0.15.1, so I followed the instructions under "Upgrading" here: https://allegro.ai/docs/deploying_trains/trains_server_...
3 years ago
0 Votes
1 Answers
629 Views
0 Votes 1 Answers 629 Views
3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

trains ( 0.15.1-367 ) appears to be the version, same as you. Thank you. Appears Trains is up to date.

Apparently there should be 6 of them:

3 years ago
0 In Your

Thank you

3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

Hmm, after connecting with the VPN again and using ctrl + F5, there is no complaint anymore. Although a colleague uploaded a Seaborn plot, but it's still not showing up, which I thought was fixed in the new version?
The plots page is pure white of that experiment, and not the usual "No chart data" if no plot was uploaded.

3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

After a while I get the message:

New version available
Click the reload button below to reload the web page

I click the "RELOAD" button and the "newer version" message disappear. However, some plots still don't show up (fixed in 0.15.1). If I refresh the TRAINS webinterface, the "newer version" message appears again.

3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

Is there anyway how I can figure out in the webinterface what version of Trains is actually running?

3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

It's my colleague's experiment (with scikit-learn), so I'm not sure about the details.

3 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

TimelyPenguin76 The colleague is actually a her, but she replied that how it's looking now is correct? We're actually both already passed our work time (weekend :D), so we'll take a look at it after the weekend. If there is still something wrong, I'll get back to you. Thanks for offering help though :)

3 years ago
0 Hello, I'M Trying To Run The Docker-Compose Version Of The Trains-Server, But With The Command

The only change I made in the .yml file was:
` ports:

  • "8080:80" to ports:
  • "8082:80" `
    I already had something running on 8080, but since it's the trains-apiserver and not the webserver, this shouldn't be an issue.
3 years ago
0 Hello, I'M Trying To Run The Docker-Compose Version Of The Trains-Server, But With The Command

First I tried without build, but same problem. --build just means that it will re-download all layers instead of using the ones already cached.

3 years ago
0 Hello, I'M Trying To Run The Docker-Compose Version Of The Trains-Server, But With The Command

Exactly, so that remapping of port 8080 should not be the reason for this issue

3 years ago
0 Hello, I'M Trying To Run The Docker-Compose Version Of The Trains-Server, But With The Command

Ah my bad, it seems I had to run
docker-compose -f /opt/trains/docker-compose.yml pullonce. I quickly tried trains like half a year ago, so maybe it was using the old images? However, I thought --build would take care of that.

Now it's working 🙂

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

With PyTorch Lightning, I only use this line at the beginning of a Jup Notebook:
Task.init(project_name=project_name, task_name=task_name)The code to log the confusion matrix is in some .py file though that does not have any Trains code.

Is it possible to log it in a TB compatible way, that will be automatically picked up by Trains? I prefer to keep the .py Trains free.

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

AgitatedDove14 TB has the confusion matrix like this:

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

That's useful to know! But actually in this case I want to just test if the code works (run 2 epochs and see if it works). I don't want this to be logged, so I don't Task.init in those cases.
I don't want the code to crash on Trains in those cases.

I see that Task.current_task() returns None if no task is running, so I can use that with an if statement 🙂

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

Aah, I couldn't find it under PLOTS, but indeed it's there under DEBUG SAMPLES.

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

AgitatedDove14 There is only a events.out.tfevents.1604567610.system.30991.0 file.
If I open this with a text editor, most is unreadable, but I do find a the letters "PNG" close to the name of the confusion matrix. So it looks like the image is encoded inside the TB log file?

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

So if I want it under plots, I would need to call e.g. report_confusion_matrix right?

3 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

I have a numpy array, but I indeed didn't see a TB way of doing it. I guess that's not really an issue to add. The code should also be usable without Trains. How should I test if there is a current task? (I need a VPN on to log to TRAINS, which can be annoying for small tests)

3 years ago
0 In Relation To Pytorch Lightning V1.X, Usage In Combination With Trains Has Become Much Smoother (Just Pure Tensorboard). However, When Checking The "Configuration" Tab Of An Experiment, It'S Empty. How Do I Get Trains To Log The Hyperparameters? I'Ve Tr

As there are quite some hparams, which also change depending on the experiment, I was hoping there was some automatic way of doing it?

For example that it will try to find all dict entries that match "yet_another_property_name": "some value" , and ignore those that don't.
The value has to be converted to a string btw?

3 years ago
0 Suddenly All Experiments We Try To Log Run Into An Error. I Think It'S A Server Thing At Our Side, Because As Far As I Know Nothing Changed About Trains (We Didn'T Update Or Anything) And Yesterday It Was Working Well. Can Anyone Provide Some Insights At

SuccessfulKoala55 Thank you. I stared myself dead at trains-apiserver , but by coincidence I found this message:
` trains-elastic | {"type": "server", "timestamp": "2020-11-10T06:11:08,956Z", "level": "WARN", "component": "o.e.c.r.a.DiskThresholdMonitor", "cluster.name": "trains", "node.name": "trains", "message": "flood stage disk watermark [95%] exceeded on [QyZ2i1mxTG6yR7uhVWjV9Q][trains][/usr/share/elasticsearch/data/nodes/0] free: 43.3gb[4.7%], all indices on this node will be ...

3 years ago
0 Suddenly All Experiments We Try To Log Run Into An Error. I Think It'S A Server Thing At Our Side, Because As Far As I Know Nothing Changed About Trains (We Didn'T Update Or Anything) And Yesterday It Was Working Well. Can Anyone Provide Some Insights At

It seems to be related to trains-apiserver , based on the log inside the Docker compose:

` trains-apiserver | [2020-11-10 04:40:14,133] [8] [ERROR] [trains.service_repo] Returned 500 for queues.get_next_task in 20ms, msg=General data error: err=('1 document(s) failed to index.', [{'index': {'_index': 'queue_metrics_d1bd92a3b039400cbafc60a7a5b1e52b_2020-11', '_type': '_doc', '_id': 'rkh0sHUBwyiZSyeZUAov', 'status': 403, 'error': {'type': 'cluster_block_exception', 'reason': 'index [queu...

3 years ago
0 Is There A Way How I Can Get How Many Minutes The Gpu Has Been Used In A Month? The Duration Of An Iteration Is For Every Run Different If You Vary Batch Size. Model, Or Other Stuff. I Want To Do A Crude Energy Consumption Calculation By Doing A Sum Over

Hi AgitatedDove14
Not using trains-agent yet. Just using PyTorch Lightning in Jupyter Notebook with as Logger Trains.
So I'm talking about runtime and GPU usage in experiments.

3 years ago
0 The Links To Pytorch Lightning Are Broken In The

I see that Trains has been removed 2 days ago: https://github.com/PyTorchLightning/pytorch-lightning/commit/41f5df18a4b96ce753263fadd9c27f1d30e5d7a2

and instead has been moved to Bolts: https://github.com/PyTorchLightning/pytorch-lightning-bolts

However, I cannot find a reason why only Trains has been moved?

3 years ago
Show more results compactanswers