Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DefeatedCrab47
Moderator
15 Questions, 42 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

41 × Eureka!
0 Votes
3 Answers
950 Views
0 Votes 3 Answers 950 Views
Where did the TrainsLogger go in PyTorch Lightning (Bolts)? First it was moved from Lightning ( from pytorch_lightning.loggers import TrainsLogger ) to Bolts...
4 years ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
I want to upgrade to the latest TRAINS 0.15.1, so I followed the instructions under "Upgrading" here: https://allegro.ai/docs/deploying_trains/trains_server_...
4 years ago
0 Votes
2 Answers
932 Views
0 Votes 2 Answers 932 Views
4 years ago
0 Votes
9 Answers
1K Views
0 Votes 9 Answers 1K Views
4 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
4 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
4 years ago
0 Votes
2 Answers
987 Views
0 Votes 2 Answers 987 Views
Port remapping of the webserver is not supported (documentation only mentions 8080 , 8081 and 8008 need to be available)? On our server we have JupyterHub ru...
4 years ago
0 Votes
7 Answers
920 Views
0 Votes 7 Answers 920 Views
4 years ago
0 Votes
18 Answers
1K Views
0 Votes 18 Answers 1K Views
PyTorch Lightning question about logging a figure. I have the following code: # turn confusion matrix into a figure (Tensor cannot be logged as a scalar) fig...
4 years ago
0 Votes
1 Answers
1K Views
0 Votes 1 Answers 1K Views
4 years ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
Hello! We are trying to upgrade from Trains server 15.1 to 16.1 using Docker, but are running into a permission error: trains-elastic | "stacktrace": ["org.e...
4 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
In your https://allegro.ai/blog/allegro-trains-v-0-15-release/ , the link to an example of Hyper-Parameter Optimizer ( https://github.com/allegroai/trains/bl...
4 years ago
0 Votes
1 Answers
952 Views
0 Votes 1 Answers 952 Views
Just a pull request for a small bug fix found in 2 examples: https://github.com/allegroai/trains/pull/148
4 years ago
0 Votes
7 Answers
965 Views
0 Votes 7 Answers 965 Views
Hello, I'm trying to run the docker-compose version of the trains-server, but with the command sudo docker-compose -f /opt/trains/docker-compose.yml up --bui...
4 years ago
0 Votes
7 Answers
961 Views
0 Votes 7 Answers 961 Views
The links to PyTorch Lightning are broken in the https://allegro.ai/docs/integrations/integration_pytorch_lightening/ . Both "Use the PyTorch Lightning https...
4 years ago
0 Hello! We Are Trying To Upgrade From Trains Server 15.1 To 16.1 Using Docker, But Are Running Into A Permission Error:

/opt/trains/ :

` $ ls -al
total 120
drwxrwsrwx 7 root miniconda 4096 Nov 2 18:15 .
drwxr-xr-x 15 root root 4096 Oct 5 15:12 ..
drwxrwxrwx 38 root miniconda 4096 Nov 2 18:15 agent
drwxrwxrwx 2 root miniconda 4096 Jun 19 14:43 config
drwxrwxrwx 8 root miniconda 4096 Nov 2 18:11 data
-rwxrwxrwx 1 root miniconda 4383 Jun 19 14:46 docker-compose_0.15.0.yml
-rwxrwxrwx 1 root miniconda 4375 Jun 26 15:06 docker-compose_0.15.1.yml
-rwxrwxrwx 1 root miniconda 4324 Nov 2 18:...

4 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

TimelyPenguin76 The colleague is actually a her, but she replied that how it's looking now is correct? We're actually both already passed our work time (weekend :D), so we'll take a look at it after the weekend. If there is still something wrong, I'll get back to you. Thanks for offering help though :)

4 years ago
0 Hello! We Are Trying To Upgrade From Trains Server 15.1 To 16.1 Using Docker, But Are Running Into A Permission Error:

Ok, it was indeed something with permission. When I chown everything to root (1000) and chmod 777 it worked. 777 is of course not desirable, so I'm going to narrow it down now.

Thank you for the reply! The migration indeed created this elastic_7 folder.

4 years ago
0 Hello! We Are Trying To Upgrade From Trains Server 15.1 To 16.1 Using Docker, But Are Running Into A Permission Error:

Ok it's that the user group also has to be root. I ran the following:
sudo chmod 775 -R /opt/trains/ sudo chown -R root:root /opt/trainsand it works.

It seems that it has to be 775 with both user and group as root. E.g. 771 does not work, because than the docker command has to be used with sudo (if I want to use my default sudo-user account)

4 years ago
0 Hello! We Are Trying To Upgrade From Trains Server 15.1 To 16.1 Using Docker, But Are Running Into A Permission Error:

AppetizingMouse58 If I:
sudo chmod 771 -R /opt/trains/(taking all permission away from other except execution)
The file permission error comes back, even though everything is under the root user.

4 years ago
0 Hello! We Are Trying To Upgrade From Trains Server 15.1 To 16.1 Using Docker, But Are Running Into A Permission Error:

` trains-elastic exited with code 1
trains-elastic | OpenJDK 64-Bit Server VM warning: Option UseConcMarkSweepGC was deprecated in version 9.0 and will likely be removed in a future release.
trains-elastic | {"type": "server", "timestamp": "2020-11-02T08:04:57,699Z", "level": "ERROR", "component": "o.e.b.ElasticsearchUncaughtExceptionHandler", "cluster.name": "trains", "node.name": "trains", "message": "uncaught exception in thread [main]",
trains-elastic | "stacktrace": ["org.elast...

4 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

With PyTorch Lightning, I only use this line at the beginning of a Jup Notebook:
Task.init(project_name=project_name, task_name=task_name)The code to log the confusion matrix is in some .py file though that does not have any Trains code.

Is it possible to log it in a TB compatible way, that will be automatically picked up by Trains? I prefer to keep the .py Trains free.

4 years ago
0 Is There A Way How I Can Get How Many Minutes The Gpu Has Been Used In A Month? The Duration Of An Iteration Is For Every Run Different If You Vary Batch Size. Model, Or Other Stuff. I Want To Do A Crude Energy Consumption Calculation By Doing A Sum Over

Hi AgitatedDove14
Not using trains-agent yet. Just using PyTorch Lightning in Jupyter Notebook with as Logger Trains.
So I'm talking about runtime and GPU usage in experiments.

4 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

trains ( 0.15.1-367 ) appears to be the version, same as you. Thank you. Appears Trains is up to date.

Apparently there should be 6 of them:

4 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

After a while I get the message:

New version available
Click the reload button below to reload the web page

I click the "RELOAD" button and the "newer version" message disappear. However, some plots still don't show up (fixed in 0.15.1). If I refresh the TRAINS webinterface, the "newer version" message appears again.

4 years ago
0 Suddenly All Experiments We Try To Log Run Into An Error. I Think It'S A Server Thing At Our Side, Because As Far As I Know Nothing Changed About Trains (We Didn'T Update Or Anything) And Yesterday It Was Working Well. Can Anyone Provide Some Insights At

It seems to be related to trains-apiserver , based on the log inside the Docker compose:

` trains-apiserver | [2020-11-10 04:40:14,133] [8] [ERROR] [trains.service_repo] Returned 500 for queues.get_next_task in 20ms, msg=General data error: err=('1 document(s) failed to index.', [{'index': {'_index': 'queue_metrics_d1bd92a3b039400cbafc60a7a5b1e52b_2020-11', '_type': '_doc', '_id': 'rkh0sHUBwyiZSyeZUAov', 'status': 403, 'error': {'type': 'cluster_block_exception', 'reason': 'index [queu...

4 years ago
0 Suddenly All Experiments We Try To Log Run Into An Error. I Think It'S A Server Thing At Our Side, Because As Far As I Know Nothing Changed About Trains (We Didn'T Update Or Anything) And Yesterday It Was Working Well. Can Anyone Provide Some Insights At

SuccessfulKoala55 Thank you. I stared myself dead at trains-apiserver , but by coincidence I found this message:
` trains-elastic | {"type": "server", "timestamp": "2020-11-10T06:11:08,956Z", "level": "WARN", "component": "o.e.c.r.a.DiskThresholdMonitor", "cluster.name": "trains", "node.name": "trains", "message": "flood stage disk watermark [95%] exceeded on [QyZ2i1mxTG6yR7uhVWjV9Q][trains][/usr/share/elasticsearch/data/nodes/0] free: 43.3gb[4.7%], all indices on this node will be ...

4 years ago
0 I Want To Upgrade To The Latest Trains 0.15.1, So I Followed The Instructions Under "Upgrading" Here:

Is there anyway how I can figure out in the webinterface what version of Trains is actually running?

4 years ago
0 The Links To Pytorch Lightning Are Broken In The

What's the abc issue ? Something Lightning team is responsible for?

4 years ago
0 The Links To Pytorch Lightning Are Broken In The

I see that Trains has been removed 2 days ago: https://github.com/PyTorchLightning/pytorch-lightning/commit/41f5df18a4b96ce753263fadd9c27f1d30e5d7a2

and instead has been moved to Bolts: https://github.com/PyTorchLightning/pytorch-lightning-bolts

However, I cannot find a reason why only Trains has been moved?

4 years ago
0 Hey, I Hope This Is The Right Place To Ask. We'Re A Small Data Science Team That Wants To Log Everything About Our Ml Models. Looking Around On The Internet, Mostly Mlflow Is Being Recommended, But Occasionally The Name Trains Pop-Ups. According To You,

Thank you for your impression! I get a bit more of a Airflow feel for running many tasks to train models with different parameters, which is a good thing.

I'm still skimming through the documents, but TRAINS documentation on how models are stored is a bit vague to me. The https://allegro.ai/docs/examples/examples_models/ only quickly mentions that you can set an output location. Which is a bit shallow compared with the https://mlflow.org/docs/latest/model-registry.html . Any good resource...

4 years ago
0 Hello, I'M Trying To Run The Docker-Compose Version Of The Trains-Server, But With The Command

First I tried without build, but same problem. --build just means that it will re-download all layers instead of using the ones already cached.

4 years ago
0 Hey, I Hope This Is The Right Place To Ask. We'Re A Small Data Science Team That Wants To Log Everything About Our Ml Models. Looking Around On The Internet, Mostly Mlflow Is Being Recommended, But Occasionally The Name Trains Pop-Ups. According To You,

FrothyDog40 Thank you for your reply. I agree that MLflow's serving solution is not going to be of much help for real deployment. However, to me the advantage of quickly setting-up an API access point with just 1 line of code helps with some internal trying out. To colleague: "Hey, this new model seems to do good, want to give it a try?".

I've setup my own Docker container with Sanic (like Flask) and indeed it's not too difficult. However, you'll still hit issues like " https://stackoverflo...

4 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

So if I want it under plots, I would need to call e.g. report_confusion_matrix right?

4 years ago
0 Pytorch Lightning Question About Logging A Figure. I Have The Following Code:

AgitatedDove14 There is only a events.out.tfevents.1604567610.system.30991.0 file.
If I open this with a text editor, most is unreadable, but I do find a the letters "PNG" close to the name of the confusion matrix. So it looks like the image is encoded inside the TB log file?

4 years ago
0 The Links To Pytorch Lightning Are Broken In The

Would have been nice if they would have reached out to you guys/gals before removing Trains 😅

4 years ago
Show more results compactanswers