Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
DilapidatedParrot58
Moderator
42 Questions, 205 Answers
  Active since 10 January 2023
  Last activity one year ago

Reputation

0

Badges 1

186 × Eureka!
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Step 3 Task ( https://github.com/allegroai/trains/blob/master/examples/pipeline/step3_train_model.py ) - Loads the processed data (from Step 2) and clearml a...
3 years ago
0 Votes
3 Answers
1K Views
0 Votes 3 Answers 1K Views
hey guys, thanks for creating Slack workspace, that's really cool. question - are we missing smth or is currently not possible to pass S3 credentials via env...
4 years ago
0 Votes
2 Answers
1K Views
0 Votes 2 Answers 1K Views
two annoying visual bugs in ClearML Server UI after latest update: experiment status is still shown as “Aborted” after successful resetting until you refresh...
2 years ago
0 Votes
16 Answers
1K Views
0 Votes 16 Answers 1K Views
yo guys, I'm getting Retrying (Retry(total=2, connect=2, read=5, redirect=5, status=None)) after connection broken by 'ConnectTimeoutError(, 'Connection to O...
4 years ago
0 Votes
2 Answers
965 Views
0 Votes 2 Answers 965 Views
one year ago
0 Votes
30 Answers
1K Views
0 Votes 30 Answers 1K Views
4 years ago
0 Votes
27 Answers
1K Views
0 Votes 27 Answers 1K Views
hey guys, I keep getting trains_agent: ERROR: Connection Error: it seems *api_server* is misconfigured. Is this the TRAINS API server http://apiserver:8008 ?...
3 years ago
0 Votes
8 Answers
1K Views
0 Votes 8 Answers 1K Views
4 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
hey guys, here I am again with another question πŸ˜ƒ after the latest update, I’m getting this error when I’m trying to compare scalars for more than 10 experi...
4 years ago
0 Votes
25 Answers
1K Views
0 Votes 25 Answers 1K Views
I'm probably stupid, but how do I specify worker name? usecase - I want to create two workers using the same GPU, and new worker just overwrites the old one
4 years ago
0 Votes
14 Answers
1K Views
0 Votes 14 Answers 1K Views
hey guys the first time I'm seeing this behavior I'm adding a new user to /opt/trains/config/apiserver.conf and restarting the containers. all old users are ...
4 years ago
Show more results questions
0 I'M Using Tensorboard Summarywriter To Add Scalar Metrics For The Experiment. If Experiment Crashed, And I Want To Continue It From Checkpoint, For Some Reason It Plots Metrics In A Really Weird Way. Even Though I Pass Global_Step=Epoch To The Summarywrit

overwriting this value is not ideal though, because for :monitor:gpu and :monitor:machine values I would like to continue from the latest iteration

but for the metrics, I explicitly pass the number of epoch that my training is currently on. it'ls kind of weird that it adds offset to the values that are explicitly reported, no?

3 years ago
0 Hey Guys, I Keep Getting

new version worked

3 years ago
0 I'M Using Tensorboard Summarywriter To Add Scalar Metrics For The Experiment. If Experiment Crashed, And I Want To Continue It From Checkpoint, For Some Reason It Plots Metrics In A Really Weird Way. Even Though I Pass Global_Step=Epoch To The Summarywrit

not sure what you mean. I used to do task.set_initial_iteration(task.get_last_iteration()) in the task resuming script, but in the training code I explicitly pass global_step=epoch to the TensorBoard writer

3 years ago
4 years ago
0 Hi

all our workers went down after starting the slack bot, is it expected?)

4 years ago
0 Hi

we've already restarted everything, so I don't have any logs on hands right now. I'll let you know if we face any problems πŸ˜ƒ slack bot works though! πŸŽ‰

4 years ago
0 Hi

python3 slack_alerts.py --channel trains-alerts --slack_api "OUR_KEY" --include_completed_experiments --include_manual_experiments

4 years ago
0 Hi

new icons are slick, it would be even better if you could upload custom icons for the different projects

4 years ago
0 I'M Using Tensorboard Summarywriter To Add Scalar Metrics For The Experiment. If Experiment Crashed, And I Want To Continue It From Checkpoint, For Some Reason It Plots Metrics In A Really Weird Way. Even Though I Pass Global_Step=Epoch To The Summarywrit

task = Task.get_task(task_id = args.task_id)
task.mark_started()
task.set_parameters_as_dict(
{
"General": {
"checkpoint_file": model.url,
"restart_optimizer": False,
}
}
)
task.set_initial_iteration(0)
task.mark_stopped()
Task.enqueue(task = task, queue_name = task.data.execution.queue)

3 years ago
3 years ago
0 I'M Getting A Lot Of Errors When Running Cleanup Service

self-hosted ClearML server 1.2.0
SDK version 1.1.6

2 years ago
0 I'M Probably Stupid, But How Do I Specify Worker Name? Usecase - I Want To Create Two Workers Using The Same Gpu, And New Worker Just Overwrites The Old One

another stupid question - what is the proper way to delete a worker? so far I've been using pgrep to find the relevant PID πŸ˜ƒ

4 years ago
0 Hi

we are working on the medical projects, so probably images of the different body parts πŸ˜ƒ

4 years ago
Show more results compactanswers