RoundMosquito25
Moderator
30 Questions, 94 Answers
  Active since 10 January 2023
  Last activity 2 years ago

Reputation

0

Badges 1

94 × Eureka!
0 Votes
3 Answers
2K Views
Hi, suppose that I run the Server on machine A and ClearML Agents on machines B_1, B_2, ..., B_n. Is it required to open some ports on machines B_i? I mean, is...
3 years ago
0 Votes
9 Answers
2K Views
Tasks can be put in draft state: if we execute task.execute_remotely(), it will not start. Is there a way to do the same with a pipeline (not run it immedi...
2 years ago
0 Votes
2 Answers
2K Views
2 years ago
0 Votes
1 Answer
2K Views
Hi! I am researching different MLOps libraries / platforms. I don't want to use platform-as-a-service solutions. Could you tell me what the main diffe...
3 years ago
0 Votes
19 Answers
2K Views
Hi, can I run a ClearML Agent on multiple computers (on-premise)? Is there any example in the documentation on how to do that?
3 years ago
0 Votes
30 Answers
2K Views
Hi, I am trying to use an agent, but I have a problem. Execution of the task gets stuck like this
3 years ago
0 Votes
2 Answers
2K Views
Hi! In the UI I see USER for each task. Where is this variable stored in the system? How do I specify a different user when running a task?
2 years ago
0 Votes
2 Answers
2K Views
Hi, I have a question about logs. When I look into the apiserver logs, I see for example [2023-04-18 14:47:42,043] [9] [INFO] [clearml.service_repo] Retur...
2 years ago
0 Votes
2 Answers
2K Views
Hi, is there a way to abort a task (not reset, not delete) from code?
2 years ago
0 Votes
2 Answers
2K Views
Hi, when we use Task.clone() we need to pass the project id (not the project name). Is there a way to get the project id from the project name?
2 years ago
0 Votes
3 Answers
2K Views
2 years ago
0 Votes
20 Answers
2K Views
Hi, I am trying to use an agent with a very simple sample task, but it gets stuck and the task does not finish. In the UI console I see what I pasted in the image. Do you k...
3 years ago
0 Votes
9 Answers
2K Views
Hi, is it possible to use one GPU card with 2 agents concurrently (there are tasks that need only a fraction of a card)?
2 years ago
0 Votes
4 Answers
2K Views
Hi, I think I've found a strange bug in the scheduler. I would like to run some job every Monday at 15:40 UTC. So I run this at 15:35: scheduler.add_task( name='...
2 years ago
0 Votes
1 Answer
2K Views
When we run a task on a GPU, we can access GPU monitoring. But can we access it from code? The use case is: when we see that there are enough resources for some task...
2 years ago
0 Votes
13 Answers
2K Views
3 years ago
0 Votes
8 Answers
2K Views
2 years ago
0 Votes
4 Answers
2K Views
In the UI, a strange artifact like this is sometimes visible. After reloading, the page looks fine.
2 years ago
0 Votes
7 Answers
2K Views
Hi, I'm using a self-hosted version of ClearML. I am querying experiments from the UI and I am getting this error: Fetch experiments for selection failed Error 1...
2 years ago
0 Votes
1 Answer
2K Views
I also have a general question. Most frequently, we work with our ML models not as a single script, but as a project: a multi-level directory of files. How do we...
3 years ago
0 Votes
3 Answers
2K Views
I have a question about a situation when an agent is running on a queue, but we also use clearml-agent execute --id. How will it work: a) will be run immediatel...
2 years ago
0 Votes
2 Answers
2K Views
Hi, is it possible to switch off the agent right after it finishes the current task (not to fail the task, but to wait a minimal time)?
2 years ago
0 Votes
1 Answer
2K Views
Hi, by accident I've scheduled 1000 experiments. I would like to abort them. In the UI I can do "load more", but it loads 15 more. Is there any option to do ...
2 years ago
0 Votes
14 Answers
2K Views
Hi, we have quite an unusual issue. We run some agents, we attach them to queue. They are doing the job (they are doing hyperparameter optimization), however...
2 years ago
0 Votes
1 Answer
2K Views
Hi, are there examples of testing in ClearML available somewhere? For example, unit tests that check if parameters are passed correctly to new tasks etc.?
2 years ago
0 Votes
3 Answers
2K Views
Hi! PipelineController has method: - add_pipeline_tags ( bool ) – (default: False) if True, add pipe: tag to all steps (Tasks) created by this pipeline. Can ...
2 years ago
0 Votes
4 Answers
2K Views
Hi, I know that ClearML uses local changes for patching and running script. Can it also do the same with local commits?
2 years ago
0 Votes
2 Answers
2K Views
Hi, I have the following use case: I have different types of tasks that use the GPU: a task that uses 1 card, a task that uses 0.5 of a card, a task that uses [other number] of ca...
2 years ago
0 Votes
4 Answers
2K Views
Hi. When we construct a pipeline consisting of steps and when we are aborting a pipeline, the steps aren't aborting (they run until they are done). Is there ...
2 years ago
0 Votes
2 Answers
2K Views
2 years ago
0 Hi, Can I Run An

AgitatedDove14 shouldn't it be
while not an_optimizer.wait(timeout=1.0):
instead of
while an_optimizer.wait(timeout=1.0):
in the first code block?
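
To make the difference between the two loops concrete, here is a minimal sketch with a stand-in optimizer. It assumes that wait(timeout=...) returns True once the optimization has finished and False when the timeout expires first; FakeOptimizer and both helper functions are hypothetical and are not part of ClearML.

```python
class FakeOptimizer:
    """Hypothetical stand-in for an optimizer; not part of ClearML."""

    def __init__(self, polls_until_done):
        self._remaining = polls_until_done

    def wait(self, timeout=None):
        # Assumed contract: True once finished, False if the timeout expired first.
        self._remaining -= 1
        return self._remaining <= 0


def poll_until_finished(optimizer, on_poll):
    """Run on_poll() after each timed-out wait; return how many times it ran."""
    polls = 0
    while not optimizer.wait(timeout=1.0):
        on_poll()
        polls += 1
    return polls


def poll_without_negation(optimizer, on_poll):
    """Same loop without `not`: it exits on the first timed-out wait."""
    polls = 0
    while optimizer.wait(timeout=1.0):
        on_poll()
        polls += 1
    return polls


checks = []
polls = poll_until_finished(FakeOptimizer(polls_until_done=4), lambda: checks.append("progress"))
skipped = poll_without_negation(FakeOptimizer(polls_until_done=4), lambda: None)
```

Under that assumed contract, the negated loop runs its body on every timed-out wait until the optimizer reports that it is done, while the loop without the negation never runs its body if the first wait times out.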

3 years ago
0 I Have Question About A Situation, When Agent Is Running On A Queue, But Also We'Ll Use

Can I do this to specify which worker should execute that task?
CLEARML_WORKER_NAME=<worker_name> clearml-agent execute --id <task_id>

2 years ago
0 Hi, Suppose, That I Run Server On Machine

AgitatedDove14 do you know if it is possible not to open ports on machines B_i where agents reside?

3 years ago
0 Hi, Do You Know More Or Less How Many Workers Can Server Work With. Have You Make Such Stress-Tests? The Thing Is That We Have Some Power Run Agent On (For Example 200 Agents). Will It Handle That? Suppose, That Machine That The Server Runs On Has Around

SuccessfulKoala55 hmm, we are trying to do something like that and we are encountering problems. We are doing big hyperparameter optimization on 200 workers and some tasks are failing (while with fewer workers they are not failing). The UI also has some problems with that. Maybe there are some settings that should be changed compared to the classic configuration?

3 years ago
0 Hi, I Have A Case When I Want To Clone Tasks And Set Some Parameters For Them. I Noticed, That I Can'T Pass Numbers, Only Strings Are Possible There. When I'M Trying To Pass A Number, The Default Value Is Not Overriden. Do You Know Maybe If Numbers Can Be

version 1.8.1
No, there are no error messages. The behaviour is just very strange (or even incorrect)

Suppose that this is a task that is cloned:
` base_task = replacement_task.create_function_task(
func=some_func, # type: Callable
func_name=f'func_id_run_me_remotely_nr', # type:Optional[str]
task_name=f'a func task', # type:Optional[str]
# everything below will be passed directly to our function as arguments
some_argument=message,
some_argument_2=message,
rand...

2 years ago
0 Hi, Can I Run An

AgitatedDove14 suppose that we are doing some optimization task (parameter search). Generally we want to minimize some metric m, but it is enough to have, say, 3 occurrences where m < THRESHOLD; once that happens, we stop the search (and free the resources, which may be needed for some further step).
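
The stopping rule described above could be sketched like this. It is plain Python, not ClearML code: the function name, its parameters, the toy objective and the candidate list are all made up for illustration, and a real setup would launch and stop ClearML tasks instead of calling a local function.

```python
def search_until_enough_hits(objective, sample_params, threshold,
                             required_hits=3, max_trials=100):
    """Try sampled parameters until `required_hits` of them give m < threshold."""
    hits = []
    trials = 0
    for trials in range(1, max_trials + 1):
        params = sample_params()
        m = objective(params)
        if m < threshold:
            hits.append((params, m))
            if len(hits) >= required_hits:
                break  # enough good results: stop and free the resources
    return hits, trials


# Toy example: minimize (x - 2)^2 over a fixed list of candidates (hypothetical data).
candidates = iter([0.0, 1.9, 3.5, 2.1, 1.0, 2.3, 2.0, 4.0])
hits, trials = search_until_enough_hits(
    objective=lambda x: (x - 2.0) ** 2,
    sample_params=lambda: next(candidates),
    threshold=0.25,
)
```

In this toy run the search stops after the sixth candidate, because the third value below the threshold has been found, and the last two candidates are never evaluated.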

3 years ago
0 Hi, Can I Run An

AgitatedDove14 in fact in our case we want to use simple strategies, RandomSearch is enough, but the problem is that we need to change the ranges dynamically

3 years ago
0 Hi, Can I Run An

In fact, as I assume, we need to write our custom HyperParameterOptimizer, am I right?

3 years ago
0 Hi, Do You Know More Or Less How Many Workers Can Server Work With. Have You Make Such Stress-Tests? The Thing Is That We Have Some Power Run Agent On (For Example 200 Agents). Will It Handle That? Suppose, That Machine That The Server Runs On Has Around

SuccessfulKoala55 We are encountering a strange problem. We are spinning up N agents using a script, in a loop

But not all agents are visible as workers (we check it both in the UI and by running workers_list = client.workers.get_all() ).

Do you think it is possible that too many of them are connecting at once, and that we could solve this by adding a delay between starting subsequent agents?
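
What I mean by a delay between agents is something like the sketch below. It is only an illustration of the idea: launch_staggered and the launch callable are hypothetical names, the actual command that starts an agent is left out, and whether such a delay really fixes the missing workers is exactly what I am asking.

```python
import time


def launch_staggered(launch, count, delay_seconds, sleep=time.sleep):
    """Call launch(index) `count` times, pausing between consecutive launches."""
    started = []
    for index in range(count):
        started.append(launch(index))
        if index < count - 1:
            sleep(delay_seconds)  # no pause needed after the last launch
    return started


# Dry run with a fake launcher and a fake sleep, so nothing is actually started.
pauses = []
started = launch_staggered(
    launch=lambda i: f"agent-{i}",
    count=3,
    delay_seconds=2.0,
    sleep=pauses.append,
)
```

In a real script the launch callable would start one agent process, and the sleep argument would be left at its default.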

2 years ago
0 Hi, Suppose, That I Run Server On Machine

Great to hear that, thanks! 🙂

3 years ago
0 In Ui Such Strange Artifact Is Visible Sometimes. After Reloading Page Looks Fine.

Because it does not coincide with any specific actions

2 years ago
0 Hi, We Have Quite An Unusual Issue. We Run Some Agents, We Attach Them To Queue. They Are Doing The Job (They Are Doing Hyperparameter Optimization), However They Are Not Visible Either In:

SuccessfulKoala55 So, we have two problems:
Probably a minor one, but strange. We run some number of workers using the given compose file, which is attached in the .zip. We can do:
docker compose -f docker-compose-worker.yaml build
docker compose -f docker-compose-worker.yaml up
and in theory there should be 10 agents running, but frequently fewer than 10 are shown in the UI (for example, on the last run we got 3 of them). When we run htop, we can see 10 agents in our system. What is even more strange, those...

2 years ago
0 Hi, Do You Know More Or Less How Many Workers Can Server Work With. Have You Make Such Stress-Tests? The Thing Is That We Have Some Power Run Agent On (For Example 200 Agents). Will It Handle That? Suppose, That Machine That The Server Runs On Has Around

SuccessfulKoala55 How should I pass this variable? Do I need to create a file apiserver.conf in the folder /opt/clearml/config and write just CLEARML_USE_GUNICORN=1 there? Do I need to restart the server after that?

3 years ago
0 Hi, I'M Using A Self Hosted Version Of Clearml. I Am Querying Experiments From Ui And I Am Getting Such Error:

I am using UI and I am clicking select all. If it is calling API server then yes

2 years ago
0 Hi. When We Construct A Pipeline Consisting Of Steps And When We Are Aborting A Pipeline, The Steps Aren'T Aborting (They Run Until They Are Done). Is There Any Possibility To Change This Behaviour?

1. building from code: pipe.add_step()
2. not locally, but also not with the services queue
pipe.set_default_execution_queue(DEFAULT_EXECUTION_QUEUE)
Is there a need to use just the services queue?

2 years ago
0 Probably My Question Will Be About Something Other Than Core Concepts Of Clearml, But Something Connected I Think.... I Want To Schedule Tasks On Gpu And I Want Many Agents To Use Given Gpu At One Time. I Also Want To Avoid Situation When Gpu Will Be Out

Yes, it is a good reason 🙂
Do you maybe know a tool that measures that during execution (to avoid looking at nvidia-smi during the whole training)?

So, suppose that a task T uses 27% of the GPU; this means that we can spawn 3 agents on this GPU (suppose that we give them only task T). Does it make sense?
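
The arithmetic behind this is just a floor division, sketched below. The function name and the headroom parameter are made up for illustration, and the sketch only counts the measured fraction per task; GPU memory is a separate limit that it does not check.

```python
import math


def agents_per_gpu(task_gpu_fraction, headroom=0.0):
    """How many identical tasks fit on one GPU, given the fraction each one uses."""
    if not 0.0 < task_gpu_fraction <= 1.0:
        raise ValueError("task_gpu_fraction must be in (0, 1]")
    if not 0.0 <= headroom < 1.0:
        raise ValueError("headroom must be in [0, 1)")
    usable = 1.0 - headroom
    # Small epsilon so that exact fits (e.g. 0.5 -> 2) are not lost to rounding.
    return math.floor(usable / task_gpu_fraction + 1e-9)


three_agents = agents_per_gpu(0.27)   # 1 / 0.27 is about 3.7, so 3 tasks fit
two_agents = agents_per_gpu(0.5)
```

With 27% per task, three tasks use 81% of the card and a fourth would not fit.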

2 years ago
0 Hi, Can I Run An

SuccessfulKoala55 Thank you for the response! Let me elaborate a bit to check if I understand this correctly.
We have a time-consuming task T based on optimization for parameters. We want to run hyperparameter optimization for T, suppose that we want to run it for 100 sets of parameters.
We want to leverage the fact that we have n machines to make the work parallel.

So for that we use https://clear.ml/docs/latest/docs/references/sdk/hpo_optimization_hyperparameteroptimizer/ , we run Agent...

3 years ago