Examples: query, "exact match", wildcard*, wild?ard, wild*rd
Fuzzy search: cake~ (finds cakes, bake)
Term boost: "red velvet"^4, chocolate^2
Field grouping: tags:(+work -"fun-stuff")
Escaping: Escape characters +-&|!(){}[]^"~*?:\ with \, e.g. \+
Range search: properties.timestamp:[1587729413488 TO *] (inclusive), properties.title:{A TO Z}(excluding A and Z)
Combinations: chocolate AND vanilla, chocolate OR vanilla, (chocolate OR vanilla) NOT "vanilla pudding"
Field search: properties.title:"The Title" AND text
Profile picture
JitteryCoyote63
Moderator
214 Questions, 1021 Answers
  Active since 10 January 2023
  Last activity 7 months ago

Reputation

0

Badges 1

979 × Eureka!
0 Votes
17 Answers
1K Views
0 Votes 17 Answers 1K Views
Hi, I updated to clearml-server 1.4.0 and I am uncomfortable with the new Table/Detail view, is there a way to disable it and use the previous one (on click ...
2 years ago
0 Votes
12 Answers
1K Views
0 Votes 12 Answers 1K Views
Hi, I deleted some archived experiments in clearml server 1.0 and the popup in the dashboard showed “the following artifacts were not deleted”, with a list o...
3 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
Hi, are the experiments logs stored in s3 or in the trains-server? (When using s3 as artifact storage)
3 years ago
0 Votes
6 Answers
993 Views
0 Votes 6 Answers 993 Views
Hi, Is there a way to stop a clearml-agent from within an experiment? Or block it to prevent it running any other task?
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
2 years ago
0 Votes
5 Answers
964 Views
0 Votes 5 Answers 964 Views
Hi, I am using clearml with pytorch-ignite and its EarlyStopping handler. I would like to log the counter of the patience of this handler, how can I do that?
3 years ago
0 Votes
2 Answers
941 Views
0 Votes 2 Answers 941 Views
Hi guys; another idea: would be very cool to have a mattermost alert (monitor task), just like the one for Slack. Have a nice week-end all ๐Ÿ‘‹
3 years ago
0 Votes
2 Answers
927 Views
0 Votes 2 Answers 927 Views
First link in hyperparameter optimization page is broken > https://allegro.ai/docs/examples/examples_hyperparam_opt/
4 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
The “Manage queue” option in the right tab on a queued experiment is broken in v1.0 (it does nothing)
3 years ago
0 Votes
16 Answers
1K Views
0 Votes 16 Answers 1K Views
Hi guys, coming this time to share an idea of a killer feature for ClearML ๐Ÿš€ I am pretty sure you guys already heard of https://www.streamlit.io/ , which is...
3 years ago
0 Votes
23 Answers
1K Views
0 Votes 23 Answers 1K Views
Hi, I started a trains-agent (0.15) in services mode (full command: trains-agent daemon --services-mode --detached --queue services --create-queue --docker u...
4 years ago
0 Votes
30 Answers
1K Views
0 Votes 30 Answers 1K Views
Hi there ๐Ÿ™‚ Task.get_parameters() returns an empty dict from within a trains-agent task being executed. When I execute it outside, it works properly. Is it i...
4 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
Hello, is it possible for the clearml-agent in docker mode to not pull a specific docker image, but to build one from the experiment repository using the Doc...
2 years ago
0 Votes
15 Answers
1K Views
0 Votes 15 Answers 1K Views
Hi, how can I get the logs from the pytorch ignite early stopping handler to be logged in clearml?
3 years ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
3 years ago
0 Votes
18 Answers
960 Views
0 Votes 18 Answers 960 Views
Hi Guys, I had several times now the following errors poping in agents while executing a task: trains_agent: ERROR: Failed applying git diff: I attached the ...
4 years ago
0 Votes
4 Answers
671 Views
0 Votes 4 Answers 671 Views
Hi all, I updated from clearml-server 1.14.1 to 1.15.0 and I am getting the following error while trying to start the server after running docker-compose pul...
8 months ago
0 Votes
3 Answers
929 Views
0 Votes 3 Answers 929 Views
Hi, I am getting an error while running task.mark_stopped() , any idea why? (clearml 1.0.2, clearml-agent 1.0.0, python 3.6) File "/home/machine/.clearml/ven...
3 years ago
0 Votes
6 Answers
1K Views
0 Votes 6 Answers 1K Views
Hi there, maybe this was already asked but I don't remember: Would it be possible to have the clearml-agent switch between docker mode and virtualenv mode at...
one year ago
0 Votes
10 Answers
1K Views
0 Votes 10 Answers 1K Views
Hi, I have a local package that I use to train my models. To start training, I have a script that calls task._update_requirements([".", "torch==1.11.0"]) . I...
2 years ago
0 Votes
8 Answers
938 Views
0 Votes 8 Answers 938 Views
3 years ago
0 Votes
7 Answers
1K Views
0 Votes 7 Answers 1K Views
Hi, I recently updated clearml-server to 1.7 and I am getting a lot of the following errors since today on any experiment (I didn't had this error before): 1...
2 years ago
0 Votes
18 Answers
1K Views
0 Votes 18 Answers 1K Views
Hi, I just updated clearml server 1.0 using docker-compose down & docker-compose pull & docker-compose up -d , it worked ant it looks amazing! I found two pr...
3 years ago
0 Votes
23 Answers
940 Views
0 Votes 23 Answers 940 Views
Hi, I would like to bring awareness on this issue , this impacts my work as I cannot install the older version of torch (1.11.0)
one year ago
0 Votes
5 Answers
1K Views
0 Votes 5 Answers 1K Views
3 years ago
0 Votes
18 Answers
1K Views
0 Votes 18 Answers 1K Views
Hi, kudos for the 0.15 guys! I am having an issue related to git auth: I have an issue with trains-agent (0.15): it does not use git creds while trying to cl...
4 years ago
0 Votes
11 Answers
1K Views
0 Votes 11 Answers 1K Views
Hey, I moved my trains-server to another machine, zipping the /opt/trains/data folder as described in the docs https://allegro.ai/docs/deploying_trains/train...
4 years ago
0 Votes
19 Answers
1K Views
0 Votes 19 Answers 1K Views
one year ago
0 Votes
4 Answers
895 Views
0 Votes 4 Answers 895 Views
Is there a way to report a simple series with X and Y coords, X and Y being two lists of same length?
4 years ago
0 Votes
4 Answers
1K Views
0 Votes 4 Answers 1K Views
2 years ago
Show more results questions
0 Hi, I Would Like To Follow-Up In This

AgitatedDove14 SuccessfulKoala55 I just saw that clearml-server 1.4.0 was released, congrats ๐Ÿš€ ๐Ÿ™Œ Was this bug fixed with this new version?

2 years ago
0 Hi There,

clearml doesn't change the matplotlib backend under the hood, right? Just making sure ๐Ÿ˜„

one year ago
0 Hi, How Does

There was no possible cache, the agent was running on a new ec2 instance

one year ago
0 Hi, Although

SuccessfulKoala55 I can try to make one, letโ€™s see ๐Ÿ™‚

3 years ago
0 Hi, Although

Does that mean that agents do not read this parameter?

3 years ago
0 Hi, Although

What will this parameter do?

3 years ago
0 Hi, Although

so the task they execute must have clearml installed?

3 years ago
0 Hi There,

Disclaimer: I didn't check this will reproduce the bug, but that's all the components that should reproduce it: a for loop creating figures and clearml logging them

one year ago
0 Hey There, I Moved The Clearml S3 Bucket Where I Stored All My Clearml Data From One S3 Bucket To Another And Now I Realized That All The Models/Experiments Logged In The Clearml-Server Still Refer To The Old S3 Bucket. Is There A Way To Update All The Re

Yes, I would like to update all references to the old bucket unfortunatelyโ€ฆ I think Iโ€™ll simply delete the old s3 bucket, wait or his name to be available again and recreate it where on the other aws account and move the data there. This way I donโ€™t have to mess with clearml data - I am afraid to do something wrong and loose data

3 years ago
0 Hi There,

For me it is definitely reproducible ๐Ÿ˜„ But the codebase is quite large, I cannot share. The gist is the following:

import matplotlib.pyplot as plt
import numpy as np
from clearml import Task
from tqdm import tqdm

task = Task.init("Debug memory leak", "reproduce")

def plot_data():
    fig, ax = plt.subplots(1, 1)
    t = np.arange(0., 5., 0.2)
    ax.plot(t, t, 'r--', t, t**2, 'bs', t, t**3, 'g^')
    return fig

for i in tqdm(range(1000), total=1000):
    fig = plot_data()
  ...
one year ago
0 Hi, Together With

Alright, I will try with that one

4 years ago
0 Hi, Is It Possible To Pass Environment Variables To Agents Created By The Aws Autoscaler Service?

If I remove security_group_ids and just let subnet_id in the configuration, it is not taken into account (the instances are created in a the default subnet)

3 years ago
0 Hi, I Face A Strange Behavior From The Clearml-Agent: It’S Running In Services Mode, Not In Docker Mode, Cpu Only. I Want To Execute Two Tasks On This Service Agent. One Works, The Other Always Fails After Being Enqueued And Picked By The Agent With The E

and in the logs:
`
agent.worker_name = worker1
agent.force_git_ssh_protocol = false
agent.python_binary =
agent.package_manager.type = pip
agent.package_manager.pip_version = ==20.2.3
agent.package_manager.system_site_packages = true
agent.package_manager.force_upgrade = false
agent.package_manager.conda_channels.0 = pytorch
agent.package_manager.conda_channels.1 = conda-forge
agent.package_manager.conda_channels.2 = defaults
agent.package_manager.torch_nightly = false
agent.venvs_dir = /...

3 years ago
0 Hi There,

With a large enough number of iterations in the for loop, you should see the memory grow over time

one year ago
0 Hi! I Have A Question Regarding Performances Of The Clearml-Server: Are The Calls From The Agents Made Asynchronously/In A Non Blocking Separate Thread? Is The Connection To The Clearml-Server Expected To Be A Bottleneck If The Clearml-Server Is Far From

Is there one?

No, I rather wanted to understand how it worked behind the scene ๐Ÿ™‚

The latest RC (0.17.5rc6) moved all logs into separate subprocess to improve speed with pytorch dataloaders

Thatโ€™s awesome!

3 years ago
0 Hi, I Would Like To Bring Awareness

When running my training code

one year ago
0 Hi, I Would Like To Bring Awareness

oh seems like it is not synced, thank you for noticing (it will be taken care immediately)

Thank you!

does not contain a specific wheel for cuda117 to x86, they use the pip defualt one

Yes so indeed they don't provide support for earlier cuda versions on latest torch versions. But I should still be able to install torch==1.11.0+cu115 even if I have cu117. Before that is what the clearml-agent was doing

one year ago
0 Hi, I Would Like To Bring Awareness

I think we should switch back, and have a configuration to control which mechanism the agent uses , wdyt? (edited)

That sounds great!

one year ago
0 Hi, I Would Like To Bring Awareness

RuntimeError: CUDA error: no kernel image is available for execution on the device

one year ago
0 Hi, I Would Like To Bring Awareness

Could you please clarify? I don't get it

one year ago
0 Hi, I Would Like To Bring Awareness

So the wheel that was working for me was this one: [torch-1.11.0+cu115-cp38-cp38-linux_x86_64.whl](https://download.pytorch.org/whl/cu115/torch-1.11.0%2Bcu115-cp38-cp38-linux_x86_64.whl)
image

one year ago
0 Hi, I Would Like To Bring Awareness

So I suppose clearml-agent is not responsible, because it finds a wheel for torch 1.11.0 with cu117. It just happens that this wheel doesn't work in ec2 g5 instances suprizingly. Either I'll hardcode the correct wheel or I'll upgrade torch to 1.13.0

one year ago
0 Hi, I Would Like To Bring Awareness

This is not the case, I downloaded it and I got a cuda error at runtime

one year ago
0 Hi, I Would Like To Bring Awareness

Ha I just saw in the logs:

WARNING:py.warnings:/root/.clearml/venvs-builds/3.8/lib/python3.8/site-packages/torch/cuda/__init__.py:145: UserWarning:
NVIDIA A10G with CUDA capability sm_86 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_70.
If you want to use the NVIDIA A10G GPU with PyTorch, please check the instructions at 
one year ago
0 Hi, I Would Like To Bring Awareness

I wouldn't do it, this is less code to maintain from your side and honestly too much auto magic makes it difficult for the user to control the environment (ie. to understand what happens behind the scenes). I am not sure what switching back will solve, here the wheel should have been correct, it's just the architecture of the card that is incompatible

one year ago
0 Hi, I Would Like To Bring Awareness

@<1537605940121964544:profile|EnthusiasticShrimp49> I'll try setting the cuda version clearml.conf, thanks for the tip!
@<1523701205467926528:profile|AgitatedDove14> Could you please push the code for that version on github?

one year ago
0 Hi, I Have An Agent That Is Running Two Experiments At The Same Time: One That Was Running For A Long Time (11H) And One That The Agent Picked Up Afterwards, While The First One Was Still Running. Context: I Have 3 Agents Up (Not In Docker Mode) And All O

Hoo I found:
user@trains-agent-1: ps -ax 5199 ? Sl 29:25 python3 -m trains_agent --config-file ~/trains.conf daemon --queue default --log-level DEBUG --detached 6096 ? Sl 30:04 python3 -m trains_agent --config-file ~/trains.conf daemon --queue default --log-level DEBUG --detached

4 years ago
Show more results compactanswers