LittleShrimp86 did you try to run the pipeline from the UI on remote machines (i.e. with the agents)? Did that work?
For reporting the console logs you can use: logger.report_text("my log line here", print_console=False)
https://github.com/allegroai/clearml/blob/b4942321340563724bc16f60ea5dd78c9161778d/clearml/logger.py#L120
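For example, a minimal sketch (project/task names here are just placeholders):

from clearml import Task

# placeholders: project / task names are assumptions
task = Task.init(project_name="examples", task_name="console logging demo")
logger = task.get_logger()
# report the line to the task's console log without also printing it to stdout
logger.report_text("my log line here", print_console=False)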
Notice you should be able to override them in the UI (under the Args section)
Wow, thank you very much. And how would I bind my code to task?
you mean the code that creates pipeline Tasks ?
(remember the pipeline itself is a Task in the system, basically if your pipeline code is a single script it will pack the entire thing )
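If it helps, a minimal controller-style sketch (project, step and queue names are placeholders), just to show that the controller itself registers as a Task:

from clearml.automation import PipelineController

# placeholders: project / step names and queue are assumptions
pipe = PipelineController(name="my pipeline", project="examples", version="0.0.1")
pipe.add_step(name="step_one", base_task_project="examples", base_task_name="step one task")
pipe.start(queue="services")  # the controller itself runs as a Task on this queue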
Try removing this magic environment variable that tells the sub-process there was already an initialized Task.
import os
env = dict(**os.environ)
env.pop('TRAINS_PROC_MASTER_ID', None)
🙂
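For example, a rough sketch of launching the sub-process with the cleaned environment (the script name is just a placeholder):

import os
import subprocess

env = dict(**os.environ)
env.pop('TRAINS_PROC_MASTER_ID', None)  # so the child process starts its own Task
subprocess.Popen(['python', 'my_sub_script.py'], env=env)  # 'my_sub_script.py' is hypothetical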
print(requests.get(url='
Hi WittyOwl57
Are you starting a new server from scratch or is it running on previously stored data?
Hi SubstantialElk6
Generally speaking here, the idea is that actual code creates a Dataset (i.e. a Dataset class created from code), plus you can add some metric reporting (like table reporting) to create a preview of the stored data for better visibility, or maybe create some statistics as part of the data ingest script. Then this ingest code can be relaunched / automated. The created Dataset itself can be tagged, renamed, or have key/value pairs added for better cataloging. wdyt?
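Roughly something like this, just as a sketch (dataset/project names, tags, path and the stats table are placeholders):

from clearml import Dataset
import pandas as pd

# placeholders: project / dataset names, tags and local path are assumptions
ds = Dataset.create(dataset_name="my_dataset", dataset_project="data", dataset_tags=["ingest-v1"])
ds.add_files(path="./raw_data")

# optional: attach a small preview / statistics table for better visibility in the UI
stats = pd.DataFrame({"files": [123], "total_mb": [456]})
ds.get_logger().report_table(title="ingest stats", series="summary", iteration=0, table_plot=stats)

ds.upload()
ds.finalize()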
'-v', '/tmp/clearml_agent.ssh.cbvchse1:/.ssh',
My bad; after that, inside the container it does cp -Rf /.ssh ~/.ssh
The reason is that you cannot know the user's home folder before spinning up the container
Anyhow the point is, are you sure that you have ~/.ssh on the Host machine configured?
And if you do, are you saying this is part of your AMI? If not, how did you put it there?
See the log:
Collecting keras-contrib==2.0.8
File was already downloaded c:\users\mateus.ca\.clearml\pip-download-cache\cu0\keras_contrib-2.0.8-py3-none-any.whl
so it did download it, but it failed to pass it correctly?!
Can you try with clearml-agent==1.5.3rc2?
SmarmySeaurchin8 check the logs, maybe you can find something there
RobustGoldfish9 do you see the trains-agent listed as a machine in the UI (under Workers)?
JitteryCoyote63 you mean from code?
Can you put the task.connect line here? (btw: I would assume there is no need for an additional connect if using hydra+fire, no?)
PompousHawk82 what do you mean by:
but the thing is that i can only use master to log everything
We could use our 8xA100 as 8 workers, for 8 single-gpu jobs running faster than on a single 1xV100 each.
@<1546665634195050496:profile|SolidGoose91> I think that in order to have the flexibility there you need the "dynamic" GPU allocation that is only part of the "enterprise" offering 😞
That said, why not allocate a single A100 machine?
that really depends on how much data you have there, and on the setup. The upside of the files-server is that you do not need to worry about credentials; the downside is that storage is more expensive
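For instance, the destination is just the output_uri at Task.init (the bucket path below is a placeholder, and S3 would need credentials in clearml.conf); omit output_uri to fall back to the default files-server:

from clearml import Task

# 's3://my-bucket/artifacts' is a placeholder destination
task = Task.init(project_name="examples", task_name="storage demo",
                 output_uri="s3://my-bucket/artifacts")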
any chance StorageManager could re-download files only if their size is different from file in cache (as an option)?
I think there is a force argument, to force the download.
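i.e. something along these lines, if I remember the argument name correctly (the URL is a placeholder):

from clearml import StorageManager

# force_download=True should bypass the local cache and re-download the file
local_path = StorageManager.get_local_copy(
    remote_url="s3://my-bucket/data/file.zip", force_download=True)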
I think the main issue is getting the size from different backends (i.e. s3 /https / etc.)
Maybe we should add it as a GitHub feature request issue?
The main limitation is that the driver "list()" does not return file size.
For example it might be an issue with the default http files-server.
wdyt?
is it a shared network mount? could you just delete the entire ~/.clearml on the host machine?
Scheduled training is what I’m looking forward to
I'll try to remember to update here once we push into the GitHub repo, feedback is always appreciated 🙂
If in the next two weeks you hear nothing, please ping here to make sure I did not forget 😉
VirtuousFish83 I remember an issue on GitHub with something similar, what's the clearml-server version you are using?
@<1523701066867150848:profile|JitteryCoyote63>
I just created a new venv and ran
pip install "torch==1.11.0.*" --extra-index-url
Then started python:
import torch
torch.cuda.is_available()
And I get True
what are you getting?
DilapidatedDucks58 I'm assuming clearml-server 1.7 ?
I think both are fixed in 1.8 (due to be released either next week, or the one after)
StickyBlackbird93 the agent is supposed to solve for the correct version of pytorch based on the CUDA in the container. Sounds like for some reason it fails? Can you provide the log of the Task that failed? Are you running the agent in docker-mode, or inside a docker?
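For reference, docker-mode just means the agent daemon was spun up with the --docker flag, roughly like this (the queue name and base image are only examples):

clearml-agent daemon --queue default --docker nvidia/cuda:11.7.1-runtime-ubuntu22.04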
Should work out of the box, as long as the task was started. You can forcefully start the task with: task.mark_started()
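e.g. a quick sketch (the task ID is a placeholder):

from clearml import Task

task = Task.get_task(task_id="<task_id>")  # placeholder ID
task.mark_started()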
Hi @<1573119962950668288:profile|ObliviousSealion5>
Hello, I don't really like the idea of providing my own github credentials to the ClearML agent. We have a local ClearML deployment.
if you own the agent, that should not be an issue, no?
forward my SSH credentials using ssh -A and then starting the clearml agent?
When you are running the agent and you force git cloning with SSH, it will automatically map ~/.ssh into the container for git to use
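If I recall correctly, the relevant setting in the agent's clearml.conf is roughly this (please double-check against your own config file):

agent {
    # force the agent to convert git http(s) links to ssh, so ~/.ssh is mapped into the container
    force_git_ssh_protocol: true
}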
Ba...