ClearML FAQ | Questions with tag - mlops

Questions with tag mlops

0 Votes 5 Answers 2K Views

Hi, I'm using the AWS autoscaler to spin instances. I'd like to use the ClearML agent on the created instances with docker containers. However, even if I set default_docker_image in the parameters on the UI to nvidia/cuda:11.1.1-runtime-ubuntu20.04 the tas...

LovelyHamster1

aws mlops

4 years ago

0 Votes 9 Answers 2K Views

Is there a way to interface with ClearML agent (CLI?) to handle model repositories and data versioning (but not experimentation, tight integration, pipelining, etc.)?

UnevenDolphin73

4 years ago

0 Votes 8 Answers 2K Views

What is the Python version an agent runs a task with? The one the agent was launched with, or the task's? I.e. if I launch an agent with Python 3.8.5, but a task is launched using...

ElegantCoyote26

4 years ago

0 Votes 6 Answers 2K Views

Hi there. I'm following the training instructions for testing ClearML agent (https://allegro.ai/clearml/docs/docs/tutorials/tutorial_tuning_exp.html#step-3-...

BattyLion34

4 years ago

0 Votes 5 Answers 2K Views

Hi again, it seems like the AWS autoscaler is not spinning instances with the EBS configuration I configured. Here is the configuration: resource_configurati...

JitteryCoyote63

aws mlops

4 years ago

0 Votes 0 Answers 2K Views

Hi all, would it be possible to make the AWS autoscaler log each scale in/out operation in the console to help debugging/understanding the course of events?

JitteryCoyote63

aws mlops

4 years ago

0 Votes 17 Answers 2K Views

Hi there, I have a problem with PyJWT: I am using trains==0.16.4 and trains-agent==0.16.3 in my agents. I installed PyJWT==1.7.1 in the agent (through extra_...

JitteryCoyote63

4 years ago

0 Votes 32 Answers 138K Views

Very weird error, trying to run an experiment through an agent in docker mode, and I get this error: docker: Error response from daemon: create /home/elior/De...

WackyRabbit7

4 years ago

0 Votes 4 Answers 2K Views

Hey there, happy new year to all of you 🍾 I have several tasks that are stuck while training a model with pytorch/ignite, more precisely right after uploadi...

JitteryCoyote63

4 years ago

0 Votes 11 Answers 2K Views

Continuing on https://allegroai-trains.slack.com/archives/CTK20V944/p1607012505242500 we'd like to minimize startup time for the agent-started experiments si...

MelancholyBeetle72

4 years ago

0 Votes 9 Answers 2K Views

Hey, great product! I've installed trains agent in a python3 venv, but when I run a script on the worker, it calls python2 instead of python3. How do I change it?

VivaciousWalrus99

4 years ago

0 Votes 2 Answers 2K Views

Dear , We are happy to announce that we will be officially changing the Trains product name. The official announcement will be made in January, but we wanted...

GrumpyPenguin23

4 years ago

0 Votes 5 Answers 2K Views

Hi guys, I would like to start using the AWS autoscaler shipped in Trains. I need to create an IAM user to get and I would like to know what are the minimal permissions required for the autoscaler to work?

JitteryCoyote63

4 years ago

0 Votes 5 Answers 2K Views

Hi! In my project I need to run a lot of experiments on different subsets of my trainset, collect score and perform some calculations based on it. I have mai...

UpsetCrocodile10

4 years ago

0 Votes 4 Answers 2K Views

Hi, I was running a trains agent (version 1.16.1) on a remote machine. I noticed that even if in the trains.conf agent.git_user, agent.git_pass was set, the ...

SmugOx94

4 years ago

0 Votes 18 Answers 2K Views

Hello there, I would like to run cleanup code in case the user aborts a task from the dashboard (the agent is not using the task in docker). What signal should I listen for in the task?

JitteryCoyote63

4 years ago

0 Votes 6 Answers 2K Views

Hey guys. I tried running the PyTorch MNIST example on a trains-agent by running it locally, then resetting the experiment and enqueuing it to the default queue. All works well but it seems the environment building process gets stuck on a manual...

ColossalAnt7

4 years ago

0 Votes 7 Answers 2K Views

Hi AgitatedDove14, I'd appreciate your thoughts on trains-agent on the following topic. To run an experiment by a trains-agent, it must have already been r...

SarcasticSparrow10

4 years ago

0 Votes 8 Answers 2K Views

Quick question on trains-agent and HPO. Say I have 10 experiments enqueued to a trains-agent. I understand the agent runs the experiments one by one. But can...

SarcasticSparrow10

4 years ago

0 Votes 2 Answers 2K Views

Hi guys, last night one of our agents (0.16.1) was disconnected from our trains-server while executing an experiment. I saw that because the experiment it was running had the status aborted and I could not see the agent in the list of available workers. H...

JitteryCoyote63

4 years ago

0 Votes 21 Answers 2K Views

I'm looking to utilize the Trains AWS autoscaler functionality, but after going through its docs a few times I still don't get it. Ultimately, my setup is that I have multiple data scientists working on static instances, and they have queues available to...

WackyRabbit7

5 years ago

0 Votes 6 Answers 2K Views

Thank you for your help so far. I have a question about trains authentication and privacy when deploying on k8s. I want to integrate building a trains-server into our IaC. Now that I got a server to work with an agent deployment I'm thinking about authorizati...

ColossalAnt7

5 years ago

0 Votes 29 Answers 2K Views

Hey guys, another question about deploying my own trains server. I have a trains-server deployed on my k8s cluster using the trains helm chart (which is awesome). Now I want to create a deployment running trains-agent as specified in the [trains-helm repo...

ColossalAnt7

5 years ago

0 Votes 4 Answers 2K Views

Hey there, is there a way to access the trains configuration programmatically at runtime in a task (the configuration that is dumped by the agent in the logs before executing a task)?

JitteryCoyote63

5 years ago

0 Votes 4 Answers 2K Views

Hi, I'm using the dockerized version of trains to get an understanding of trains. While trying to play with the trains.conf settings in ~/trains.conf I got into a state where the agent is not able to clone my repo from...

WickedGoat98

5 years ago

0 Votes 23 Answers 2K Views

Hi, I need your help setting up a trains agent running in docker. I have a Python script calling wget as a system command which runs fine on my dev engine. When cloning the experiment and scheduling it into the services queue I get an error that the call...

WickedGoat98

5 years ago

0 Votes 5 Answers 2K Views

Hi everyone, looking for ML management tools I stumbled upon Trains, I must say it has been awesome so far. I just have a (probably stupid) question: I'm trying to set up a multi-node training environment and I thought I could solve this with agents, but a...

SmilingFrog76

5 years ago

0 Votes 1 Answer 2K Views

Hi guys, firstly, thank you for your efforts and your support. I'm trying to use Allegro Trains to handle the experiments of a git repo. The repo is structured as follows:

SmugOx94

5 years ago

0 Votes 3 Answers 2K Views

Sorry for the bombarding with errors.. but here comes another one 🙂 I have torch installed locally (through the transformers library) and when sending it to...

WackyRabbit7

5 years ago

0 Votes 6 Answers 2K Views

Hey, trying to use trains-agent to run an experiment on my computer. When trying to execute a job from the queue on my agent I'm getting an error that numpy is not installed. How do I have the trains-agent install my...

CloudyHamster42

5 years ago
