CostlyOstrich36

0 Questions, 4213 Answers

Active since 10 January 2023

Last activity 2 years ago

0 Hi, I'M Using Fractional Gpu Container, But I Can'T Profile Any Thing, The Nsys Says Only "Segmentation Fault (Core Dumped)" Can You Guys Help Me And Do You Know What Is Wrong With This?

What command did you run? What were you trying to do? What was the setup?

3 months ago

0 I Have The User/Pass Set Up On The

Also, make sure to install virtualenv; I see there was a failure in the log on that as well.

11 months ago

0 Hi, I Am Trying To Set Up The Gcp Autoscaler. I Created A Gcp Service Account And Granted It The Following Roles: Compute Admin, Service Account User, And Logs Writer. I Then Added Its Credentials Under The Gcp Credentials Section In Clearml. Vm Instanc

Hi @<1826791494376230912:profile|CornyLobster42> , can you add logs from the VMs themselves? They should be saved on the Autoscaler

6 months ago

0 How To Version Models While Training In Production

Hi @<1639074542859063296:profile|StunningSwallow12> , here are the docs for the agent - None

one year ago

0 Hi Everyone. I’M Struggling To Setup Minio Storage. Below Is What I’M Adding In My Credentials And When I Try To Create A New Dataset Using Below Command; I Get Errors: Configs:

Try running the following script

from clearml import Task
import time

task = Task.init(output_uri="<your output URI>")

print("start sleep")
time.sleep(20)
print("end sleep")

Please add the logs

one year ago

0 Hi, I'M Experiencing Some Fairly Slow Uploads Of A New Dataset Version. I'M Running A Local Server And I'M Uploading A ~20Gb Update To A ~30Gb Dataset Consisting Of Few Hundreds Files, Each Up To Several Hundred Mbs. It Seems That Compressing And Upload I

Hi @<1547028074090991616:profile|ShaggySwan64> , so the issue is when writing to the files server? Is it possible that the machine itself is having a hard time writing the data?

one year ago

0 Hi Community, I Am Trying To Run A Pipeline That Has Only A Single Step Defined As A Task, But I Get A Bizarre Error When I Run

Hi @<1618780810947596288:profile|ExuberantLion50> , can you please add a code snippet that reproduces this?

one year ago

0 Hi!! I Have A Question, I Have A Dataset Saved In S3 Using Clearml, Is There A Way Of Getting The Full Path Of This Dataset In S3?? Something Like

Hi @<1673863788857659392:profile|HomelyRabbit25> , the Dataset object should have artifacts and those should have a url attribute. I'd suggest poking around there!
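A minimal sketch of that suggestion, assuming the dataset ID can be passed to `Task.get_task` to reach the task behind the dataset, and that each entry in `task.artifacts` carries a `url` attribute as described above. The helper names `artifact_urls` and `dataset_artifact_urls` are made up for illustration.

```python
from typing import Any, Dict, Mapping


def artifact_urls(artifacts: Mapping[str, Any]) -> Dict[str, str]:
    """Map artifact name -> url, skipping artifacts that have no url set."""
    return {
        name: artifact.url
        for name, artifact in artifacts.items()
        if getattr(artifact, "url", None)
    }


def dataset_artifact_urls(dataset_id: str) -> Dict[str, str]:
    """List the artifact URLs of the task that backs a dataset.

    Assumption: the dataset ID doubles as the ID of its backing task.
    """
    # Imported lazily: this needs clearml installed and a configured clearml.conf.
    from clearml import Task

    task = Task.get_task(task_id=dataset_id)
    return artifact_urls(task.artifacts)
```

Calling `dataset_artifact_urls("<dataset id>")` should then give a name-to-URL mapping to poke around in; whether those URLs point at the exact S3 prefix you expect depends on how the dataset was uploaded.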

one year ago

0 Does

Hi ElegantCoyote26 ,

What happens if you delete ~/.clearml (This is the default cache for ClearML) and rerun?

3 years ago

0 Hello Guys, Can You Tell Me Where The Console Outputs Are Stored? For Some Reason, All Outputs Have Disappeared From All My Pipelines. Any Explanation, Does Anyone Have An Idea What Might Have Happened?

Hi @<1702492411105644544:profile|YummyGrasshopper29> , console logs are saved in Elastic. I would check on the status of your Elastic container

one year ago

0 Hi Guys! Im Running Into An Issue... Im Trying To Do My Master Thesis And Clearml Is A Framework Im Investing A Lot In. And Rn Im Trying To Set Up The Serving Module Into The Server, But If I Run It On The Server I Get Connection Issues Using External Ip+

Also, I don't think the serving should run on the same machine as the server as serving can require quite a lot of resources

8 months ago

0 If Possible, I Would Like All Together Prevent The Fileserver And Write Everything To S3 (Without Needing Every User To Change Their Config)

In the UI, open the experiment view, go to the execution tab and scroll to the bottom. You will have a field called "OUTPUT". What is in there? Please check this for an experiment that is giving you trouble.

3 years ago

0 Hello All, I'Ve A Question On S3 Integration. I'Ve Deployed Clearml And Clearml-Agent Helm Charts In My Ovh Managed K8S Cluster. I'Ve Jupyterhub Running In Same Namespace. I Was Trying To Make Connection With My Ovh S3 Bucket From Jupyter Notebook By Fetc

Hi @<1665891247245496320:profile|TimelyOtter30> , not sure I follow. It looks like a misconfiguration. I think you need to see the correct settings here: None , also note the direct reference to minio 🙂

one year ago

0 How Should I Edit The

Very similar to a task, a project also has a unique identifier - the ID (although I think project names are also unique)

You can get the project ID either from the UI (if you go to a specific project, the project ID will be in the URL) or from the API as documented in:
https://clear.ml/docs/latest/docs/references/api/projects#post-projectsget_all
or from the sdk as documented here:
https://clear.ml/docs/latest/docs/references/sdk/task#taskget_project_id

Plug that project ID into the filter ...
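A small sketch of the two lookups mentioned above. The URL layout (`/projects/<project_id>/...`) is an assumption about how the web UI builds its links, and `project_id_from_url` / `project_id_from_name` are hypothetical helper names; `Task.get_project_id` is the SDK call from the linked reference.

```python
from typing import Optional
from urllib.parse import urlparse


def project_id_from_url(url: str) -> Optional[str]:
    """Return the path segment that follows 'projects' in a web UI URL.

    Assumption: the URL looks like https://<host>/projects/<project_id>/...
    """
    segments = [s for s in urlparse(url).path.split("/") if s]
    if "projects" in segments:
        index = segments.index("projects")
        if index + 1 < len(segments):
            return segments[index + 1]
    return None


def project_id_from_name(project_name: str) -> Optional[str]:
    """Resolve a project name to its ID through the SDK."""
    # Imported lazily: this needs clearml installed and a configured clearml.conf.
    from clearml import Task

    return Task.get_project_id(project_name)
```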

4 years ago

0 Hi, I'M Trying To Upload Data To Clearml Parallelly. Is It Impossible To Use

MagnificentWorm7 , I'm checking whether it's possible 🙂
As a workaround, I think you could split the data into several separate datasets and then use Dataset.squash to merge them into a single dataset
https://clear.ml/docs/latest/docs/references/sdk/dataset#datasetsquash
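Roughly, the workaround could look like the sketch below: split the file list into chunks, upload each chunk as its own dataset, then squash the results. The helper names and the round-robin split are illustrative assumptions, not something prescribed by ClearML; only `Dataset.create`, `add_files`, `upload`, `finalize` and `Dataset.squash` are SDK calls.

```python
from typing import List, Sequence


def split_into_chunks(paths: Sequence[str], n_chunks: int) -> List[List[str]]:
    """Deal paths round-robin into at most n_chunks non-empty groups."""
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    chunks = [list(paths[i::n_chunks]) for i in range(n_chunks)]
    return [chunk for chunk in chunks if chunk]


def upload_chunk(paths: Sequence[str], name: str, project: str) -> str:
    """Create, upload and finalize one partial dataset; return its ID."""
    # Imported lazily: this needs clearml installed and a configured clearml.conf.
    from clearml import Dataset

    dataset = Dataset.create(dataset_name=name, dataset_project=project)
    for path in paths:
        dataset.add_files(path=path)
    dataset.upload()
    dataset.finalize()
    return dataset.id


def squash_chunks(dataset_ids: Sequence[str], name: str):
    """Merge the partial datasets into a single dataset."""
    from clearml import Dataset

    return Dataset.squash(dataset_name=name, dataset_ids=list(dataset_ids))
```

Each `upload_chunk` call is independent of the others, so they could be run from separate processes or machines before `squash_chunks` is called once on the collected IDs.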

3 years ago

0 Hi All! I Am In The Process Of Setting Up Clearml-Serving On My Kubernetes Cluster Using The Provided Helm Charts. Currently I Am Stuck With Running The Control Task. When I Call

Hi @<1526371965655322624:profile|NuttyCamel41> , can you add the full log?

one year ago

0 Hi Everyone, I'M Pau, From Spain, And New To This Community. I Am Self Hosting A Clearml Server In A Linux Machine. I Want To Deploy A Flask-App Inside Another Container That Uploads A Dataset Whenever It Receives An Event From Another App. The Container

Hi @<1813020708339453952:profile|PompousGoldfish33> , it looks like clearml.conf isn't configured in the environment that the flask app is running in. Which process is giving this traceback?

8 months ago

0 Heya, I Hope You'Re All Well In This Beautiful Day, My Gcp Autoscaler Just Died With That Strange But Short Backtrace, Wondered If It Rang A Bell To Any Of You ?

Hi FierceHamster54 , is this an old autoscaler instance? What is the version? You can see the version when you're on the application and click on 'More' at the top left text area

3 years ago

0 Hi! How To Correctly Configure Clearml With Pytorch-Ignite To Write Checkpoints To The

Hi @<1523708920831414272:profile|SuperficialDolphin93> , simply set output_uri="/mnt/nfs/shared" in Task.init

11 months ago

0 Hi, Can I Run A Single Hyperparameter Optimization Task With At Least 100 Experiments On Multiple Machines By Using Clearml-Agent And Queue?

Hi @<1664079296102141952:profile|DangerousStarfish38> , you can control it in the agent.default_docker.image section of the clearml.conf on the machine where the agent is running. You can also control it via the CLI with the --docker flag and, finally, via the web UI in the execution tab -> container -> image section

one year ago

0 Hey, I'D Like To Store My Trained Models, Results Of Transformers Training, Into Local Disk. I Tried To Set Up

Hi @<1570220844972511232:profile|ObnoxiousBluewhale25> , you can click on the model in the artifacts tab and that should take you to the model repository. What is logged in the url of the model?

2 years ago

0 Hi All, Quick Question About Clearml Datasets. Does Anyone Know If It Is Possible To Access (Could Just Be Paths To The Data In A Bucket) A Dataset Directly From S3, Instead Of Downloading A Local Copy? We Typically Store And Access Large Quantities Of

Hi @<1686909730389233664:profile|AmiableSheep6> , I could suggest using the StorageManager module to pull specific files from S3.

There is no option to download specific files from a dataset. I would suggest breaking it into smaller versions.

You would, however, need to pull the data locally for training anyway. Wouldn't breaking it into smaller versions help with this issue?
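For the StorageManager route, a minimal sketch, assuming the files live under plain `s3://<bucket>/<key>` URLs and that S3 credentials are already set in clearml.conf. The bucket and key values and the helper names are placeholders.

```python
from typing import Dict, Iterable, Optional


def s3_urls(bucket: str, keys: Iterable[str]) -> Dict[str, str]:
    """Build s3:// URLs for the given object keys."""
    return {key: f"s3://{bucket}/{key.lstrip('/')}" for key in keys}


def fetch_files(bucket: str, keys: Iterable[str]) -> Dict[str, Optional[str]]:
    """Download only the requested objects; return key -> local cached path."""
    # Imported lazily: this needs clearml installed and S3 credentials configured.
    from clearml import StorageManager

    return {
        key: StorageManager.get_local_copy(remote_url=url)
        for key, url in s3_urls(bucket, keys).items()
    }
```

`get_local_copy` still downloads each file into the local cache, so this only limits which files are pulled; it does not stream them from the bucket.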

one year ago

0 Hi! After My Experiment Finishes, The Logs And Scalars Disappear After Some Time -- I Can See It At Task Details (Screen 1). But Scalars Are Displayed In The Usual Tabular Form (Screen 2). Additionally, All Of My Previous Experiments (For Few Years) Also

Do you see any errors in the dev tools console (F12)?
Also are there any errors in elastic?

7 months ago

0 Hello! My Workers Utilization Is Empty And Not Showing Any Graphs. Do You Know How I Can Troubleshoot This?

Also, if you open Developer Tools, do you see any errors in the console?

one year ago

0 Hey, I'D Like To Store My Trained Models, Results Of Transformers Training, Into Local Disk. I Tried To Set Up

What if you set the default_output_uri to false ?

2 years ago

0 Hi, Another Bug To Report With The Aws_Auto_Scaler Using 1.1.2:

JitteryCoyote63 Does it happen to you also with 1.1.1?

4 years ago

0 Hi, I See That Debug Samples Are Taking Up A Huge Amount Of Space. I Want To Limit The Amount Of Debug Images Which Are Stored. I See There Is An Option For That Here:

Hi CloudySwallow27 ,

I think currently the way to do this is by disabling the framework detection and reporting the debug images manually.

You can do this with Task.init(auto_connect_frameworks=False)
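As a rough sketch of what manual reporting could look like: framework detection is switched off at init, and a debug image is sent only every N iterations. The `every=100` interval, the title/series strings and the helper names are arbitrary choices for illustration.

```python
def should_report(iteration: int, every: int) -> bool:
    """True on iterations 0, every, 2 * every, ..."""
    if every < 1:
        raise ValueError("every must be at least 1")
    return iteration % every == 0


def init_task_without_auto_logging(project: str, name: str):
    """Start a task with automatic framework logging disabled."""
    # Imported lazily: this needs clearml installed and a configured clearml.conf.
    from clearml import Task

    return Task.init(
        project_name=project,
        task_name=name,
        auto_connect_frameworks=False,
    )


def report_debug_image(task, iteration: int, image_path: str, every: int = 100) -> bool:
    """Report one debug image every `every` iterations; return True if sent."""
    if not should_report(iteration, every):
        return False
    task.get_logger().report_image(
        title="debug",
        series="sample",
        iteration=iteration,
        local_path=image_path,
    )
    return True
```

Note that `auto_connect_frameworks=False` turns off all automatic framework logging, not just debug images, so anything else you relied on being captured automatically would need to be reported manually as well.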

3 years ago

0 Hey, Has Anyone Played With Lifecycle Policy On Their Clearml Aws S3 Storage ? I Had To Rollback A Simple Lifecycle Policy Changing The Access Tier Of Some Object After Some Time To Save Cost Because Dataset Uploading Was Completely Frozen After That (St

Does it go back to working if you revert the changes?

2 years ago

0 Hi, Anyone Also Stuck With The Exception Encountered Uploading Pytorch Model File? The Dataset Upload Works Fine, Though.

Can you verify that your ~/clearml.conf has the proper configuration? If you run
from clearml import Task
t = Task.init()
does this work?

3 years ago

0 Is Vllm Inference An Enterprise Only Feature (

Hi @<1892021261433835520:profile|EnchantingMouse92> , I see that it says at the start of the page you linked that it is an enterprise only feature 🙂

Regarding differences, you can find a comparison between the different versions at this page - None

Just scroll down and you'll have different sections you can expand to see the differences.

23 days ago