OutrageousSheep60 1.8.4rc1 is out. Can you please try it? pip install -U clearml==1.8.4rc1
Hi SoreHorse95 ! I think that the way we interact with hydra doesn't account for overrides. We will need to look into this. In the meantime, do you have some sort of stack trace or error log you could share?
Hi @<1719524641879363584:profile|ThankfulClams64> ! What tensorflow/keras version are you using? I noticed that in the TensorBoardImage you are using tf.Summary, which no longer exists since tensorflow 2.2.3, which I believe is too old to work with tensorboard==2.16.2.
Also, how are you stopping and starting the experiments? When starting an experiment, are you resuming training? In that case, you might want to consider setting the initial iteration to the last iteration your prog...
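To illustrate setting the initial iteration when resuming, here is a minimal sketch (the SDK calls are commented out so the offset arithmetic stands alone; `last_iteration` is a hypothetical value saved from the previous run):

```python
# Sketch: offsetting iterations when resuming training. Assumes the clearml
# SDK; the actual calls are commented out and the names are illustrative.
# from clearml import Task
# task = Task.init(project_name="examples", task_name="my-training",
#                  continue_last_task=True)   # resume the previous task
# task.set_initial_iteration(last_iteration)  # reports continue from here

last_iteration = 1000        # hypothetical: where the previous run stopped
step_in_resumed_run = 10     # current step inside the resumed run
effective_iteration = last_iteration + step_in_resumed_run
print(effective_iteration)
```

This way scalars reported in the resumed run line up after the previous run's curves instead of overwriting them from iteration 0.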
Hi EcstaticMouse10 ! Are you using the latest clearml sdk version? If not, can you please upgrade and tell us if you still have this issue?
Hi @<1724235687256920064:profile|LonelyFly9> ! I assume in this case we fail to retrieve the dataset? Can you provide an example when this happens?
Hi @<1534706830800850944:profile|ZealousCoyote89> ! Do you have any info under STATUS REASON? See the screenshot for an example:
Hi @<1578555761724755968:profile|GrievingKoala83> ! The only way I see this error appearing is:
- your process gets forked while launch_multi_node is called
- there has been a network error when receiving the response to Task.enqueue, then the call has been retried, resulting in this error
Can you verify one or the other?
Hi @<1578555761724755968:profile|GrievingKoala83> ! Can you share the logs of all the tasks after setting NCCL_DEBUG=INFO? Also, did it work for you 5 months ago because you were on another clearml version? If it works with another version, can you share that version number?
Hi @<1631102016807768064:profile|ZanySealion18> ! Reporting None is not possible, but you could report np.nan instead.
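A minimal sketch of what I mean (the `report_scalar` call is commented out and assumes an already-initialized task; the title/series names are illustrative):

```python
import math

# Reporting None is not possible, but NaN works as a missing-value marker.
# value = None        # this would fail
value = float("nan")  # np.nan behaves the same way

# Assuming an initialized ClearML task (hypothetical names):
# task.get_logger().report_scalar(
#     title="loss", series="val", value=value, iteration=0)

print(math.isnan(value))
```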
Hi @<1590514584836378624:profile|AmiableSeaturtle81> , I think you are right. We will try to look into this asap
Hi @<1590514584836378624:profile|AmiableSeaturtle81> ! You could get the Dataset Struct configuration object and read the job_size from there, which is the dataset size in bytes. By the way, the task IDs of the datasets are the same as the datasets' IDs, so you can call all the clearml task-related functions on the task you get by doing Task.get_task("dataset_id")
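Roughly, a sketch of the lookup (the SDK calls are commented out, and the exact layout of the Dataset Struct object may differ per version, so treat the field access as an assumption to verify against your own task):

```python
# Sketch (hypothetical field names; check against your own Dataset Struct):
# from clearml import Task
# task = Task.get_task("dataset_id")  # dataset ID == its task ID
# struct = task.get_configuration_object_as_dict("Dataset Struct")
# size_bytes = struct["0"]["job_size"]  # layout may differ per version

size_bytes = 157_286_400            # hypothetical value, in bytes
size_mb = size_bytes / (1024 ** 2)  # convert to MB for readability
print(f"{size_mb:.1f} MB")
```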
Hi @<1590514584836378624:profile|AmiableSeaturtle81> ! Having tqdm installed in your environment might help
Hi @<1590514584836378624:profile|AmiableSeaturtle81> ! Looks like remove_files doesn't support lists indeed. It does support paths with wildcards tho, if that helps.
As a workaround for now, I would remove all the files from the dataset and add back only the ones you need, or just create a new dataset
otherwise, you could run this as a hack:
dataset._dataset_file_entries = {
    k: v
    for k, v in dataset._dataset_file_entries.items()
    if k not in files_to_remove  # you need to define this set of paths
}
then call dataset.remove_files with a path that doesn't exist in the dataset.
@<1590514584836378624:profile|AmiableSeaturtle81> note that we zip the files before uploading them as artifacts to the dataset task. Any chance you are specifying the default output uri as being a local path, such as /tmp?
Hi @<1523701842515595264:profile|PleasantOwl46> ! This looks like a python problem. A useful SO thread: None
First, I would verify that I can access the api server without using the SDK. To do so, run this code after filling the credentials yourself (just login should be enough to verify that the api server is reachable)
api_server = ""
access_key = ""
secret_ke...
Yes, it should work with ClearML if it works with requests
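For the verification step, a small standard-library sketch of reaching the api server (the `auth.login` endpoint path and the Basic-auth scheme are what I'd expect the server to accept; the URL and credentials here are placeholders you must replace):

```python
import base64
import urllib.request

api_server = "https://api.clear.ml"  # placeholder: your own api server URL
access_key = "ACCESS"                # placeholder credentials
secret_key = "SECRET"

# The key pair is sent as HTTP Basic auth to the login endpoint
token = base64.b64encode(f"{access_key}:{secret_key}".encode()).decode()
req = urllib.request.Request(
    f"{api_server}/auth.login",
    headers={"Authorization": f"Basic {token}"},
)
# urllib.request.urlopen(req)  # uncomment to actually hit the server
print(req.full_url)
```

A 200 response with a token in the body means the api server is reachable and the credentials are valid, independent of the SDK.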
Hi @<1594863230964994048:profile|DangerousBee35> ! This GH issue might be relevant to you: None
Did you try sudo update-ca-certificates? Maybe that will work
Hi @<1594863230964994048:profile|DangerousBee35> ! This looks like an ok solution, but I would make the package pip-installable and push it to another repo, then add that repo to a requirements file such that the agent can install it. Other than that, I can’t really think of another easy way to use your package
do you have any STATUS REASON under the INFO section of the controller task?
can you share the logs of the controller?
Hi @<1689446563463565312:profile|SmallTurkey79> ! Regarding "Prior runs of this pipeline worked just fine":
What SDK version were you using for the prior runs? Does this still happen if you revert to that version?
Can you provide a script that imitates what you are doing?
In the pipeline you are running, are you creating new tasks/pipelines/datasets?
are you running this locally or are you enqueueing the task (controller)?
Hi DangerousDragonfly8 ! At the moment, this is not possible, but we do have it in plan (we had some prior requests for this feature)
@<1523701304709353472:profile|OddShrimp85> I believe you need to set the repo argument to point to your repository
You could consider downgrading to something like 1.7.1 in the meantime; it should work with that version
hi OutrageousSheep60 ! We didn't release an RC yet, we will a bit later today tho. We will ping you when it's ready, sorry for the delay
Hi @<1546303277010784256:profile|LivelyBadger26> ! You can either manually change it in the UI, or use None to set it in your code. Our modified example:
import hydra
from omegaconf import OmegaConf
from clearml import Task

@hydra.main(config_path="config_files", config_name="config", version_base=None)
def my_app(cfg):
    # type: (DictConfig) -> None
    task = Task.init(project_name="examples", task_name="Hy...