I think for this you would need to report this manually. You can extract all of this data using the API and then create custom plots/scalars that you can push into reports for custom dashboards 🙂
GiganticTurtle0 , does it pose some sort of problem? What version are you using?
Can you paste the output up to the point where it gets stuck? Sounds very strange. Does it work when it's not enqueued? Also, what version of clearml-agent & server are you on?
Hi @<1749965229388730368:profile|UnevenDeer21> , can you add the log of the job that failed?
Also, note that you can set these arguments from the web UI on the task level itself: go to the Execution tab and then the Container section
You can create a queue through the UI. You can go into Workers & Queues tab -> Queues -> "New Queue"
You can also create new queues using the API
https://clear.ml/docs/latest/docs/references/api/queues#post-queuescreate
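A minimal sketch of the API route, assuming the `clearml` Python package is installed and `clearml.conf` holds valid credentials. The helper names and the queue name are hypothetical; `APIClient().queues.create` wraps the `queues.create` endpoint linked above.

```python
def queue_create_payload(name, tags=None):
    """Build the body for the `queues.create` endpoint (helper name is hypothetical)."""
    name = name.strip()
    if not name:
        raise ValueError("queue name must not be empty")
    payload = {"name": name}
    if tags:
        payload["tags"] = list(tags)
    return payload


def create_queue(name, tags=None):
    """Create the queue on the server configured in clearml.conf."""
    # Imported lazily so the payload helper works without the SDK installed.
    from clearml.backend_api.session.client import APIClient

    client = APIClient()
    return client.queues.create(**queue_create_payload(name, tags))


# Example (needs a reachable ClearML server; the queue name is a placeholder):
# create_queue("gpu_high_priority")
```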
JitteryCoyote63 , let me take a look if it happens to me as well 🙂
Is Ubuntu on the client side, or did you change the OS on the server side?
SmugDolphin23 , maybe you have an idea?
You're totally right, if you managed to upload to a bucket then folder failure should be unrelated to permissions
You mean that you have 30 jobs each in a separate queue and you'd like to move all of them to top priority in each queue?
Did you try what I added? Also the screenshot is too small, nothing is readable
JitteryCoyote63 I'll take a look, thanks a lot for the heads up! 🙂
GentleSwallow91 , you can also use Task.create()
https://clear.ml/docs/latest/docs/references/sdk/task#taskcreate
Yeah I think you're right. You're basically looking for a capability to segregate users into groups so they won't have visibility between themselves?
JitteryCoyote63 , are you on a self hosted server? It seems the issue was fixed in the 3.8 release, and I think the fix should be included in the next self hosted release
So if I manually add a dataset (many Excel files) in a folder, and copy that folder to NFC
How would you do that?
Sounds like an issue with your deployment. Did your Devops deploy this? How was it deployed?
Hi @<1708653001188577280:profile|QuaintOwl32> , you can control all of this on the task level. For example through code you can use Task.set_base_docker
You can add all of these as arguments
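Roughly what that could look like in code, as a sketch rather than the exact setup discussed here. The image name, the argument values, and both helper names are placeholders; `Task.set_base_docker` accepts `docker_image`, `docker_arguments` and `docker_setup_bash_script` in recent SDK versions.

```python
def docker_settings(image, arguments=(), setup_script=None):
    """Collect keyword arguments for Task.set_base_docker (helper name is hypothetical)."""
    settings = {"docker_image": image}
    if arguments:
        # Arguments are passed as a single space-separated string.
        settings["docker_arguments"] = " ".join(arguments)
    if setup_script:
        settings["docker_setup_bash_script"] = setup_script
    return settings


def init_task_with_container(project, name, **settings):
    """Create a task and attach the container settings to it."""
    # Imported lazily so docker_settings can be used without the SDK installed.
    from clearml import Task

    task = Task.init(project_name=project, task_name=name)
    task.set_base_docker(**settings)
    return task


# Example (placeholder image and arguments; needs a configured ClearML server):
# init_task_with_container(
#     "examples", "container demo",
#     **docker_settings("nvidia/cuda:12.1.0-runtime-ubuntu22.04", ["--ipc=host"]),
# )
```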
You added two logs, one with docker the other without. Each stopped on a different step. Is that consistent? What OS is the agent running on? Also what is the command you're using to run the agent?
Hi, what & how are you running? Is it a local deployment?
Hi @<1752139552044093440:profile|UptightPenguin12> , for that you would need to use the API and use the mark_completed call with the force flag on
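A short sketch of that call through the Python SDK, assuming a recent `clearml` version in which `Task.mark_completed` accepts `force`. The function name and the `get_task` parameter are hypothetical; the parameter only exists so the lookup can be swapped out.

```python
def force_complete(task_id, get_task=None):
    """Mark a task as completed even if its current status would normally block it."""
    if get_task is None:
        # Imported lazily; requires the clearml package and valid credentials.
        from clearml import Task

        def get_task(tid):
            return Task.get_task(task_id=tid)

    task = get_task(task_id)
    task.mark_completed(force=True)
    return task


# Example (replace the placeholder with a real task ID):
# force_complete("<task-id>")
```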
Hi @<1539417873305309184:profile|DangerousMole43> , I don't think such a capability exists. You can however add a link to a file as a parameter and then run it from the GUI.
Regarding the drag and drop capability I think it's a really cool idea! Maybe open a feature request on GitHub?
Can you please add the ~/clearml.conf for the agent? Also, are you trying to run everything on the same machine or different ones?
Hi @<1541592204353474560:profile|GhastlySeaurchin98> , how are you running the experiments - which type of machines - local or cloud? Are you running your own server or using the community?
Did you try this key/secret pair on other machines?
Hi @<1523701122311655424:profile|VexedElephant56> , I think this is achievable with Slurm + ClearML, however I don't think something like this exists out of the box
Then you can define the git credentials that can clone these repositories
That's the problem. ClearML has to detect the uncommitted changes somehow. This is done while the code itself is running or when running with execute_remotely(). Otherwise, someone has to do a git diff and push it into the task object (database)
Hi WhoppingMole85 , you can actually do that with the logger.
Something along the lines of: Dataset.get_logger().report_table(title="Data Sample", series="First Ten Rows", table_plot=data1[:10])
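Spelled out a bit more, as a sketch: `get_logger()` is called on a dataset instance (for example one returned by `Dataset.get`), and `table` is assumed to be a pandas DataFrame like `data1`. The function name and the project and dataset names in the usage comment are placeholders.

```python
def report_sample(dataset, table, n=10):
    """Report the first n rows of a table to the dataset's logger."""
    sample = table[:n]  # row slice, same as data1[:10] above
    dataset.get_logger().report_table(
        title="Data Sample",
        series="First Ten Rows",
        iteration=0,
        table_plot=sample,
    )
    return sample


# Example (placeholder names; needs a configured ClearML server and pandas):
# from clearml import Dataset
# import pandas as pd
# ds = Dataset.get(dataset_project="my_project", dataset_name="my_dataset")
# report_sample(ds, pd.read_csv("data.csv"))
```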
Does this help?