Is there a way to have only one environment for the whole pipeline?
Yes, it recreates the venv (or fetches it from cache). If you need your dataset, use the Dataset class (it will cache it persistently, so there is no need to re-download).
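A minimal sketch of what using the Dataset class could look like here, assuming `clearml` is installed and configured. The project and dataset names are hypothetical placeholders, and the two helper functions are only for illustration, not part of ClearML itself:

```python
from pathlib import Path

# Hypothetical names, used only for illustration.
DATASET_PROJECT = "examples"
DATASET_NAME = "pipeline-dataset"


def publish_dataset(local_dir: str) -> str:
    """Upload a local directory as a ClearML dataset and return its id."""
    # Imported lazily so the module loads even where clearml is missing.
    from clearml import Dataset

    ds = Dataset.create(dataset_name=DATASET_NAME, dataset_project=DATASET_PROJECT)
    ds.add_files(path=local_dir)  # register the files in the directory
    ds.upload()                   # push them to the configured storage
    ds.finalize()                 # close the dataset version
    return ds.id


def fetch_dataset(dataset_id: str) -> Path:
    """Return a local, cached, read-only copy of the dataset."""
    from clearml import Dataset

    return Path(Dataset.get(dataset_id=dataset_id).get_local_copy())
```

The first component would call `publish_dataset(...)` and return the id, and the component that trains the model would call `fetch_dataset(...)` with that id instead of looking for a directory inside the cloned repository.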
Hi @<1610083503607648256:profile|DiminutiveToad80>
Yes, it does. They are also cached by default (on the machine with the agent)
so inside /Users/adityachaudhry/.clearml/venvs-builds.1/3.10/task_repository/ I have my git repo. One component makes a dataset directory inside this git repo, but when the other component starts executing, this dataset directory is not there
This means that if all the components of the pipeline use the exact same packages, it will just reuse the venv. Does that make sense?
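As a rough sketch of that setup, assuming `clearml` is installed: every step below is registered with the same `packages` list, so the agent's venv cache has a chance to serve one environment to all of them. The step functions, the pipeline and project names, and the package list are hypothetical, and whether the cache is actually hit depends on the agent configuration:

```python
# Assumption: identical requirements for every step, so the agent can
# reuse one cached venv instead of building a new one per component.
SHARED_PACKAGES = ["clearml"]  # hypothetical package list


def prepare(n: int) -> list:
    """First step: produce some values (stand-in for real data prep)."""
    return list(range(n))


def train(values: list) -> int:
    """Second step: consume the values returned by the first step."""
    return sum(values)


def build_pipeline():
    """Register both steps with the same package list and return the controller."""
    # Imported lazily so the module loads even where clearml is missing.
    from clearml import PipelineController

    pipe = PipelineController(name="shared-env-pipeline", project="examples", version="1.0.0")
    pipe.add_function_step(
        name="prepare",
        function=prepare,
        function_kwargs=dict(n=5),
        function_return=["values"],
        packages=SHARED_PACKAGES,
    )
    pipe.add_function_step(
        name="train",
        function=train,
        # Reference the return value of the previous step by name.
        function_kwargs=dict(values="${prepare.values}"),
        function_return=["total"],
        packages=SHARED_PACKAGES,
    )
    return pipe
```

Calling `build_pipeline().start()` would enqueue the pipeline. Note that data moves between steps through return values, not through files left in the working directory.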
What do you mean? The same env for all components? If they are using/importing exactly the same packages, and using the same container, then yes, it could.
so my model is not able to find the dataset