Answered

What could be the reason for my package not loading under "Installed Packages"?

What could be the reason for my package not loading under "INSTALLED PACKAGES"?
I have a "." in the requirements.txt.
I added a task.add_requirements({NAME_OF_PACKAGE}) call.
The task runs correctly on my machine; the problem appears when I reset it and enqueue it.

  				
Posted 3 years ago by MagnificentSeaurchin79


Answers 29

the my_package now works ok 🙂

  				
Posted 3 years ago by MagnificentSeaurchin79

is "my_package" a local package ?
what is the output of:
pip freeze | grep my_package

  				
Posted 3 years ago by AgitatedDove14

Could you test with the latest "clearml"?
pip install git+
Task.add_requirements(".") should be supported now 🙂

  				
Posted 3 years ago by AgitatedDove14

what could be wrong?

  				
Posted 3 years ago by MagnificentSeaurchin79

it seems that I need to add it ( import pandas ) in the main file...even though I don't use it there...

  				
Posted 3 years ago by MagnificentSeaurchin79

and in dummy_module I have:

import pandas as pd

def func(args):
    pd.read_csv(args.file)

  				
Posted 3 years ago by MagnificentSeaurchin79

Hi AgitatedDove14 , now I'm seeing under INSTALLED PACKAGES both "." and "my_package"..what could be the issue here?

  				
Posted 3 years ago by MagnificentSeaurchin79

so in my main file I have:

from my_package import dummy_module

dummy_module.func(args)

  				
Posted 3 years ago by MagnificentSeaurchin79

for it to work in a remote worker

  				
Posted 3 years ago by MagnificentSeaurchin79

MagnificentSeaurchin79
"requirements.txt" is ignored if the Task has an "installed packages" section (i.e. not completely empty).
Task.add_requirements('pandas') needs to be called before Task.init() (I'll make sure there is a warning if called after).

  				
Posted 3 years ago by AgitatedDove14
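
The ordering AgitatedDove14 describes above could look roughly like the sketch below. It assumes the clearml package is installed and configured; the helper name init_task_with_requirements and its arguments are hypothetical, chosen only for illustration.

```python
def init_task_with_requirements(project_name, task_name, packages):
    """Register extra requirements first, then initialise the Task."""
    # Imported inside the function so this file can be imported
    # even where clearml is not installed.
    from clearml import Task

    # Per the thread, requirements must be registered BEFORE Task.init().
    for package in packages:
        Task.add_requirements(package)

    return Task.init(project_name=project_name, task_name=task_name)
```

In the main script this would be called once at the top, for example init_task_with_requirements("examples", "requirements demo", ["pandas"]), where the project and task names are placeholders.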

  				

nope

  				
Posted 3 years ago by MagnificentSeaurchin79

I think that worked, because now I'm having a different issue: it says it cannot import pandas. I have it both in my requirements.txt and in task.add_requirements('pandas').

  				
Posted 3 years ago by MagnificentSeaurchin79

Python 3.7.3 (default, Dec 20 2019, 18:57:59) [GCC 8.3.0]

absl_py == 0.10.0
azure_storage_blob == 12.7.1
clearml == 0.17.4
google_cloud_storage == 1.35.0

Detailed import analysis

**************************

IMPORT PACKAGE absl_py

generate_tfrecord_pipeline.py: 3

IMPORT PACKAGE azure_storage_blob

clearml.storage: 0

IMPORT PACKAGE clearml

generate_tfrecord_pipeline.py: 1

IMPORT PACKAGE google_cloud_storage

clearml.storage: 0

  				
Posted 3 years ago by MagnificentSeaurchin79

So the "packages" are the packages you need in the steps themselves ?

  				
Posted 3 years ago by AgitatedDove14

no, my_package is never added manually

  				
Posted 3 years ago by MagnificentSeaurchin79

exactly

  				
Posted 3 years ago by MagnificentSeaurchin79

the thing is that I have to manually add all imports of packages that don't appear in my main script

  				
Posted 3 years ago by MagnificentSeaurchin79

So if everything works you should see "my_package" package in the "installed packages"
the assumption is that if you do:
pip install "my_package"
It will set "pandas" as one of its dependencies, and pip will automatically pull pandas as well.
That way we do not list the entire venv you are running on, just the packages/versions you are using, and we let pip sort the dependencies when installing with the agent
Make sense ?
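
One way to check that assumption locally is to ask the installed package metadata which dependencies "my_package" declares. This is only a sketch using the standard library (importlib.metadata, Python 3.8+); the helper names are made up for illustration and are not part of clearml.

```python
import re
from importlib import metadata


def normalized_name(requirement):
    """Extract the project name from a requirement string, e.g. 'Pandas>=1.0' -> 'pandas'."""
    match = re.match(r"\s*([A-Za-z0-9][A-Za-z0-9._-]*)", requirement)
    name = match.group(1) if match else ""
    # Treat runs of '-', '_' and '.' as equivalent, as pip does for project names.
    return re.sub(r"[-_.]+", "-", name).lower()


def declared_dependencies(distribution_name):
    """Return the requirement strings a distribution declares, or None if it is not installed."""
    try:
        requirements = metadata.requires(distribution_name)
    except metadata.PackageNotFoundError:
        return None
    # metadata.requires() returns None when no dependencies are declared.
    return list(requirements or [])


def declares(distribution_name, dependency):
    """True if `dependency` is among the declared requirements of an installed distribution."""
    wanted = normalized_name(dependency)
    deps = declared_dependencies(distribution_name) or []
    return any(normalized_name(req) == wanted for req in deps)
```

If declares("my_package", "pandas") is False after installing the package, pip has no way to know it should pull pandas, which would match the import error seen on the remote worker.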

  				
Posted 3 years ago by AgitatedDove14

and then it works

  				
Posted 3 years ago by MagnificentSeaurchin79

great, thanks! 🙂

  				
Posted 3 years ago by MagnificentSeaurchin79

clearml should detect the "main" packages used in the repository (not just the main script). Their dependencies will be installed automatically by pip when the agent installs the environment. Once the agent is done setting up the environment, it updates the Task with the full list of packages, including all required packages.
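
To illustrate the difference between scanning only the main script and scanning the whole repository, here is a small sketch built on the standard ast module. It is not clearml's actual detection code, only an approximation of the idea; the function names are hypothetical, and the two sources mirror the main file and dummy_module from this thread.

```python
import ast


def top_level_imports(source):
    """Return the top-level module names imported by a piece of Python source."""
    names = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names.update(alias.name.split(".")[0] for alias in node.names)
        elif isinstance(node, ast.ImportFrom) and node.level == 0 and node.module:
            # level == 0 skips relative imports such as "from . import x".
            names.add(node.module.split(".")[0])
    return names


def repository_imports(sources):
    """Union the imports of every file; `sources` maps a file name to its source text."""
    found = set()
    for text in sources.values():
        found |= top_level_imports(text)
    return found


MAIN_SOURCE = "from my_package import dummy_module\n\ndummy_module.func(args)\n"
DUMMY_MODULE_SOURCE = "import pandas as pd\n\ndef func(args):\n    pd.read_csv(args.file)\n"

main_only = top_level_imports(MAIN_SOURCE)
whole_repo = repository_imports({"main.py": MAIN_SOURCE, "dummy_module.py": DUMMY_MODULE_SOURCE})
```

Scanning only the main script finds my_package, while scanning both files also finds pandas, which is why pandas can go missing when the analysis does not reach dummy_module.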

  				
Posted 3 years ago by AgitatedDove14

What exactly do you get automatically on the "Installed Packages" (meaning the "my_package" line)?

  				
Posted 3 years ago by AgitatedDove14

it fails because my_package can't be installed using pip...so I have to manually edit the section and remove the "my_package"

MagnificentSeaurchin79 did you manually add both "." and my_package ?
If so, what was the reasoning to add my_package if pip cannot install it ?

  				
Posted 3 years ago by AgitatedDove14

and then when running in agent mode, it fails because my_package can't be installed using pip...so I have to manually edit the section and remove the "my_package"

  				
Posted 3 years ago by MagnificentSeaurchin79

and what about those packages that are not being loaded because they don't appear in the main file?

  				
Posted 3 years ago by MagnificentSeaurchin79

MagnificentSeaurchin79 do you have the "." package listed under "installed packages" after you reset the Task ?

  				
Posted 3 years ago by AgitatedDove14

in this example my main func is going to be the script that creates the pipeline controller

  				
Posted 3 years ago by MagnificentSeaurchin79

In that case, when you create the Tasks for the steps, do not specify any packages/requirements; the agent will then just use the "requirements.txt" from the repository.
If you need to, you can also specify them when you create the Task itself, see https://github.com/allegroai/clearml/blob/912f6f5ba2328b26de042de03f02de5802df360f/clearml/task.py#L608
https://github.com/allegroai/clearml/blob/912f6f5ba2328b26de042de03f02de5802df360f/clearml/task.py#L609
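
A rough sketch of both options, assuming the linked lines refer to the packages and requirements_file arguments of Task.create (worth checking against the clearml version you have installed). The wrapper name create_step_task and all argument values are placeholders, not something from the thread.

```python
def create_step_task(project_name, task_name, repo, script,
                     packages=None, requirements_file=None):
    """Create a step Task, optionally pinning its requirements.

    Leaving both `packages` and `requirements_file` as None follows the advice
    above: nothing is specified, so the agent is expected to fall back to the
    repository's requirements.txt.
    """
    # Imported inside the function so this file can be imported
    # even where clearml is not installed.
    from clearml import Task

    return Task.create(
        project_name=project_name,
        task_name=task_name,
        repo=repo,
        script=script,
        packages=packages,                    # e.g. ["pandas"], assumed parameter name
        requirements_file=requirements_file,  # e.g. "requirements.txt", assumed parameter name
    )
```

Calling it without the last two arguments leaves the requirements to the repository; passing packages=["pandas"] would pin them on the Task itself.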

  				
Posted 3 years ago by AgitatedDove14
