I’m Using Catboost For Training, But Sadly It Does Not Have A Native Integration With Clearml (Xgboost And Lightgbm Do Have Integrations). But Catboost Writes Down Training Logs In Tensorboard Format (Into A

Answered

I’m using catboost for training, but sadly it does not have a native integration with clearml (xgboost and lightgbm do have integrations). But catboost writes down training logs in tensorboard format (into a .tfevents file). How can I integrate this file into clearml?

  				
Posted 4 years ago by FiercePenguin76


Answers 15

Hmm, I think everything is generated inside the C++ library code, and Python is just an external interface. That means there is no way to collect the metrics as they are created (i.e. inside the C++ code), so the only way to collect them is to actively read and analyze the tfrecord created by catboost 😞
Is there any Python code that does that (reads the tfrecords it creates)?
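
One way to do that reading is TensorBoard's own `EventAccumulator`. The sketch below is an illustration under assumptions, not something verified against catboost in this thread: it assumes the `tensorboard` package is installed, that catboost stores its metrics as plain scalar summaries, and that `logdir` is the folder that directly contains the `.tfevents` file. `read_tfevents_scalars` and `forward_scalars` are hypothetical helper names.

```python
from typing import Callable, Iterable, Iterator, Tuple

ScalarPoint = Tuple[str, int, float]  # (tag, step, value)


def read_tfevents_scalars(logdir: str) -> Iterator[ScalarPoint]:
    """Yield every scalar point found in the event file(s) under `logdir`."""
    # Imported lazily so this module still loads where tensorboard is absent.
    from tensorboard.backend.event_processing import event_accumulator

    accumulator = event_accumulator.EventAccumulator(
        logdir,
        size_guidance={event_accumulator.SCALARS: 0},  # 0 = keep all points
    )
    accumulator.Reload()  # actually reads the file(s) from disk
    for tag in accumulator.Tags()["scalars"]:
        for event in accumulator.Scalars(tag):
            yield tag, event.step, event.value


def forward_scalars(
    points: Iterable[ScalarPoint],
    report: Callable[..., None],
    title: str = "catboost",  # hypothetical plot title
) -> int:
    """Send each point to `report` and return how many points were sent."""
    count = 0
    for tag, step, value in points:
        report(title=title, series=tag, value=value, iteration=step)
        count += 1
    return count
```

With clearml, `report` could be `Logger.current_logger().report_scalar`, which takes the same `title`, `series`, `value`, and `iteration` keyword arguments, for example `forward_scalars(read_tfevents_scalars("catboost_info/learn"), Logger.current_logger().report_scalar)`. The `catboost_info/learn` path is an assumption about the default output layout.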

  				
Posted 4 years ago by AgitatedDove14

https://github.com/catboost/catboost/search?q=tensorboard

  				
Posted 4 years ago by FiercePenguin76

Wanted to check if MLFlow supports catboost. Apparently, it does. Pull request was merged 16 hours ago. Nice timing 😃

  				
Posted 4 years ago by FiercePenguin76

Hi FiercePenguin76
Is catboost actually using TensorBoard (TB), or is it just writing the .tfevents file on its own?

  				
Posted 4 years ago by AgitatedDove14

it certainly does not use tensorboard python lib

  				
Posted 4 years ago by FiercePenguin76

“it certainly does not use tensorboard python lib”

Hmm, yes, I assume this is why the automagic is not working 😞

Does it have a pythonic interface for the metrics?

  				
Posted 4 years ago by AgitatedDove14

Yep 😞

  				
Posted 4 years ago by AgitatedDove14

Looking into the output folder of catboost, I see 3 types of metrics outputs:
- tfevents (can be read by tensorboard)
- catboost_training.json (custom (?) format). It is read here to be shown as an ipython widget: https://github.com/catboost/catboost/blob/c2a6ed0cb85869a73a13d08bf8df8d17320f8215/catboost/python-package/catboost/widget/ipythonwidget.py#L93
- learn_error.tsv, test_error.tsv, time_left.tsv, which have the same data as the json. Apparently they are meant to be used with this stale metrics viewer project: https://github.com/catboost/catboost-viewer
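
Of the three outputs, the `.tsv` files are probably the easiest to parse without extra dependencies. Here is a minimal sketch using only the standard library. It assumes the layout is a header row whose first column is the iteration number and whose remaining columns are metric names; that layout is an assumption, so check it against your own `learn_error.tsv`. `read_error_tsv` is a hypothetical helper name.

```python
import csv
from typing import Dict, List, Tuple


def read_error_tsv(path: str) -> Dict[str, List[Tuple[int, float]]]:
    """Return {metric_name: [(iteration, value), ...]} from a *_error.tsv file."""
    series: Dict[str, List[Tuple[int, float]]] = {}
    with open(path, newline="") as handle:
        reader = csv.reader(handle, delimiter="\t")
        # Assumed layout: first column = iteration, other columns = metrics.
        header = next(reader)
        metric_names = header[1:]
        for name in metric_names:
            series[name] = []
        for row in reader:
            if not row:  # skip blank lines
                continue
            iteration = int(row[0])
            for name, cell in zip(metric_names, row[1:]):
                series[name].append((iteration, float(cell)))
    return series
```

Each `(iteration, value)` pair can then be passed to clearml, for example with `Logger.current_logger().report_scalar(title=name, series="learn", value=value, iteration=iteration)`.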

  				
Posted 4 years ago by FiercePenguin76

Actually that is less interesting, as it is quite straightforward

  				
Posted 4 years ago by AgitatedDove14

as I understand, it uses tensorboard from C++ code

  				
Posted 4 years ago by FiercePenguin76

Although it is only for model tracking, autologging is yet to be implemented there

  				
Posted 4 years ago by FiercePenguin76

https://github.com/mlflow/mlflow/pull/2417

  				
Posted 4 years ago by FiercePenguin76

Yes, but as you mentioned, everything is created inside the lib, which means the Python code cannot intercept the metrics for clearml to send to the backend.
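
Intercepting metrics during training is the hard part, but they can still be read back once training has finished. As a rough sketch, a fitted catboost model exposes `get_evals_result()`, which returns a nested dict of per-iteration values; the helper below (hypothetical name `report_evals_result`) assumes the shape `{dataset: {metric: [values...]}}` and forwards every value to a reporting callable.

```python
from typing import Callable, Dict, List


def report_evals_result(
    evals_result: Dict[str, Dict[str, List[float]]],
    report: Callable[..., None],
) -> int:
    """Forward every per-iteration value to `report`; return the number sent."""
    count = 0
    for dataset, metrics in evals_result.items():
        for metric, values in metrics.items():
            for iteration, value in enumerate(values):
                # title=metric, series=dataset puts all datasets for one
                # metric on the same plot.
                report(
                    title=metric,
                    series=dataset,
                    value=float(value),
                    iteration=iteration,
                )
                count += 1
    return count
```

After `model.fit(...)`, this could be called as `report_evals_result(model.get_evals_result(), Logger.current_logger().report_scalar)`. The scalars only show up in clearml after training ends, so this is not a live view.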

  				
Posted 4 years ago by AgitatedDove14

I guess this is the one https://catboost.ai/docs/concepts/python-reference_catboostipythonwidget.html

  				
Posted 4 years ago by FiercePenguin76

nope, catboost docs offer to manually run tensorboard against the output folder https://catboost.ai/docs/features/visualization_tensorboard.html

  				
Posted 4 years ago by FiercePenguin76
