Hey, So We Noticed The

Answered

Hey, so we noticed the /var/lib/docker/overlay2 directory on the clearml-server is growing a lot in size, we added more disk space but we want to put something in place to stop this growing too much.
These are the options I’ve looked into:

docker system prune - removes all stopped containers, all networks not used by at least one container, all dangling images, all dangling build cache, Problem: we don’t really know what this is pruning
docker image prune --all - removes all images without at least one container associated to them
Set the max-size in docker-compose.yaml for logging

Are the first 2 options safe to run without killing the server? I’m not happy on removing files without knowing what they are.
Are there any plans to automate this in the future?

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

Votes Newest

Answers 43

Try looking at their logs

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

back up and running again, thanks for your help

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

Hey there waves

Not sure about plans to automate this in the future, as this is more how docker behaves and not really clearml, especially with the overlay2 filesystem. The biggest offender usually is your json logfiles. have a look in /var/lib/docker/containers/ for *.log

assuming this IS the case, you can tell docker to only log upto a max-size .. I have mine set to 100m or some such

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					AlertBlackbird30
				
					0
					 × 1

After making the change yesterday to the docker-compose file, the server is completely unusable - this is all I see for the /dashboard screen

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

think I found the issue, a typo in apiserver.conf

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

hhrrmm.. in the initial problem, you mentioned that the /var/lib/docker/overlay2 was growing large in size.. but.. 4GB seems "fine" for docker images.. I wonder .. does your nvme0n1p1 ever report like 85% or 90% used or do you think that the 4GB is a lot ? when you restart the server, does the % used noticeably drop ? that would suggest tmp files inside the docker image itself which.. is possible with docker (weird but, possible)

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					AlertBlackbird30
				
					0
					 × 1

Basically whatever was under the old /opt/trains/ folder is required, you can see the list here: None

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

it looks like clearml-apiserver and clearml-fileserver are continually restarting

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

yeah, that's usually the case when you get an empty dashboard

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

Check sudo docker logs <container-name>

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					SuccessfulKoala55
				
					0
					 × 1

🤔 i'll add the logging max_size now and monitor over the next week

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

not entirely sure on this as we used the custom AMI solution, is there any documentation on it?

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					RoundCat60
				
					0
					 × 1

@<1687643893996195840:profile|RoundCat60> you set it once, inside the docker-compose itself.. it will affect all docker containers but, to be honest, docker tends to log everything

  				
Posted 
	3 years ago

					More
				  		
  Report
		
					AlertBlackbird30
				
					0
					 × 1

Show more results

Write your answer

25K Views

43 Answers

3 years ago

7 months ago