Hey, Using K8S With Trains 0.16.1-320, All Of A Sudden The Entire Data (I.E Experiments, Tasks, Api Creds) Is Not Showing In The Ui Anymore. All Logs Seems To Be Fine Afai Can Tell... Any Idea What Went Wrong?

Answered

hey, using k8s with trains 0.16.1-320, all of a sudden the entire data (i.e experiments, tasks, API creds) is not showing in the UI anymore. All logs seems to be fine AFAI can tell... any idea what went wrong?

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Votes Newest

Answers 30

No worries, and I hope you manage to get that backup.

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

https://techbeacon.com/enterprise-it/5-ways-lose-data-kubernetes-how-avoid-them
see (1) and (4)

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

https://stackoverflow.com/questions/37743683/why-is-an-empty-mongodb-database-so-big

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

yea the api server configuration also went away

okay that proves it

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

azure

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

(I mean new logs, while we are here did it report any progress)

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

I will investigate a bit more and then check if I can recover

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Now I suspect what happened is it stayed on another node, and your k8s never took care of that

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

And if this is the case, that would explain the empty elastic as well

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

backup?

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Could it be it was never allocated to begin with ?

what do you mean?

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

In that case, I think it is stuck on a previous Node, I can;t think of any other reason.
Do you have something else on the same PV that was lost ? like api server configuration?

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

so if the node went down and then some other node came up, the data is lost

That might be the case. where is the k8s running ? cloud service ?

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

nothing for now

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

I wonder if it's completely lost

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

That somehow the PV never worked and it was all local inside the pod

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

yea the api server configuration also went away

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Meaning the node restarted (or actually moved)

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

ohh sec

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

check if you have any more of those recovery reports in the mongo log, it should report progress

I think I have sent you all the existing logs

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

thank you for your time and support, I appreciate it!

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Damn 😞

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

but the PV seems to be just a path to the labeled node

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

I hope so

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

no that's for sure not

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Oh dear, I think your theory might be correct, and this is just the mongo preallocating storage.
Which means the entire /opt/trains just disappeared

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

🤞

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Could it be it was never allocated to begin with ?

  				
Posted 
	4 years ago

					More  		
  Report
		
					AgitatedDove14
				
					0
					 × 1

Now I suspect what happened is it stayed on another node, and your k8s never took care of that

that's an interesting theory

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

so if the node went down and then some other node came up, the data is lost

  				
Posted 
	4 years ago

					More  		
  Report
		
					DefiantCrab67
				
					0
					 × 1

Write your answer

1K Views

30 Answers

4 years ago

2 years ago