Oh, that makes sense. Technically I assume so. Is this an HF logger option? Note that ClearML is already integrated with HF on the HF side; do they report that when the TB logger is used?
I mean that the HF Trainer by default reports a single grad_norm scalar for the whole model to ClearML. I wonder if I can extend this to report grad_norm per layer.
I guess they don’t. Is there an easy way to add callbacks to the HF Trainer for reporting extra info?
Hi PanickyBee11
You mean this is not automatically logged? Do you have a callback that logs it in HF?
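For reference, here is a minimal sketch of such a callback, assuming a recent transformers version that exposes on_pre_optimizer_step (gradients are still populated there; on_step_end fires after they are zeroed). The class name PerLayerGradNormCallback is just illustrative:

```python
from clearml import Task
from transformers import TrainerCallback


class PerLayerGradNormCallback(TrainerCallback):
    """Report the L2 gradient norm of each named parameter to ClearML."""

    def on_pre_optimizer_step(self, args, state, control, model=None, **kwargs):
        # Skip if no ClearML task is running or the model was not passed in.
        task = Task.current_task()
        if task is None or model is None:
            return
        logger = task.get_logger()
        for name, param in model.named_parameters():
            if param.grad is not None:
                logger.report_scalar(
                    title="grad_norm_per_layer",
                    series=name,  # one series per parameter tensor
                    value=param.grad.detach().norm(2).item(),
                    iteration=state.global_step,
                )
```

Then pass it to the Trainer, e.g. Trainer(..., callbacks=[PerLayerGradNormCallback()]). Note this logs one scalar per parameter tensor, which can be a lot of series for a large model; you may want to aggregate by module instead.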