You will need to change burn_in, max_batches and steps between the two cases. For example, if your final target is 500200, your first .cfg file should have this:
burn_in=100
max_batches = 50000
policy=steps
steps=40000,45000
and the second file like this:
burn_in=1000
max_batches = 500200
policy=steps
steps=400000,450000
You only need to change learning_rate if you get a NaN. According to this, you should then divide learning_rate by the number of GPUs and multiply burn_in by the same number.
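As a rough illustration of that adjustment, here is a minimal Python sketch. The helper name scale_for_multi_gpu and the starting learning_rate of 0.001 are assumptions made for the example, not values from the .cfg files above. The burn_in of 1000 is the one from the second file.

```python
def scale_for_multi_gpu(learning_rate: float, burn_in: int, num_gpus: int) -> tuple[float, int]:
    """Divide learning_rate by the GPU count and multiply burn_in by it.

    Hypothetical helper: it only does the arithmetic described above,
    it does not read or write any .cfg file.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return learning_rate / num_gpus, burn_in * num_gpus


if __name__ == "__main__":
    # learning_rate=0.001 is an assumed starting value; burn_in=1000 is from the second .cfg
    lr, burn = scale_for_multi_gpu(learning_rate=0.001, burn_in=1000, num_gpus=4)
    # Print the two lines in the same key=value form the .cfg file uses
    print(f"learning_rate={lr}")
    print(f"burn_in={burn}")
```

The printed values would then be copied by hand into the multi-GPU .cfg, and only if training actually produced a NaN.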