Callbacks that make decisions depending on how a monitored metric/loss behaves

class TerminateOnNaNCallback[source]

TerminateOnNaNCallback(before_fit=None, before_epoch=None, before_train=None, before_batch=None, after_pred=None, after_loss=None, before_backward=None, after_backward=None, after_step=None, after_cancel_batch=None, after_batch=None, after_cancel_train=None, after_train=None, before_validate=None, after_cancel_validate=None, after_validate=None, after_cancel_epoch=None, after_epoch=None, after_cancel_fit=None, after_fit=None) :: Callback

A Callback that terminates training if the loss is NaN.

learn = synth_learner()
learn.fit(10, lr=100, cbs=TerminateOnNaNCallback())
epoch train_loss valid_loss time
0 733056513731005412711487968341131264.000000 00:00
assert len(learn.recorder.losses) < 10 * len(learn.dls.train)
for l in learn.recorder.losses:
    assert not torch.isinf(l) and not torch.isnan(l) 

class TrackerCallback[source]

TrackerCallback(monitor='valid_loss', comp=None, min_delta=0.0) :: Callback

A Callback that keeps track of the best value in monitor.

When implementing a Callback whose behavior depends on the best value of a metric or loss, subclass this Callback and use its best (the best value so far) and new_best (True if there was a new best value this epoch) attributes.

comp is the comparison operator used to determine if a value is better than another (defaults to np.less if 'loss' is in the name passed in monitor, np.greater otherwise) and min_delta is an optional float that requires a new value to beat the current best (in the direction given by comp) by at least that amount.
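To make the comp/min_delta semantics concrete, here is a minimal, self-contained sketch of the tracking logic in plain Python (illustrative only; the Tracker class and its update method are hypothetical helpers, not fastai's API):

```python
import numpy as np

class Tracker:
    "Minimal sketch of TrackerCallback's comparison logic (not fastai's actual class)."
    def __init__(self, monitor='valid_loss', comp=None, min_delta=0.0):
        if comp is None:
            comp = np.less if 'loss' in monitor or 'error' in monitor else np.greater
        if comp == np.less: min_delta *= -1  # flip sign so one test works in both directions
        self.comp, self.min_delta = comp, min_delta
        self.best = float('inf') if comp == np.less else -float('inf')
        self.new_best = False

    def update(self, val):
        "Set `new_best` if `val` beats `best` by at least `min_delta`, and record it."
        self.new_best = bool(self.comp(val - self.min_delta, self.best))
        if self.new_best: self.best = val
        return self.new_best

t = Tracker(min_delta=0.1)
t.update(24.20)  # first value is always a new best
t.update(24.15)  # lower, but not by min_delta=0.1, so new_best is False
```

Note that with min_delta=0.1 a loss must drop by more than 0.1 below the current best to count as an improvement; with a metric like accuracy it must rise by that amount.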

class EarlyStoppingCallback[source]

EarlyStoppingCallback(monitor='valid_loss', comp=None, min_delta=0.0, patience=1) :: TrackerCallback

A TrackerCallback that terminates training when the monitored quantity stops improving.

comp is the comparison operator used to determine if a value is better than another (defaults to np.less if 'loss' is in the name passed in monitor, np.greater otherwise) and min_delta is an optional float that requires a new value to beat the current best (in the direction given by comp) by at least that amount. patience is the number of epochs you're willing to wait without improvement before stopping.
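The interaction of min_delta and patience can be sketched in plain Python (an illustrative EarlyStopper helper, not fastai's implementation; it assumes a loss, where lower is better):

```python
class EarlyStopper:
    "Stop after `patience` epochs without the monitored value improving by `min_delta`."
    def __init__(self, min_delta=0.0, patience=1):
        self.best, self.wait = float('inf'), 0  # assumes lower is better
        self.min_delta, self.patience = min_delta, patience

    def step(self, val):
        "Call once per epoch; returns True when training should stop."
        if val < self.best - self.min_delta:
            self.best, self.wait = val, 0
        else:
            self.wait += 1
        return self.wait >= self.patience
```

This mirrors the run below: none of the later valid losses beat epoch 0's value by min_delta=0.1, so training stops once patience=2 epochs have passed without improvement.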

learn = synth_learner(n_trn=2, metrics=F.mse_loss)
learn.fit(n_epoch=200, lr=1e-7, cbs=EarlyStoppingCallback(monitor='mse_loss', min_delta=0.1, patience=2))
epoch train_loss valid_loss mse_loss time
0 19.993200 24.202908 24.202908 00:00
1 20.007574 24.202845 24.202845 00:00
2 20.021687 24.202751 24.202751 00:00
No improvement since epoch 0: early stopping
learn.validate()
(#2) [24.20275115966797,24.20275115966797]
learn = synth_learner(n_trn=2)
learn.fit(n_epoch=200, lr=1e-7, cbs=EarlyStoppingCallback(monitor='valid_loss', min_delta=0.1, patience=2))
epoch train_loss valid_loss time
0 12.963860 10.800257 00:00
1 12.936502 10.800226 00:00
2 12.926699 10.800186 00:00
No improvement since epoch 0: early stopping

class SaveModelCallback[source]

SaveModelCallback(monitor='valid_loss', comp=None, min_delta=0.0, fname='model', every_epoch=False, with_opt=False) :: TrackerCallback

A TrackerCallback that saves the model's best version during training and loads it at the end.

comp is the comparison operator used to determine if a value is better than another (defaults to np.less if 'loss' is in the name passed in monitor, np.greater otherwise) and min_delta is an optional float that requires a new value to beat the current best (in the direction given by comp) by at least that amount. The model will be saved in learn.path/learn.model_dir/fname.pth, either every epoch (if every_epoch=True) or at each improvement of the monitored quantity.

learn = synth_learner(n_trn=2, path=Path.cwd()/'tmp')
learn.fit(n_epoch=2, cbs=SaveModelCallback())
assert (Path.cwd()/'tmp/models/model.pth').exists()
learn.fit(n_epoch=2, cbs=SaveModelCallback(every_epoch=True))
for i in range(2): assert (Path.cwd()/f'tmp/models/model_{i}.pth').exists()
shutil.rmtree(Path.cwd()/'tmp')
epoch train_loss valid_loss time
0 22.050220 19.015476 00:00
1 21.870991 18.548334 00:00
Better model found at epoch 0 with valid_loss value: 19.01547622680664.
Better model found at epoch 1 with valid_loss value: 18.5483341217041.
epoch train_loss valid_loss time
0 21.132572 17.913414 00:00
1 20.745022 17.148394 00:00


class ReduceLROnPlateau[source]

ReduceLROnPlateau(monitor='valid_loss', comp=None, min_delta=0.0, patience=1, factor=10.0, min_lr=0) :: TrackerCallback

A TrackerCallback that reduces the learning rate when a metric has stopped improving.

learn = synth_learner(n_trn=2)
learn.fit(n_epoch=4, lr=1e-7, cbs=ReduceLROnPlateau(monitor='valid_loss', min_delta=0.1, patience=2))
epoch train_loss valid_loss time
0 19.359529 19.566990 00:00
1 19.375505 19.566935 00:00
2 19.362509 19.566856 00:00
3 19.386513 19.566845 00:00
Epoch 2: reducing lr to 1e-08
learn = synth_learner(n_trn=2)
learn.fit(n_epoch=6, lr=5e-8, cbs=ReduceLROnPlateau(monitor='valid_loss', min_delta=0.1, patience=2, min_lr=1e-8))
epoch train_loss valid_loss time
0 11.854585 8.217508 00:00
1 11.875463 8.217495 00:00
2 11.885604 8.217478 00:00
3 11.876034 8.217473 00:00
4 11.872295 8.217467 00:00
5 11.874965 8.217461 00:00
Epoch 2: reducing lr to 1e-08