pantheonrl.algos.adap.util.get_context_kl_loss
- get_context_kl_loss(policy, model, train_batch)[source]
Gets the KL loss for ADAP
- Parameters:
policy (ADAP) –
model (AdapPolicy) –
train_batch (RolloutBufferSamples) –
Gets the KL loss for ADAP
policy (ADAP) –
model (AdapPolicy) –
train_batch (RolloutBufferSamples) –