pantheonrl.algos.adap.util.get_context_kl_loss

get_context_kl_loss(policy, model, train_batch)[source]

Gets the KL loss for ADAP

Parameters:
  • policy (ADAP) –

  • model (AdapPolicy) –

  • train_batch (RolloutBufferSamples) –