gm.nn.AnchoredPolicy

gm.nn.AnchoredPolicy#

class gemma.gm.nn.AnchoredPolicy(
policy: flax.linen.module.Module,
anchor: flax.linen.module.Module | None = None,
parent: flax.linen.module.Module | flax.core.scope.Scope | flax.linen.module._Sentinel | None = <flax.linen.module._Sentinel object>,
name: str | None = None,
)[source]

Bases: flax.linen.module.Module

围绕模型计算策略和锚点输出的包装器。

此包装器接收一个输入,并通过两个模型传递它:- policy: 训练后的模型。- anchor: 冻结的模型。如果未提供,则设置为 policy 的副本。

要初始化模型,请使用 gm.ckpts.AnchoredPolicyLoader

policy: flax.linen.module.Module
anchor: flax.linen.module.Module | None = None
name: str | None = None
parent: flax.linen.module.Module | flax.core.scope.Scope | flax.linen.module._Sentinel | None = None
scope: Scope | None = None