credit.attend#
Attributes#
Classes#
Base class for all neural network modules. |
Functions#
Module Contents#
- credit.attend.exists(val)#
- credit.attend.default(val, d)#
- credit.attend.once(fn)#
- credit.attend.print_once#
- class credit.attend.Attend(dropout=0.0, flash=False, scale=None)#
Bases:
torch.nn.ModuleBase class for all neural network modules.
Your models should also subclass this class.
Modules can also contain other Modules, allowing them to be nested in a tree structure. You can assign the submodules as regular attributes:
import torch.nn as nn import torch.nn.functional as F class Model(nn.Module): def __init__(self) -> None: super().__init__() self.conv1 = nn.Conv2d(1, 20, 5) self.conv2 = nn.Conv2d(20, 20, 5) def forward(self, x): x = F.relu(self.conv1(x)) return F.relu(self.conv2(x))
Submodules assigned in this way will be registered, and will also have their parameters converted when you call
to(), etc.Note
As per the example above, an
__init__()call to the parent class must be made before assignment on the child.- Variables:
training (bool) – Boolean represents whether this module is in training or evaluation mode.
- dropout = 0.0#
- scale = None#
- attn_dropout#
- flash = False#
- cpu_config#
- cuda_config = None#
- flash_attn(q, k, v)#
- forward(q, k, v)#
einstein notation b - batch h - heads n, i, j - sequence length (base sequence length, source, target) d - feature dimension