credit.attend

credit.attend#

Attributes#

Classes#

`AttentionConfig`
`Attend`	Base class for all neural network modules.

Functions#

`exists`(val)
`default`(val, d)
`once`(fn)

Module Contents#

class credit.attend.AttentionConfig#

Bases: tuple

enable_flash#

enable_math#

enable_mem_efficient#

credit.attend.exists(val)#

credit.attend.default(val, d)#

credit.attend.once(fn)#

credit.attend.print_once#

class credit.attend.Attend(dropout=0.0, flash=False, scale=None)#

Bases: torch.nn.Module

Base class for all neural network modules.

Your models should also subclass this class.

Modules can also contain other Modules, allowing them to be nested in a tree structure. You can assign the submodules as regular attributes:

import torch.nn as nn
import torch.nn.functional as F


class Model(nn.Module):
    def __init__(self) -> None:
        super().__init__()
        self.conv1 = nn.Conv2d(1, 20, 5)
        self.conv2 = nn.Conv2d(20, 20, 5)

    def forward(self, x):
        x = F.relu(self.conv1(x))
        return F.relu(self.conv2(x))

Submodules assigned in this way will be registered, and will also have their parameters converted when you call to(), etc.

Note

As per the example above, an __init__() call to the parent class must be made before assignment on the child.

Variables:: training (bool) – Boolean represents whether this module is in training or evaluation mode.

dropout = 0.0#

scale = None#

attn_dropout#

flash = False#

cpu_config#

cuda_config = None#

flash_attn(q, k, v)#

forward(q, k, v)#: einstein notation b - batch h - heads n, i, j - sequence length (base sequence length, source, target) d - feature dimension