multi_head_attention

Classes

MultiHeadAttention

Based on the paper, each layer has two sublayers: