The MAMBA model transformer with a language modeling head on top (linear layer with weights tied to the input
Since we have a formulation of the discrete representation, let's take a look at how we can actually