AlphaFold 2 Thread

No.12493564 ViewReplyOriginalReport
DeepMind still has not published any further information on the architecture of AlphaFold 2. There are hints that transformers are used, but information on the iterative attention based model is sparse. How would a pairwise residue matrix even work with a transformer? Tensor dimensions are Batch × Sequence × Sequence × Residue embedding, but transformers can only ingest Batch × Sequence length × embedding... Any ideas /sci/?