Representing neural networks as computational graphs enables learning from diverse architectures using graph neural networks and transformers.