Nick Doiron

monsoon-nlp

AI & ML interests

multilingual and language tutor models

Articles

Organizations

Posts 1

I'm working on Matryoshka embeddings for proteins 🦠🧬. While that's cooking, here are cosine distances of selected pairs from UniProt's 1024-dim embeddings, within train/test/validation splits: monsoon-nlp/protein-pairs-uniprot-swissprot
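
For anyone who wants to see what a cosine distance over these vectors looks like, here is a minimal sketch in plain Python. It assumes the common definition (1 minus cosine similarity), which may not match exactly how the dataset's values were computed. The `truncate` helper and the 4-dim toy vectors are hypothetical stand-ins for the real 1024-dim embeddings; keeping only a prefix of the vector is the usual way Matryoshka-style embeddings are shortened.

```python
import math
from typing import List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity for two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_a * norm_b)


def truncate(vec: Sequence[float], dim: int) -> List[float]:
    """Hypothetical helper: keep only the first `dim` components (Matryoshka-style prefix)."""
    return list(vec[:dim])


# Toy 4-dim vectors standing in for two protein embeddings (not real data).
protein_a = [1.0, 0.0, 1.0, 0.0]
protein_b = [1.0, 0.0, 0.0, 1.0]

full_distance = cosine_distance(protein_a, protein_b)
short_distance = cosine_distance(truncate(protein_a, 2), truncate(protein_b, 2))
print(f"full: {full_distance:.3f}, truncated to 2 dims: {short_distance:.3f}")
```

The point of comparing the two numbers is to check how much of the pairwise structure survives when only the leading dimensions are kept.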