Trait2Vec: Ontology-aware embeddings for organismal trait descriptions
Juan J Garcia · James Balhoff · Hilmar Lapp
Abstract
Trait descriptions characterize how an organism looks, behaves, or interacts. These descriptions are typically represented as free text, but they may be manually mapped to terms in an ontology for downstream analysis. This manual mapping is costly, however, and does not scale. In this work we propose a method to fine-tune a transformer model so that it embeds textual trait descriptions in a latent space that captures the notion of distance within an ontology. The resulting model, which we call Trait2Vec, can then embed trait descriptions in a scalable and biologically meaningful computational representation.
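To make the idea of a latent space that "captures the notion of distance within an ontology" concrete, the following is a minimal sketch and not the Trait2Vec training procedure itself. The abstract does not specify the ontology distance or the loss, so everything here is an assumption for illustration: the toy ontology `TOY_ONTOLOGY`, the use of shortest-path length over is-a edges as the ontology distance, cosine distance in the embedding space, and the squared-error objective `ontology_alignment_loss`. In a real setting the embeddings would come from the transformer being fine-tuned, and this quantity would be minimized by gradient descent.

```python
from collections import deque
from itertools import combinations
import math

# Hypothetical toy ontology (child term -> parent terms via is-a edges).
# These terms are illustrative only and are not taken from the paper.
TOY_ONTOLOGY = {
    "fin": ["appendage"],
    "limb": ["appendage"],
    "appendage": ["anatomical structure"],
    "eye": ["anatomical structure"],
    "anatomical structure": [],
}


def undirected_neighbors(ontology):
    """Turn child -> parents edges into an undirected adjacency map."""
    neighbors = {term: set() for term in ontology}
    for child, parents in ontology.items():
        for parent in parents:
            neighbors[child].add(parent)
            neighbors[parent].add(child)
    return neighbors


def path_distance(neighbors, source, target):
    """Shortest path length (in edges) between two terms, via BFS."""
    queue = deque([(source, 0)])
    seen = {source}
    while queue:
        node, dist = queue.popleft()
        if node == target:
            return dist
        for nxt in neighbors[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    raise ValueError("terms are not connected")


def cosine_distance(u, v):
    """1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / norm


def ontology_alignment_loss(embeddings, ontology):
    """Mean squared gap between embedding distance and ontology distance.

    Ontology distances are rescaled to [0, 1] by the largest pairwise
    distance among the embedded terms (an assumed normalization).
    """
    neighbors = undirected_neighbors(ontology)
    pairs = list(combinations(sorted(embeddings), 2))
    max_dist = max(path_distance(neighbors, a, b) for a, b in pairs)
    total = 0.0
    for a, b in pairs:
        target = path_distance(neighbors, a, b) / max_dist
        total += (cosine_distance(embeddings[a], embeddings[b]) - target) ** 2
    return total / len(pairs)
```

Under these assumptions, embeddings that place "fin" and "limb" closer to each other than either is to "eye" yield a lower loss than embeddings that ignore the ontology, which is the property a fine-tuning objective of this kind would reward.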