Enveda Biosciences's repositories
kgem-ensembles-in-drug-discovery
Source code and data repository for "Ensembles of knowledge graph embedding models improve predictions for drug discovery"
ccs-prediction
Evaluating the generalizability of graph neural networks for predicting collision cross section
np-clinical-trials
Source code and data for "Natural products have increased rates of clinical trial success throughout the drug development process"
sinusoidal-embedding
code to generate sinusoidal embeddings
weighting-spectral-similarity
Scripts and notebooks from Weighting low-intensity MS/MS ions and m/z frequency for spectral library annotation
ethnobotany
Source code and data for "Modern drug discovery using ethnobotany: A large-scale cross-cultural analysis of traditional medicine reveals common therapeutic uses"
misosoup-preview
Farm-to-Table Mass-Spec Data Processing
plant-chemical-space
Source code and data for "Exploring the known chemical space of the plant kingdom: Insights into taxonomic patterns, knowledge gaps, and bioactive regions"
transcriptomic-target-correlation
On the correspondence between the transcriptomic response of a compound and its targets
biomedical-nlp-datasets
Tools for curating biomedical training data for large-scale language modeling
commonMZ
A collection of common mz values found in mass spectrometry.
GenBioEL
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning[NAACL 2022]
libPeak
Methods of peak detection for analytical instruments
nbdev-demo
Demonstration of nbdev-based development
hgraph2graph
Hierarchical Generation of Molecular Graphs using Structural Motifs
sapbert
[NAACL & ACL 2021] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
sparc-multiomics
SPARC CCF Multi-omics analysis