anindabitm / InstaDeep-Enzyme-Classification-Challenge

Code for Zindi Competition of the same name

InstaDeep-Enzyme-Classification-Challenge

In this challenge, Zindi asks participants to do the following: given a library of labelled sequences from some well-known organisms, create a model that can label sequences from new organisms. Each sequence could represent any kind of enzyme, not just kinases.

All enzymes are made of one or more chains of amino acids, which determine their structure, behaviour, and interactions with other enzymes and molecules. That means it should be possible to predict an enzyme’s function and behaviour given just its amino acid sequence.
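As a rough illustration of treating the amino acid sequence as the only model input, the sketch below turns a sequence into two simple numeric representations: amino acid composition and overlapping k-mer counts. This is not the method used in this repository; the function names `composition` and `kmer_counts` and the choice of `k = 3` are assumptions made for the example.

```python
from collections import Counter
from typing import List

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(sequence: str) -> List[float]:
    """Return the fraction of each standard amino acid in the sequence.

    Characters outside the 20 standard codes are ignored.
    """
    counts = Counter(sequence.upper())
    total = sum(counts[a] for a in AMINO_ACIDS)
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def kmer_counts(sequence: str, k: int = 3) -> Counter:
    """Count overlapping substrings of length k (hypothetical default k=3)."""
    seq = sequence.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
```

Either representation could be fed to a standard classifier; composition discards residue order entirely, while k-mer counts keep short-range order at the cost of a much larger feature space.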

A model able to perform this task would have many applications. In addition to enzymes from known organisms (which we have from studying their proteomes), there are vast numbers of metagenomic sequences, that is, protein sequence data from environmental samples. Being able to quickly annotate them with function using such a model (i.e. going beyond simple sequence similarity) would be indispensable. Models developed in the course of this challenge may contribute to furthering our understanding of the world around us.
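For context, the kind of "simple sequence similarity" annotation that a learned model is meant to improve on can be sketched as a nearest-neighbour lookup: a new sequence gets the label of the most similar labelled sequence. The sketch below uses Jaccard similarity over k-mer sets as a stand-in for a real alignment tool. The function names, the toy sequences, and the labels `class_a` and `class_b` are all made up for illustration and do not come from the competition data.

```python
from typing import List, Optional, Set, Tuple


def kmer_set(sequence: str, k: int = 3) -> Set[str]:
    """Return the set of overlapping length-k substrings of the sequence."""
    seq = sequence.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity |a & b| / |a | b|, defined as 0.0 for two empty sets."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def nearest_label(
    query: str, library: List[Tuple[str, str]], k: int = 3
) -> Tuple[Optional[str], float]:
    """Return (label, similarity) of the library sequence most similar to query."""
    query_kmers = kmer_set(query, k)
    best_label, best_score = None, -1.0
    for sequence, label in library:
        score = jaccard(query_kmers, kmer_set(sequence, k))
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


# Toy, made-up library of (sequence, label) pairs.
toy_library = [("MKVLAAGIV", "class_a"), ("GSHMTTPEQ", "class_b")]
label, score = nearest_label("MKVLAAGLV", toy_library)
```

A lookup like this only works when a close relative of the query is already labelled, which is exactly the situation that breaks down for sequences from new organisms.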


Languages

Language:Jupyter Notebook 100.0%