0xVolt / DNASCANNER

An all-in-one tool that effectively performs text analysis on DNA sequences.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DNASCANNER

About this repository

This repository was created as a means of source control for the DNASCANNER project. This repository is authored by Ananya Aditi Singh, Desh Iyer and Alakto Choudhury.

Approaches over versions

Version 2 - September 2022

  • Switched to using fasta files from regular .txt files to store not just one sequence but multiple sequences along with metadata.
  • By extension, the program now works on more than one DNA sequence at once. All the inputs are stored in sample.fasta
  • The sliding window function remains the same with slight tweaks for compatibility with the .fasta file.
  • Output from main.ipynb are stored in json format in output.json.

Version 1 - August 2022

  • We started by writing a python script named sliding-window.py to illustrate the sliding window technique in Python.
  • We then scaled this up to the main.ipynb handling a much larger sequence of DNA that we generated using an online tool. This sequence is stored in the text file dna.txt and is imported as and when required.
  • In addition, the data structure that stores all of the data pertaining to the nucleotides is written to the output.json for future reference.

The docs for this project can be found on a google doc here. This daily log includes an agenda, code snippets and screenshots with an explanation of every block of code.

About

An all-in-one tool that effectively performs text analysis on DNA sequences.

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 89.5%Language:Python 7.5%Language:HTML 2.5%Language:CSS 0.4%