Lee Bergstrand (LeeBergstrand)

Company: @MBLS Inc.

Lee Bergstrand's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49459 | Issues: 562 | Issues: 209

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 23582 | Issues: 231 | Issues: 134

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language: Python | License: MIT | Stargazers: 18333 | Issues: 203 | Issues: 386

FreeAskInternet

FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generator using multiple LLMs, with no GPU needed. The user asks a question, and the system runs a multi-engine search, passes the combined search results to an LLM, and generates an answer based on those results. It's all FREE to use.

Language: Python | License: Apache-2.0 | Stargazers: 8464 | Issues: 54 | Issues: 79

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++ | License: MIT | Stargazers: 7898 | Issues: 77 | Issues: 161

Language: Jupyter Notebook | License: MIT | Stargazers: 2233 | Issues: 32 | Issues: 10

seqkit

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

hail

Cloud-native genomic dataframes and batch computing

Language: Python | License: MIT | Stargazers: 973 | Issues: 56 | Issues: 2413

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python | License: Apache-2.0 | Stargazers: 663 | Issues: 12 | Issues: 30

bakta

Rapid & standardized annotation of bacterial genomes, MAGs & plasmids

Language: Python | License: GPL-3.0 | Stargazers: 432 | Issues: 13 | Issues: 231

Porechop

Adapter trimmer for Oxford Nanopore reads

Language: C++ | License: GPL-3.0 | Stargazers: 334 | Issues: 19 | Issues: 88

rgi

Resistance Gene Identifier (RGI). Software to predict resistomes from protein or nucleotide data, including metagenomics data, based on homology and SNP models.

Language: Python | License: NOASSERTION | Stargazers: 326 | Issues: 19 | Issues: 244

InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Language: Python | License: MIT | Stargazers: 281 | Issues: 16 | Issues: 46

NextPolish

Quickly and accurately polish genomes generated from long reads.

sunbeam

A robust, extensible metagenomics pipeline

sylph

Ultrafast taxonomic profiling and genome querying for metagenomic samples by abundance-corrected minhash.

Language: Rust | License: MIT | Stargazers: 136 | Issues: 3 | Issues: 14

singlem

Novelty-inclusive microbial community profiling of shotgun metagenomes

Language: Python | License: GPL-3.0 | Stargazers: 127 | Issues: 9 | Issues: 132

mob-suite

MOB-suite: Software tools for clustering, reconstruction and typing of plasmids from draft assemblies

Language: Python | License: Apache-2.0 | Stargazers: 118 | Issues: 12 | Issues: 143

abPOA

abPOA: an SIMD-based C library for fast partial order alignment using adaptive band

Language: C | License: MIT | Stargazers: 118 | Issues: 8 | Issues: 65

GraffiTE

GraffiTE is a pipeline that finds polymorphic transposable elements in genome assemblies and/or long reads, and genotypes the discovered polymorphisms in read sets using genome-graphs.

Language: R | License: NOASSERTION | Stargazers: 107 | Issues: 4 | Issues: 34

hybracter

Automated long-read first bacterial genome assembly tool implemented in Snakemake using Snaketool.

Language: Python | License: MIT | Stargazers: 95 | Issues: 2 | Issues: 36

pyMSAviz

MSA (Multiple Sequence Alignment) visualization Python package for sequence analysis

Language: Python | License: MIT | Stargazers: 80 | Issues: 4 | Issues: 10

woltka

Woltka: a versatile meta'omic data classifier

Language: Python | License: BSD-3-Clause | Stargazers: 68 | Issues: 9 | Issues: 77

gtdb_to_taxdump

Convert GTDB taxonomy to NCBI taxdump format

Language: Python | License: MIT | Stargazers: 65 | Issues: 5 | Issues: 18

CompareM2

🦠📇 Microbial genomes-to-report pipeline

Language: Python | License: GPL-3.0 | Stargazers: 52 | Issues: 3 | Issues: 96

noveltree

NovelTree is a highly parallelized and computationally efficient phylogenomic workflow that infers gene families, gene family trees, species trees, and gene family evolutionary history.

Language: Nextflow | License: AGPL-3.0 | Stargazers: 17 | Issues: 7 | Issues: 39

psm3mkv

psm3mkv: A package to evaluate the fit and efficiency of three state oncology cost-effectiveness model structures

Language: R | License: GPL-3.0 | Stargazers: 9 | Issues: 6 | Issues: 9

rotary

Assembly/annotation workflow for Nanopore-based microbial genome data containing circular DNA elements

Language: Python | License: BSD-3-Clause | Stargazers: 2 | Issues: 2 | Issues: 80

seguid-python

SEGUID v2: Checksums for Linear, Circular, Single- and Double-Stranded Biological Sequences

Language: Python | License: MIT | Stargazers: 2 | Issues: 4 | Issues: 29