Anjan Nepal's starred repositories

hmm

Large-scale unsupervised Hidden Markov Model (HMM) implementation supporting online-learning and multi-threading

Language: Java | Stargazers: 8 | Issues: 0

mamba

Mamba SSM architecture

Language: Python | License: Apache-2.0 | Stargazers: 12012 | Issues: 0

long-range-arena

Long Range Arena for Benchmarking Efficient Transformers

Language: Python | License: Apache-2.0 | Stargazers: 707 | Issues: 0

netron

Visualizer for neural network, deep learning and machine learning models

Language: JavaScript | License: MIT | Stargazers: 27042 | Issues: 0

tree-of-thoughts

Plug-and-play implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models, which the project claims elevates model reasoning by at least 70%

Language: Python | License: Apache-2.0 | Stargazers: 4192 | Issues: 0

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

License: Apache-2.0 | Stargazers: 7313 | Issues: 0

shuttle

Build & ship backends without writing any infrastructure files.

Language: Rust | License: Apache-2.0 | Stargazers: 5832 | Issues: 0

prophet

Tool for producing high-quality forecasts for time series data that have multiple seasonality with linear or non-linear growth.

Language: Python | License: MIT | Stargazers: 18131 | Issues: 0

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10809 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29221 | Issues: 0

h2ogpt

Private chat with a local GPT over documents, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

Language: Python | License: Apache-2.0 | Stargazers: 11087 | Issues: 0

splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language: Python | License: NOASSERTION | Stargazers: 719 | Issues: 0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it. Not actively maintained.

Language: Python | License: MIT | Stargazers: 51545 | Issues: 0

mojo

The Mojo Programming Language

Language: Mojo | License: NOASSERTION | Stargazers: 22480 | Issues: 0

tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Language: Python | License: MIT | Stargazers: 11374 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35335 | Issues: 0

capnproto

Cap'n Proto serialization/RPC system - core tools and C++ library

Language: C++ | License: NOASSERTION | Stargazers: 11449 | Issues: 0

are-16-heads-really-better-than-1

Code for the paper "Are Sixteen Heads Really Better than One?"

Language: Shell | License: MIT | Stargazers: 163 | Issues: 0

bort

Repository for the paper "Optimal Subarchitecture Extraction for BERT"

Language: Python | License: Apache-2.0 | Stargazers: 468 | Issues: 0

batnoter

An open-source, Markdown-based, self-hosted note-taking web app.

Language: TypeScript | License: MIT | Stargazers: 2331 | Issues: 0

coding-interview-university

A complete computer science study plan to become a software engineer.

License: CC-BY-SA-4.0 | Stargazers: 301187 | Issues: 0

xxHash

Extremely fast non-cryptographic hash algorithm

Language: C | License: NOASSERTION | Stargazers: 8812 | Issues: 0

snorkel

A system for quickly generating training data with weak supervision

Language: Python | License: Apache-2.0 | Stargazers: 5769 | Issues: 0

resilience4j

Resilience4j is a fault tolerance library designed for Java 8 and functional programming

Language: Java | License: Apache-2.0 | Stargazers: 9599 | Issues: 0

Grokking-System-Design

Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems theory to product development.

Language: Shell | License: GPL-3.0 | Stargazers: 4866 | Issues: 0

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language: Python | License: NOASSERTION | Stargazers: 265870 | Issues: 0

greedy-layer-pruning

Greedy layer pruning for transformer models.

Language: Python | Stargazers: 7 | Issues: 0

transformer-deploy

Efficient, scalable, and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Language: Python | License: Apache-2.0 | Stargazers: 1638 | Issues: 0

detr

End-to-End Object Detection with Transformers

Language: Python | License: Apache-2.0 | Stargazers: 13211 | Issues: 0

tutorials

Tutorials for creating and using ONNX models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 3306 | Issues: 0