gm-is's starred repositories

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1376Issues:0Issues:0

embetter

just a bunch of useful embeddings

Language:PythonLicense:MITStargazers:447Issues:0Issues:0

plotly.py

The interactive graphing library for Python :sparkles: This project now includes Plotly Express!

Language:PythonLicense:MITStargazers:15793Issues:0Issues:0

voila

Voilà turns Jupyter notebooks into standalone web applications

Language:PythonLicense:NOASSERTIONStargazers:5337Issues:0Issues:0

1brc

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java

Language:JavaLicense:Apache-2.0Stargazers:5843Issues:0Issues:0

cinemagoer

Cinemagoer is a Python package useful to retrieve and manage the data of the IMDb (to which we are not affiliated in any way) movie database about movies, people, characters and companies

Language:PythonLicense:GPL-2.0Stargazers:1211Issues:0Issues:0

mteb

MTEB: Massive Text Embedding Benchmark

Language:PythonLicense:Apache-2.0Stargazers:1685Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1123Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22378Issues:0Issues:0

system-design

A resource to help you pass system design interview and become good at work

License:NOASSERTIONStargazers:10687Issues:0Issues:0

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:82171Issues:0Issues:0

doubleml-for-py

DoubleML - Double Machine Learning in Python

Language:PythonLicense:BSD-3-ClauseStargazers:455Issues:0Issues:0

what_are_embeddings

A deep dive into embeddings starting from fundamentals

Language:Jupyter NotebookStargazers:903Issues:0Issues:0

optimum-quanto

A pytorch quantization backend for optimum

Language:PythonLicense:Apache-2.0Stargazers:681Issues:0Issues:0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:12148Issues:0Issues:0

alacritty

A cross-platform, OpenGL terminal emulator.

Language:RustLicense:Apache-2.0Stargazers:54933Issues:0Issues:0

great-tables

Make awesome display tables using Python.

Language:PythonLicense:MITStargazers:1586Issues:0Issues:0

DiD

Keeping track of what is going on with the latest DiD innovations.

Language:HTMLLicense:MITStargazers:426Issues:0Issues:0

polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Language:RustLicense:NOASSERTIONStargazers:28292Issues:0Issues:0

chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Language:PythonLicense:Apache-2.0Stargazers:139Issues:0Issues:0

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:14958Issues:0Issues:0

renumics-rag

Visualization for a Retrieval-Augmented Generation (RAG) Assistant 🤖❤️📚

Language:PythonLicense:MITStargazers:147Issues:0Issues:0

captions

transcripts and captions for 3blue1brown videos

Language:TypeScriptStargazers:233Issues:0Issues:0

colima

Container runtimes on macOS (and Linux) with minimal setup

Language:GoLicense:MITStargazers:18024Issues:0Issues:0

NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Language:PythonLicense:Apache-2.0Stargazers:809Issues:0Issues:0

LLM-Finetuning

LLM Finetuning with peft

Language:Jupyter NotebookStargazers:1933Issues:0Issues:0

generator9000

Web App for generating synthetic data

Language:TypeScriptLicense:BSD-3-ClauseStargazers:44Issues:0Issues:0

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

License:MITStargazers:6479Issues:0Issues:0

fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Language:PythonLicense:Apache-2.0Stargazers:1211Issues:0Issues:0

Analytic-continual-learning

This repository will be posting analytic continual learning series, including Analytic Class-Incremental Learning (ACIL), Gaussian Kernel Embedded Analytic Learning (GKEAL), Dual-Stream Analytic Learning (DS-AL), etc.

Language:PythonLicense:MITStargazers:163Issues:0Issues:0