Sebastian Husch Lee (sjrl)

Company: @deepset-ai

Location: Munich

Twitter: @sebjrlee

Sebastian Husch Lee's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129765 | Issues: 1120 | Issues: 15316

Web-Dev-For-Beginners

24 Lessons, 12 Weeks, Get Started as a Web Developer

Language: JavaScript | License: MIT | Stargazers: 82331 | Issues: 2701 | Issues: 287

act

Run your GitHub Actions locally 🚀

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35134 | Issues: 355 | Issues: 305

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 29880 | Issues: 424 | Issues: 4173

chatbot-ui

AI chat for every model.

Language: TypeScript | License: MIT | Stargazers: 27601 | Issues: 246 | Issues: 940

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 19243 | Issues: 297 | Issues: 1339

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 15196 | Issues: 105 | Issues: 975

dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Language: Python | License: Apache-2.0 | Stargazers: 10809 | Issues: 136 | Issues: 162

open-llms

📋 A list of open LLMs available for commercial use.

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python | License: Apache-2.0 | Stargazers: 9090 | Issues: 109 | Issues: 81

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 5756 | Issues: 48 | Issues: 968

superduperdb

🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

Language: Python | License: Apache-2.0 | Stargazers: 4548 | Issues: 42 | Issues: 1156

huggingface_hub

The official Python client for the Huggingface Hub.

Language: Python | License: Apache-2.0 | Stargazers: 1867 | Issues: 59 | Issues: 863

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

Language: Python | License: Apache-2.0 | Stargazers: 1797 | Issues: 36 | Issues: 1046

brave-ios

Brave iOS Browser

Language: Swift | License: MPL-2.0 | Stargazers: 1689 | Issues: 67 | Issues: 4891

Language: Python | License: Apache-2.0 | Stargazers: 1236 | Issues: 22 | Issues: 48

llmsherpa

Developer APIs to Accelerate LLM Projects

Language: Jupyter Notebook | License: MIT | Stargazers: 1205 | Issues: 11 | Issues: 64

notion-to-md

Convert notion pages, block and list of blocks to markdown (supports nesting and custom parsing)

Language: TypeScript | License: MIT | Stargazers: 1058 | Issues: 7 | Issues: 70

large-qa-datasets

A collection of large question answering datasets

haystack-tutorials

Here you can find all the Tutorials for Haystack 📓

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 233 | Issues: 15 | Issues: 89

mMARCO

A multilingual version of MS MARCO passage ranking dataset

Language: Python | License: Apache-2.0 | Stargazers: 139 | Issues: 6 | Issues: 16

awesome-legal-data

Collection of Datasets for Legal Text Processing

should-i-follow

🦄 An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitter

Language: JavaScript | License: MIT | Stargazers: 26 | Issues: 2 | Issues: 0

deepset-cloud-sdk

A Python SDK to interact with deepset Cloud

Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 11 | Issues: 34

templates

Usable templates for your work.