Mark Douthwaite (markdouthwaite)

Company: Peak AI

Location: Manchester, UK

Home Page: mark.douthwaite.io

Twitter: @markldouthwaite


Organizations
PeakBI

Mark Douthwaite's starred repositories

system-design

Learn how to design systems at scale and prepare for system design interviews

Data-Science-For-Beginners

10 Weeks, 20 Lessons, Data Science for All!

Language: Jupyter Notebook, License: MIT, Stargazers: 26998, Issues: 496, Issues: 115

qdrant

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language: Rust, License: Apache-2.0, Stargazers: 18790, Issues: 118, Issues: 1129

data-science

📊 Path to a free self-taught education in Data Science!

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

Language: Python, License: Apache-2.0, Stargazers: 14524, Issues: 128, Issues: 3337

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python, License: NOASSERTION, Stargazers: 14346, Issues: 265, Issues: 203

onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

danswer

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.

Language: Python, License: NOASSERTION, Stargazers: 9771, Issues: 95, Issues: 404

promptflow

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Language: Python, License: MIT, Stargazers: 8700, Issues: 98, Issues: 461

openplayground

An LLM playground you can run on your laptop

Language: TypeScript, License: MIT, Stargazers: 6153, Issues: 61, Issues: 92

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language: Python, License: Apache-2.0, Stargazers: 4438, Issues: 76, Issues: 87

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language: Python, License: Apache-2.0, Stargazers: 4221, Issues: 40, Issues: 116

llm-app

Dynamic RAG for enterprise. Ready to run with Docker, ⚡ in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

Data-Engineering-HowTo

A list of useful resources to learn Data Engineering from scratch

pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Language: Python, License: NOASSERTION, Stargazers: 2843, Issues: 22, Issues: 58

obsidian-smart-connections

Chat with your notes & see links to related content with AI embeddings. Use local models or 100+ via APIs like Claude, Gemini, ChatGPT & Llama 3

Language: JavaScript, License: GPL-3.0, Stargazers: 2188, Issues: 29, Issues: 442

griptape

Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.

Language: Python, License: Apache-2.0, Stargazers: 1775, Issues: 27, Issues: 285

sqlite-vss

A SQLite extension for efficient vector search, based on Faiss!

Language: C++, License: MIT, Stargazers: 1592, Issues: 22, Issues: 99

awesome-fastapi-projects

List of FastAPI projects! 😎 🚀

fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Language: Python, License: Apache-2.0, Stargazers: 1021, Issues: 9, Issues: 102

business-rules

Python DSL for setting up business intelligence rules that can be configured without code

Language: Python, License: MIT, Stargazers: 881, Issues: 87, Issues: 30

pgvector-python

pgvector support for Python

Language: Python, License: MIT, Stargazers: 781, Issues: 12, Issues: 57

objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Language: Python, License: Apache-2.0, Stargazers: 630, Issues: 8, Issues: 42

obsidian-copilot

🤖 A prototype assistant for writing and thinking

Language: Python, License: Apache-2.0, Stargazers: 464, Issues: 9, Issues: 8

Awesome-VQVAE

📚 A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its applications

tinyllm

Develop, evaluate and monitor LLM applications at scale

Language: Python, License: MIT, Stargazers: 86, Issues: 3, Issues: 1

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License: NOASSERTION, Stargazers: 71, Issues: 0, Issues: 0

UniRec

UniRec is an easy-to-use, lightweight, and scalable implementation of recommender systems. Its primary objective is to enable users to swiftly construct a comprehensive ecosystem of recommenders using a minimal set of robust and practical recommendation models.

Language: Python, License: MIT, Stargazers: 32, Issues: 5, Issues: 3

inai

An experiment in structuring a NodeJS application *internally* using REST principles.

Language: JavaScript, License: Apache-2.0, Stargazers: 3, Issues: 2, Issues: 0