PhSteel's starred repositories

crawl4ai

πŸ”₯πŸ•·οΈ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper

Language:PythonLicense:Apache-2.0Stargazers:10591Issues:0Issues:0

docetl

A system for agentic LLM-powered data processing

Language:PythonLicense:MITStargazers:721Issues:0Issues:0

Prompt-Engineering-Guide

πŸ™ Guides, papers, lecture, notebooks and resources for prompt engineering

Language:MDXLicense:MITStargazers:48394Issues:0Issues:0

pyglove

Manipulating Python Programs

Language:PythonLicense:Apache-2.0Stargazers:528Issues:0Issues:0

hessian-spectrum

Code for the paper: Why Transformers Need Adam: A Hessian Perspective

Language:Jupyter NotebookStargazers:34Issues:0Issues:0

awesome-pipeline

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

Stargazers:6144Issues:0Issues:0

awesome-pentest

A collection of awesome penetration testing resources, tools and other shiny things

Stargazers:21494Issues:0Issues:0

awesome-LaTeX

Curated list of LaTeX awesomeness

License:NOASSERTIONStargazers:1388Issues:0Issues:0

streamable

Stream-like manipulation of iterables.

Language:PythonLicense:Apache-2.0Stargazers:71Issues:0Issues:0

skrub

Prepping tables for machine learning

Language:PythonLicense:BSD-3-ClauseStargazers:1166Issues:0Issues:0

SurfSense

Personal AI Assistant for World Wide Web Surfers. Research & Never forget anything you see on the Internet

Language:PythonLicense:Apache-2.0Stargazers:350Issues:0Issues:0

MemoRAG

Empowering RAG with a memory-based data interface for all-purpose applications!

Language:PythonLicense:Apache-2.0Stargazers:958Issues:0Issues:0

agentic-customer-service-medical-clinic

This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clinic

Language:PythonStargazers:156Issues:0Issues:0

coroot

Coroot is an open-source APM & Observability tool, a DataDog and NewRelic alternative πŸ“Š, πŸ–₯️, πŸ‘‰. Powered by eBPF for rapid insights into system performance. Monitor, analyze, and optimize your infrastructure effortlessly for peak reliability at any scale.

Language:GoLicense:Apache-2.0Stargazers:5231Issues:0Issues:0

ell

A language model programming library.

Language:PythonLicense:MITStargazers:4208Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1026Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:11092Issues:0Issues:0

OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Language:PythonLicense:Apache-2.0Stargazers:9819Issues:0Issues:0

portmaster

πŸ” Love Freedom - ❌ Block Mass Surveillance

Language:GoLicense:GPL-3.0Stargazers:9223Issues:0Issues:0

MLE-agent

πŸ€– MLE-Agent: Your intelligent companion for seamless AI engineering and research. πŸ” Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc supported. :fireworks: Code RAG

Language:PythonLicense:MITStargazers:1029Issues:0Issues:0

uncertain_ground_truth

Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23 and ArXiv pre-print).

Language:PythonLicense:Apache-2.0Stargazers:535Issues:0Issues:0

claude-engineer

Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.

Language:PythonStargazers:9174Issues:0Issues:0

unravelsports

The unravelsports package aims to aid researchers, analysts and enthusiasts by providing intermediary steps in the complex process of turning raw sports data into meaningful information and actionable insights.

Language:PythonLicense:MPL-2.0Stargazers:36Issues:0Issues:0

firecrawl

πŸ”₯ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

Language:TypeScriptLicense:AGPL-3.0Stargazers:15286Issues:0Issues:0

phi-mamba

Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)

Language:PythonStargazers:69Issues:0Issues:0

self-hosted-ai-starter-kit

The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.

License:Apache-2.0Stargazers:2776Issues:0Issues:0

cola

Compositional Linear Algebra

Language:PythonLicense:Apache-2.0Stargazers:401Issues:0Issues:0

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookLicense:MITStargazers:13223Issues:0Issues:0

TabNine

AI Code Completions

Language:ShellLicense:MITStargazers:10576Issues:0Issues:0

Awesome-Code-LLM

[TMLR] A curated list of language modeling researches for code and related datasets.

Stargazers:1443Issues:0Issues:0