Ananth Packkildurai's starred repositories

Web-Dev-For-Beginners

24 Lessons, 12 Weeks, Get Started as a Web Developer

Language: JavaScript | License: MIT | Stargazers: 82620 | Issues: 2705 | Issues: 288

ML-For-Beginners

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | License: MIT | Stargazers: 58246 | Issues: 499 | Issues: 105

awesome-scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

License: MIT | Stargazers: 57620 | Issues: 1868 | Issues: 0

semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

dub

Open-source link management infrastructure. Loved by modern marketing teams like Vercel, Raycast, and Perplexity.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 17277 | Issues: 72 | Issues: 433

chroma

the AI-native open-source embedding database

Language: Rust | License: Apache-2.0 | Stargazers: 14119 | Issues: 84 | Issues: 1103

static-analysis

⚙️ A curated list of static analysis (SAST) tools and linters for all programming languages, config files, build tools, and more. The focus is on tools which improve code quality.

Language: Rust | License: MIT | Stargazers: 13127 | Issues: 320 | Issues: 575

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python | License: MIT | Stargazers: 10077 | Issues: 65 | Issues: 105

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 8879 | Issues: 82 | Issues: 36

keda

KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event-driven scaling for any container running in Kubernetes.

Language: Go | License: Apache-2.0 | Stargazers: 8203 | Issues: 93 | Issues: 2221

azure-search-openai-demo

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Language: Python | License: MIT | Stargazers: 5763 | Issues: 233 | Issues: 1010

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language: Python | License: MIT | Stargazers: 5110 | Issues: 63 | Issues: 190

flink-cdc-connectors

CDC Connectors for Apache Flink®

Language: Java | License: Apache-2.0 | Stargazers: 5002 | Issues: 116 | Issues: 1631

hands-on-llms

🦖 Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ source code + video & reading materials

Language: Jupyter Notebook | License: MIT | Stargazers: 2835 | Issues: 46 | Issues: 16

substrait

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Language: Python | License: Apache-2.0 | Stargazers: 1132 | Issues: 42 | Issues: 156

apicurio-registry

An API/Schema registry - stores APIs and Schemas.

Language: Java | License: Apache-2.0 | Stargazers: 572 | Issues: 19 | Issues: 1037

onetable

OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.

Language: Java | License: Apache-2.0 | Stargazers: 562 | Issues: 18 | Issues: 144

LLM_AppDev-HandsOn

Repository and hands-on workshop on how to develop applications with local LLMs

Language: Jupyter Notebook | Stargazers: 386 | Issues: 8 | Issues: 2

hnswlib

Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs

Language: Java | License: Apache-2.0 | Stargazers: 248 | Issues: 16 | Issues: 52

schemata

Schema modelling framework for decentralised domain-driven ownership of data.

Language: Java | License: Apache-2.0 | Stargazers: 243 | Issues: 8 | Issues: 12

lintrule

Let the LLM review your code.

presidio-research

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Language: Python | License: MIT | Stargazers: 162 | Issues: 14 | Issues: 36

datagen

Generate authentic-looking mock data based on a SQL, JSON, or Avro schema and produce it to Kafka in JSON or Avro format.

Language: TypeScript | License: Apache-2.0 | Stargazers: 141 | Issues: 7 | Issues: 45

dbt-airflow-factory

Library to convert dbt manifest metadata to Airflow tasks

Language: Python | License: Apache-2.0 | Stargazers: 45 | Issues: 8 | Issues: 8

syft

Analytics event modeling framework in TypeScript

Language: TypeScript | License: Apache-2.0 | Stargazers: 45 | Issues: 2 | Issues: 1

cel2sql

CEL to SQL condition

Language: Go | License: Apache-2.0 | Stargazers: 32 | Issues: 2 | Issues: 0

quarkus-multi-module-project-quickstart

Modularized Quarkus 3.8 quickstart template project

Language: HTML | License: NOASSERTION | Stargazers: 21 | Issues: 3 | Issues: 6

java-berkleydb-queue

Lightweight, fast persistent queue in Java using Berkeley DB

Language: Java | License: Unlicense | Stargazers: 12 | Issues: 17 | Issues: 1