hercules261188's repositories
1brc
Python solutions to the 1 billion row challenge.
100-pandas-puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
llm-answer-engine
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
historical-docs-analysis
Code to go along with my "Solving Real World Data Science Problems with LLMs (Historical Doc Analysis)" video.
grok-1
Grok open release
gpt-pilot
The first real AI developer
rembg-webapp-tutorial
a simple webapp with rembg
complete-dbt-bootcamp-zero-to-hero
Supplementary Materials for The Complete dbt (Data Build Tool) Bootcamp Udemy course
pydantic
Data parsing and validation using Python type hints
dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
rivet-examples
Collection of rivet examples to get you going! (scroll down for more information)
campusx-dsmp
Get all the resources, such as links to the notes and notebooks, provided in CampusX's DSMP Course.
yt-watch-history
Analyses a user's YouTube watch history.
duckdb
DuckDB is an in-process SQL OLAP Database Management System
pypi-duck-flow
End-to-end data engineering project to get insights from PyPI using #python and #duckdb
Flowise
Drag & drop UI to build your customized LLM flow
marimo
A reactive notebook for Python: run reproducible experiments, execute as a script, deploy as an app, and version with git.
vscode-marimo
marimo VS Code extension
ping_smuggler
Concept script to demonstrate how to exfiltrate data inside of ping packets
demos
Code from presentations
100-Days-of-DataScience
Greetings! I'm Loga Aswin, diving into a 100-day data science immersion from Python fundamentals to real-world applications. This space will be a live documentation of my journey, where code meets curiosity. Let's connect, learn, and code together. Click ⭐ on GitHub to stay tuned for updates on my work!
ML-Cheat-Codes
Machine Learning Cheatsheet 2024
MEDICAL-DATA-PROJECT-END2END-WITH-FEW-MLOPS
We are on a mission to transform medical data into actionable insights using the power of machine learning. Whether you are a data scientist, healthcare professional, or an enthusiast in the field, your contributions and ideas are invaluable to us. Join us in making a difference!
ruby-1-billion
One billion row challenge, Ruby edition