Manish Gupta's repositories
appsmith
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
argilla
Argilla is a collaboration platform for AI engineers and domain experts who require high-quality outputs, full data ownership, and overall efficiency.
azure-search-openai-demo
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
changedetection.io
changedetection.io - The best and simplest free, open source, self-hosted service for website change detection, tracking, monitoring, and notification. An alternative to Visualping, Watchtower, etc. Designed for simplicity: the main goal is to monitor, for free, which websites had a text change.
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
d2
D2 is a modern diagram scripting language that turns text to diagrams.
danswer
Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
datavines
Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.
DB-GPT
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
deeplake
Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai
desbordante-core
Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It also lets you run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
developer
With 100k context windows on the way, it's now feasible for every dev to have their own smol developer
Finance
150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data
instagraph
Converts text input or a URL into a knowledge graph and displays it
lakehouse-engine
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
magika
Detect file content types with deep learning
meerkat
Interactive data structures for evaluating foundation models.
nussknacker
A visual tool to define and run real-time decision algorithms. Brings agility to business teams, liberates developers to focus on technology.
PyMuPDF
PyMuPDF is a high-performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
pypdf
A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
rags
Build ChatGPT over your data, all with natural language
remorph
Cross-compiler into Databricks Lakehouse
spacy-llm
🦙 Integrating LLMs into structured NLP pipelines
spookystuff
Scalable query engine for web scraping/data mashup/acceptance QA, powered by Apache Spark
supervision
We write your reusable computer vision tools. 💜
uptime-kuma
A fancy self-hosted monitoring tool
visualblocks
Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-and-drop ML components, including models, user inputs, processors, and visualizations.
VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer