kangkot's starred repositories

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Language: Python · License: NOASSERTION · Stargazers: 34586 · Issues: 309 · Issues: 877

ChatTTS

A generative speech model for daily dialogue.

Language: Python · License: AGPL-3.0 · Stargazers: 28116 · Issues: 167 · Issues: 406

gleam

⭐️ A friendly language for building type-safe, scalable systems!

Language: Rust · License: Apache-2.0 · Stargazers: 16754 · Issues: 88 · Issues: 1775

phidata

Build AI Assistants with memory, knowledge and tools.

Language: Python · License: MPL-2.0 · Stargazers: 10709 · Issues: 83 · Issues: 141

data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language: Python · License: MIT · Stargazers: 8510 · Issues: 67 · Issues: 196

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language: Python · License: AGPL-3.0 · Stargazers: 8480 · Issues: 42 · Issues: 314

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML · License: Apache-2.0 · Stargazers: 7761 · Issues: 52 · Issues: 1057

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language: TypeScript · License: Apache-2.0 · Stargazers: 5842 · Issues: 33 · Issues: 73

dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

Language: Python · License: Apache-2.0 · Stargazers: 3242 · Issues: 24 · Issues: 41

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

Language: Python · License: MIT · Stargazers: 2823 · Issues: 27 · Issues: 68

AlgorithmsSedgewick

Code from the book "Algorithms" (4th ed.) by Robert Sedgewick and Kevin Wayne (original, and my solutions to exercises).

STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Language: C++ · License: MPL-2.0 · Stargazers: 2207 · Issues: 63 · Issues: 183

infinity

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text

Language: C++ · License: Apache-2.0 · Stargazers: 2130 · Issues: 25 · Issues: 300

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language: Python · License: NOASSERTION · Stargazers: 1952 · Issues: 41 · Issues: 55

piko

An open-source alternative to Ngrok, designed to serve production traffic and be simple to host (particularly on Kubernetes)

Language: Go · License: MIT · Stargazers: 1761 · Issues: 8 · Issues: 8

WrenAI

Wren AI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely.

Language: TypeScript · License: AGPL-3.0 · Stargazers: 1477 · Issues: 23 · Issues: 100

zed

A novel data lake based on super-structured data

Language: Go · License: BSD-3-Clause · Stargazers: 1351 · Issues: 22 · Issues: 1739

intake

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Language: Python · License: BSD-2-Clause · Stargazers: 994 · Issues: 41 · Issues: 378

MyScaleDB

An open-source, high-performance SQL vector database built on ClickHouse.

Language: C++ · License: Apache-2.0 · Stargazers: 775 · Issues: 12 · Issues: 12

lantern

PostgreSQL vector database extension for building AI applications

Language: C · License: NOASSERTION · Stargazers: 715 · Issues: 6 · Issues: 72

SQL-Leetcode-Challenge

Contains all the 117 Leetcode questions with their solutions ranging from Easy to Hard in MySQL.

gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Language: Java · License: Apache-2.0 · Stargazers: 663 · Issues: 20 · Issues: 2031

camel-examples

Apache Camel Examples

Language: Java · License: Apache-2.0 · Stargazers: 392 · Issues: 39 · Issues: 0

recap

Work with your web service, database, and streaming schemas in a single format.

Language: Python · License: MIT · Stargazers: 312 · Issues: 10 · Issues: 133

pg_timeseries

Simple and focused time-series tables for PostgreSQL, from Tembo

Language: PLpgSQL · License: PostgreSQL · Stargazers: 290 · Issues: 11 · Issues: 7

bigquery-data-lineage

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Language: Java · License: Apache-2.0 · Stargazers: 141 · Issues: 19 · Issues: 17

ndindex

A Python library for manipulating indices of ndarrays

Language: Python · License: MIT · Stargazers: 95 · Issues: 16 · Issues: 57

kafka-connect-milvus

kafka-connect-milvus sink connector

Language: Java · License: Apache-2.0 · Stargazers: 16 · Issues: 4 · Issues: 1