kangkot's starred repositories

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34351Issues:308Issues:876

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:NOASSERTIONStargazers:25826Issues:161Issues:298

gleam

⭐️ A friendly language for building type-safe, scalable systems!

Language:RustLicense:Apache-2.0Stargazers:16309Issues:86Issues:1740

phidata

Build AI Assistants with memory, knowledge and tools.

Language:PythonLicense:MPL-2.0Stargazers:10279Issues:80Issues:129

data-engineer-handbook

This is a repo with links to everything you'd ever want to learn about data engineering

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:8376Issues:67Issues:193

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:7747Issues:39Issues:256

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language:HTMLLicense:Apache-2.0Stargazers:7301Issues:51Issues:1011

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language:TypeScriptLicense:Apache-2.0Stargazers:5453Issues:33Issues:69

dataherald

Interact with your SQL database, Natural Language to SQL using LLMs

Language:PythonLicense:Apache-2.0Stargazers:3183Issues:24Issues:40

AlgorithmsSedgewick

Code from the book "Algorithms" (4th ed.) by Robert Sedgewick and Kevin Wayne (original, and my solutions to exercises).

FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Language:PythonLicense:MITStargazers:2566Issues:26Issues:55

STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Language:C++License:MPL-2.0Stargazers:2190Issues:64Issues:182

infinity

The AI-native database built for LLM applications, providing incredibly fast full-text and vector search

Language:C++License:Apache-2.0Stargazers:2003Issues:24Issues:289

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language:PythonLicense:NOASSERTIONStargazers:1775Issues:38Issues:51

piko

An open-source alternative to Ngrok, designed to serve production traffic and be simple to host (particularly on Kubernetes)

Language:GoLicense:MITStargazers:1707Issues:7Issues:8

zed

A novel data lake based on super-structured data

Language:GoLicense:BSD-3-ClauseStargazers:1334Issues:21Issues:1719

WrenAI

Wren AI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely.

Language:TypeScriptLicense:AGPL-3.0Stargazers:1217Issues:18Issues:70

intake

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Language:PythonLicense:BSD-2-ClauseStargazers:990Issues:41Issues:375

MyScaleDB

An open-source, high-performance SQL vector database built on ClickHouse.

Language:C++License:Apache-2.0Stargazers:740Issues:12Issues:12

lantern

PostgreSQL vector database extension for building AI applications

Language:CLicense:NOASSERTIONStargazers:686Issues:6Issues:72

SQL-Leetcode-Challenge

Contains all the 117 Leetcode questions with their solutions ranging from Easy to Hard in MySQL.

gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Language:JavaLicense:Apache-2.0Stargazers:641Issues:21Issues:1937

camel-examples

Apache Camel Examples

Language:JavaLicense:Apache-2.0Stargazers:387Issues:40Issues:0

recap

Work with your web service, database, and streaming schemas in a single format.

Language:PythonLicense:MITStargazers:310Issues:10Issues:133

pg_timeseries

Simple and focused time-series tables for PostgreSQL, from Tembo

Language:PLpgSQLLicense:PostgreSQLStargazers:280Issues:11Issues:7

bigquery-data-lineage

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Language:JavaLicense:Apache-2.0Stargazers:141Issues:19Issues:17

ndindex

A Python library for manipulating indices of ndarrays

Language:PythonLicense:MITStargazers:95Issues:16Issues:57

kafka-connect-milvus

kafka-connect-milvus sink connector

Language:JavaLicense:Apache-2.0Stargazers:16Issues:4Issues:1