Andrew's starred repositories

gptstore-prompts

Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.

License: CC0-1.0 | Stargazers: 452 | Issues: 0

ChineseNLPCorpus

Chinese natural language processing datasets, material for everyday experiments. Additions and merge requests are welcome.

Language: Python | Stargazers: 4138 | Issues: 0

gowatch

🚀 gowatch is a command-line tool that builds and (re)starts your Go project every time you save a Go or template file.

Language: Go | License: MIT | Stargazers: 821 | Issues: 0

InstructABSA

Instructional learning for Aspect Based Sentiment Analysis

Language: Jupyter Notebook | License: MIT | Stargazers: 124 | Issues: 0

fake-audio-detector

Simple fake audio detector

Language: Python | License: AGPL-3.0 | Stargazers: 24 | Issues: 0

bootcamp

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

Language: HTML | License: Apache-2.0 | Stargazers: 1695 | Issues: 0

milvus

A cloud-native vector database, storage for next generation AI applications

Language: Go | License: Apache-2.0 | Stargazers: 27927 | Issues: 0

AnyDoor

Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"

Language: Python | License: MIT | Stargazers: 3816 | Issues: 0

kratos

Your ultimate Go microservices framework for the cloud-native era.

Language: Go | License: MIT | Stargazers: 22668 | Issues: 0

ocr-text-renderer

Generates data for training OCR character recognition

Language: Python | Stargazers: 3 | Issues: 0
Language: Python | License: MIT | Stargazers: 734 | Issues: 0

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language: Python | Stargazers: 1628 | Issues: 0

PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (a practical ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)

Language: Python | License: Apache-2.0 | Stargazers: 39974 | Issues: 0

TextRecognitionDataGenerator

A synthetic data generator for text recognition

Language: Python | License: MIT | Stargazers: 3121 | Issues: 0

text_renderer

Generate text images for training deep learning OCR models

Language: Python | License: MIT | Stargazers: 1347 | Issues: 0

SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Language: Python | License: Apache-2.0 | Stargazers: 1988 | Issues: 0

chineseocr_lite

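Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M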

Language: C++ | License: GPL-2.0 | Stargazers: 11602 | Issues: 0

langchaingo

LangChain for Go, the easiest way to write LLM-based programs in Go

Language: Go | License: MIT | Stargazers: 3667 | Issues: 0

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | License: Apache-2.0 | Stargazers: 5500 | Issues: 0

instructor

Structured outputs for LLMs

Language: Python | License: MIT | Stargazers: 6370 | Issues: 0

dingtalk-stream-sdk-python

Python SDK for the DingTalk Stream Mode API. Compared with webhook mode, it makes accessing the DingTalk chatbot easier.

Language: Python | License: MIT | Stargazers: 47 | Issues: 0

WebcamGPT-Vision

Lightweight GPT-4 Vision processing over the Webcam

Language: JavaScript | Stargazers: 255 | Issues: 0

n8n

Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

Language: TypeScript | License: NOASSERTION | Stargazers: 42142 | Issues: 0

3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Language: Python | License: Apache-2.0 | Stargazers: 881 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 821 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 51388 | Issues: 0

sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn, without an Internet connection. Supports iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A, etc.

Language: C++ | License: Apache-2.0 | Stargazers: 877 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 63551 | Issues: 0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language: C++ | License: NOASSERTION | Stargazers: 19616 | Issues: 0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7321 | Issues: 0