Kunkka's starred repositories
Twitter-Insight-LLM
Twitter data scraping, embedding based image search and more.
chineseocr_lite
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
pinyin-pro
中文转拼音、拼音音调、拼音声母、拼音韵母、多音字拼音、姓氏拼音、拼音匹配
MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫
NeuScraper
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
asap-dataset
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
ollama-python
Ollama Python library
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
Video2Music
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
world-models
Extracting spatial and temporal world models from LLMs
Object-Detection-Metrics
Most popular metrics used to evaluate object detection algorithms.
llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
qiskit-machine-learning
Quantum Machine Learning
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
Chinese-LLaMA-Alpaca-2
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)