Zhiqi Huang's starred repositories
OSC8-Adoption
List of terminal emulators that support hyperlinks (OSC 8 escape sequences).
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
pillow-simd
The friendly PIL fork
awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
RapidLaTeXOCR
Formula recognition based on LaTeX-OCR and ONNXRuntime.
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Awesome-Pruning
A curated list of neural network pruning resources.
GitHub-Chinese-Top-Charts
:cn: GitHub中文排行榜,各语言分设「软件 | 资料」榜单,精准定位中文好项目。各取所需,高效学习。
Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Languagecodec
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
Phi2-mini-Chinese
Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton