hxdaze's repositories
argo-workflows
Workflow Engine for Kubernetes
awesome-japanese-llm
日本語LLMまとめ
AWSIM
Open source simulator for self-driving vehicles
click
Python composable command line interface toolkit
CUDA-FastBEV
TensorRT deploy and PTQ/QAT tools development for FastBEV, total time only need 6.9ms!!!
dask
Parallel computing with task scheduling
datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
fasterrcnn-pytorch-training-pipeline
PyTorch Faster R-CNN Object Detection on Custom Dataset
gato
Unofficial Gato: A Generalist Agent
gitignore
A collection of useful .gitignore templates
GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
growi
:anchor: GROWI - Team collaboration software using markdown
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LLaVA-JP
LLaVA-JP is a Japanese VLM trained by LLaVA method
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
MemGPT
Teaching LLMs memory management for unbounded context 📚🦙
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
review_object_detection_metrics
Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding box formats as in COCO, PASCAL, Imagenet, etc.
simple-onnx-processing-tools
A set of simple tools for splitting, merging, OP deletion, size compression, rewriting attributes and constants, OP generation, change opset, change to the specified input order, addition of OP, RGB to BGR conversion, change batch size, batch rename of OP, and JSON convertion for ONNX models.
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
whisper
Robust Speech Recognition via Large-Scale Weak Supervision