Richard Chen's starred repositories
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
everyone-can-use-english
人人都能用英语
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
annotated-transformer
An annotated implementation of the Transformer paper.
MotionBERT
[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"
autogen-ui
Web UI for AutoGen (A Framework Multi-Agent LLM Applications)
AFFiNE.pro
AFFiNE official website, source for affine.pro
LocalAIVoiceChat
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
StyleSync_PyTorch
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
Speaker_diarization
Speech Diarization for scrum automation
ContextAware-PoseFormer
The project is an official implementation of our paper "A Single 2D Pose With Context is Worth Hundreds for 3D Human Pose Estimation".
Lightweight-Face-Detector-Pruning
Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2024), Waikoloa, Hawaii, USA, Jan. 2024. Repository updated in April 2024.
create-high-quality-dataset-for-computer-vision
This project focuses on generating a diverse and realistic dataset for computer vision training using ChatGPT and a realistic vision image generation model. The process involves dynamically creating prompts, utilizing ChatGPT to generate image descriptions, and generating images based on those descriptions.