Richard Chen (richardhahahaha)

Company: ZKTeco

Richard Chen's starred repositories

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language: Python | License: MIT | Stargazers: 52802 | Issues: 308 | Issues: 254

penpot

Penpot: The open-source design tool for design and code collaboration

Language: Clojure | License: MPL-2.0 | Stargazers: 28447 | Issues: 205 | Issues: 1417

everyone-can-use-english

Everyone Can Use English

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language: Jupyter Notebook | License: MIT | Stargazers: 8175 | Issues: 127 | Issues: 412

rags

Build ChatGPT over your data, all with natural language

Language: Python | License: MIT | Stargazers: 5981 | Issues: 55 | Issues: 38

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5241 | Issues: 59 | Issues: 86

annotated-transformer

An annotated implementation of the Transformer paper.

Language: Jupyter Notebook | License: MIT | Stargazers: 5186 | Issues: 63 | Issues: 85

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python | License: Apache-2.0 | Stargazers: 3866 | Issues: 53 | Issues: 93

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language: Python | License: MIT | Stargazers: 3483 | Issues: 100 | Issues: 159

T-Rex

API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language: Python | License: NOASSERTION | Stargazers: 1940 | Issues: 39 | Issues: 59

MotionBERT

[ICCV 2023] PyTorch Implementation of "MotionBERT: A Unified Perspective on Learning Human Motion Representations"

Language: Python | License: Apache-2.0 | Stargazers: 892 | Issues: 21 | Issues: 130

diart

A python package to build AI-powered real-time audio applications

Language: Python | License: MIT | Stargazers: 841 | Issues: 20 | Issues: 139

ar5iv

A web service offering HTML5 articles from arXiv.org as converted with latexml

Language: Rust | License: MIT | Stargazers: 728 | Issues: 7 | Issues: 464

autogen-ui

Web UI for AutoGen (A Framework for Multi-Agent LLM Applications)

Language: TypeScript | License: MIT | Stargazers: 611 | Issues: 18 | Issues: 17

AFFiNE.pro

AFFiNE official website, source for affine.pro

Language: Vue | License: AGPL-3.0 | Stargazers: 571 | Issues: 13 | Issues: 16

APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Language: Python | License: Apache-2.0 | Stargazers: 444 | Issues: 6 | Issues: 46

Open-NLLB

Effort to open-source NLLB checkpoints.

Language: Python | License: MIT | Stargazers: 383 | Issues: 9 | Issues: 24

LocalAIVoiceChat

Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.

Language: Python | License: NOASSERTION | Stargazers: 344 | Issues: 6 | Issues: 11

PCT

This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)

Language: Python | License: MIT | Stargazers: 269 | Issues: 5 | Issues: 37

StyleSync_PyTorch

PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"

MeMOTR

[ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking

Language: Python | License: MIT | Stargazers: 129 | Issues: 5 | Issues: 17

APTM

The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"

Language: Python | License: MIT | Stargazers: 117 | Issues: 4 | Issues: 19

BUCTD

[ICCV 2023] "Rethinking pose estimation in crowds: overcoming the detection information-bottleneck and ambiguity"

Language: Python | License: Apache-2.0 | Stargazers: 80 | Issues: 10 | Issues: 12

Speaker_diarization

Speech Diarization for scrum automation

Language: Jupyter Notebook | License: MIT | Stargazers: 80 | Issues: 1 | Issues: 1

ContextAware-PoseFormer

The project is an official implementation of our paper "A Single 2D Pose With Context is Worth Hundreds for 3D Human Pose Estimation".

Language: Python | Stargazers: 57 | Issues: 0 | Issues: 12

svt

Scattering Vision Transformer

Lightweight-Face-Detector-Pruning

Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geometric Median Criterion", Proc. IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW 2024), Waikoloa, Hawaii, USA, Jan. 2024. Repository updated in April 2024.

Language: Python | Stargazers: 9 | Issues: 0 | Issues: 0

create-high-quality-dataset-for-computer-vision

This project focuses on generating a diverse and realistic dataset for computer vision training using ChatGPT and a realistic vision image generation model. The process involves dynamically creating prompts, utilizing ChatGPT to generate image descriptions, and generating images based on those descriptions.

Language: Jupyter Notebook | Stargazers: 7 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 4 | Issues: 0 | Issues: 0