James Chang (strategist922)

strategist922

Geek Repo

Company:Microsoft

Location:Taipei, Taiwan

Github PK Tool:Github PK Tool


Organizations
THUKElab

James Chang's starred repositories

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:31792Issues:228Issues:3945

chroma

the AI-native open-source embedding database

Language:PythonLicense:Apache-2.0Stargazers:12633Issues:78Issues:976

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:10689Issues:144Issues:783

QAnything

Question and Answer based on Anything.

Language:PythonLicense:Apache-2.0Stargazers:9505Issues:88Issues:274

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7998Issues:92Issues:349

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonLicense:NOASSERTIONStargazers:7940Issues:100Issues:81

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6253Issues:60Issues:73

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:5769Issues:71Issues:211

camel

🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeruIPS'2023) https://www.camel-ai.org

Language:PythonLicense:Apache-2.0Stargazers:4521Issues:56Issues:216

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonLicense:Apache-2.0Stargazers:3832Issues:52Issues:89

anomalib

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Language:PythonLicense:Apache-2.0Stargazers:3227Issues:37Issues:790
Language:PythonLicense:MITStargazers:1164Issues:31Issues:22

ganomaly

GANomaly: Semi-Supervised Anomaly Detection via Adversarial Training

Language:PythonLicense:MITStargazers:833Issues:23Issues:84

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonLicense:MITStargazers:680Issues:15Issues:36

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language:PythonLicense:NOASSERTIONStargazers:637Issues:20Issues:5

APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Language:PythonLicense:Apache-2.0Stargazers:433Issues:6Issues:46

NeuralKG

[Tool] For Knowledge Graph Representation Learning

Language:PythonLicense:Apache-2.0Stargazers:322Issues:9Issues:43

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

MathVista

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Language:Jupyter NotebookLicense:CC-BY-SA-4.0Stargazers:182Issues:4Issues:21

lightning-attention

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Language:PythonLicense:MITStargazers:167Issues:11Issues:12

KnowPAT

[Paper][Preprint 2023] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering

whisper_dictation

Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, with images, voice control, in under 4 GiB of VRAM.

Language:PythonLicense:GPL-2.0Stargazers:122Issues:10Issues:3

GTA

[NeurIPS 23] Official repository for NeurIPS 2023 paper "Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction"

RMSIN

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Language:PythonStargazers:63Issues:0Issues:0

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language:PythonLicense:MITStargazers:51Issues:2Issues:2

SoRA

The source code of the EMNLP 2023 main conference paper: Sparse Low-rank Adaptation of Pre-trained Language Models.

Language:PythonStargazers:49Issues:0Issues:6

OCR-GAN

[TIP 2023] Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection

Language:PythonLicense:MITStargazers:31Issues:2Issues:6

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonLicense:MITStargazers:1Issues:0Issues:0