Yihao Feng (yihaocs)

yihaocs

Geek Repo

Company:Salesforce Research

Location:Palo Alto, CA

Github PK Tool:Github PK Tool

Yihao Feng's starred repositories

pyautogui

A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

Language:PythonLicense:BSD-3-ClauseStargazers:9819Issues:184Issues:695

self-operating-computer

A framework to enable multimodal models to operate a computer.

Language:PythonLicense:MITStargazers:8099Issues:113Issues:122

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:5755Issues:65Issues:144

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonLicense:Apache-2.0Stargazers:5086Issues:51Issues:184

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4228Issues:68Issues:65

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4174Issues:62Issues:90

vimGPT

Browse the web with GPT-4V and Vimium

Language:PythonLicense:MITStargazers:2519Issues:25Issues:22

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2427Issues:41Issues:0

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2085Issues:23Issues:19

i2vgen-xl

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Language:PythonLicense:MITStargazers:1555Issues:15Issues:73

tarsier

Vision utilities for web interaction agents 👀

Language:Jupyter NotebookLicense:MITStargazers:1210Issues:7Issues:12

GPT-4V-Act

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

ReWOO

Decoupling Reasoning from Observations for Efficient Augmented Language Models

Language:PythonLicense:MITStargazers:856Issues:22Issues:12

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:848Issues:26Issues:31

MFTCoder

High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Language:PythonLicense:NOASSERTIONStargazers:585Issues:8Issues:45

EvaluationPapers4ChatGPT

Resource, Evaluation and Detection Papers for ChatGPT

lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Language:PythonLicense:MITStargazers:419Issues:10Issues:4

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language:PythonLicense:Apache-2.0Stargazers:267Issues:5Issues:28

llm-decontaminator

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Language:PythonLicense:Apache-2.0Stargazers:177Issues:3Issues:5

GPT-V-on-Web

👀🧠 GPT-4 Vision x 💪⌨️ Vimium = Autonomous Web Agent

Language:PythonLicense:AGPL-3.0Stargazers:155Issues:4Issues:1

cpl

Code for Contrastive Preference Learning (CPL)

Language:PythonLicense:MITStargazers:136Issues:3Issues:9

VideoLDM

Unofficial PyTorch implementation of the VideoLDM.

Language:PythonLicense:MITStargazers:136Issues:13Issues:7
Language:PythonLicense:MITStargazers:114Issues:3Issues:0

DuckTrack

Multimodal computer agent data collection program

Language:PythonLicense:MITStargazers:98Issues:3Issues:9

ScaleLong

The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (NeurIPS 2023)

Language:PythonStargazers:47Issues:4Issues:0

VLC

Research code for "Training Vision-Language Transformers from Captions Alone"

Language:PythonLicense:Apache-2.0Stargazers:15Issues:0Issues:0