yukang (yukang2017)

Company: CUHK

Location: Hong Kong

Home Page: yukangchen.com


Organizations
dvlab-research

yukang's starred repositories

lloco

The official repo for "LLoCo: Learning Long Contexts Offline"

Language: Python | License: MIT | Stargazers: 72 | Issues: 0

ring-flash-attention

Ring attention implementation with flash attention

Language: Python | Stargazers: 317 | Issues: 0

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 231 | Issues: 0

MiniGemini

Official implementation for Mini-Gemini

Language: Python | License: Apache-2.0 | Stargazers: 2538 | Issues: 0

LSK3DNet

This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at CVPR 2024).

License: MIT | Stargazers: 17 | Issues: 0

DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Language: Python | License: Apache-2.0 | Stargazers: 584 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 47654 | Issues: 0

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

Stargazers: 702 | Issues: 0

CUHK-PHD-Thesis-Template

CUHK PhD Thesis Template

Language: TeX | Stargazers: 47 | Issues: 0

notus

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

Language: Python | License: MIT | Stargazers: 147 | Issues: 0

ChunkLlama

Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 189 | Issues: 0

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python | License: Apache-2.0 | Stargazers: 770 | Issues: 0

Language: Python | License: MIT | Stargazers: 47 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 6788 | Issues: 0

Long-Context-Data-Engineering

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Language: Python | Stargazers: 302 | Issues: 0

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 3922 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 154 | Issues: 0

VIRL

Code for V-IRL: Grounding Virtual Intelligence in Real Life

Language: Python | Stargazers: 238 | Issues: 0

DDSM

Denoising Diffusion Step-aware Models (ICLR2024)

Language: Python | License: MIT | Stargazers: 38 | Issues: 0

code-act

Official Repo for paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

Language: Python | Stargazers: 173 | Issues: 0

TravelPlanner

Dataset and code for the paper "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"

Language: Python | License: MIT | Stargazers: 110 | Issues: 0

LongAlign

LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation

Language: Python | License: Apache-2.0 | Stargazers: 96 | Issues: 0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language: Python | Stargazers: 1189 | Issues: 0

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language: SAS | Stargazers: 180 | Issues: 0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1844 | Issues: 0

Language: Python | Stargazers: 211 | Issues: 0

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language: Python | License: Apache-2.0 | Stargazers: 1445 | Issues: 0

Language: Python | Stargazers: 394 | Issues: 0

ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Language: Python | License: Apache-2.0 | Stargazers: 10785 | Issues: 0

Language: Python | License: MIT | Stargazers: 814 | Issues: 0