Yuechen (JulianJuaner)

JulianJuaner

Geek Repo

Company:CUHK, SmartMore

Location:Hong Kong SAR

Home Page:julianjuaner.github.io

Github PK Tool:Github PK Tool

Yuechen's starred repositories

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:11175Issues:0Issues:0

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonLicense:MITStargazers:1105Issues:0Issues:0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4397Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3958Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12795Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2108Issues:0Issues:0

vid2vid

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.

Language:PythonLicense:NOASSERTIONStargazers:8552Issues:0Issues:0

PnPInversion

[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

Language:Jupyter NotebookStargazers:220Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:921Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:60127Issues:0Issues:0

MotionDirector

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Language:PythonLicense:Apache-2.0Stargazers:773Issues:0Issues:0

AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:611Issues:0Issues:0

webvid

Large-scale text-video dataset. 10 million captioned short videos.

Language:PythonStargazers:558Issues:0Issues:0

MR-GSM8K

Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs

Language:PythonLicense:MITStargazers:37Issues:0Issues:0

visualblocks

Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-and-drop ML components, including models, user inputs, processors, and visualizations.

Language:PythonLicense:Apache-2.0Stargazers:1141Issues:0Issues:0

MoTCoder

This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.

Language:PythonStargazers:58Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5738Issues:0Issues:0

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1584Issues:0Issues:0

OpenLRM

An open-source impl. of Large Reconstruction Models

Language:PythonLicense:Apache-2.0Stargazers:876Issues:0Issues:0

StyleCrafter

StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter

Language:PythonLicense:Apache-2.0Stargazers:176Issues:0Issues:0

PointTransformerV3

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Language:PythonLicense:MITStargazers:656Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10656Issues:0Issues:0
Language:PythonStargazers:16Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:2972Issues:0Issues:0
Language:PythonLicense:MITStargazers:4Issues:0Issues:0

DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Language:Jupyter NotebookStargazers:1950Issues:0Issues:0

Prompt-Highlighter

[CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Language:PythonLicense:MITStargazers:115Issues:0Issues:0

DynamiCrafter

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:2238Issues:0Issues:0

ComfyUI_UltimateSDUpscale

ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.

Language:PythonLicense:GPL-3.0Stargazers:704Issues:0Issues:0