Tianyu Zhang (tianyu-z)

tianyu-z

Geek Repo

Company:Mila

Location:Montreal

Home Page:ai.t-zhang.com

Github PK Tool:Github PK Tool

Tianyu Zhang's repositories

VCR

Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.

Language:PythonLicense:CC-BY-SA-4.0Stargazers:8Issues:0Issues:0

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

VCR-wiki-en-easy-test-500

Raw data for VCR-wiki-en-easy-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-en-easy-test-500

License:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

VCR-wiki-zh-easy-test-500

Raw data for VCR-wiki-zh-easy-test-100 from https://huggingface.co/datasets/vcr-org/VCR-wiki-zh-easy-test-100

License:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

VCR-wiki-zh-hard-test-500

Raw data for VCR-wiki-zh-hard-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-zh-hard-test-500

License:CC-BY-SA-4.0Stargazers:1Issues:0Issues:0

AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

License:Apache-2.0Stargazers:0Issues:0Issues:0

Best-README-Template

An awesome README template to jumpstart your projects!

License:MITStargazers:0Issues:0Issues:0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Connect-4-Gym-env-Reinforcement-learning

Connect Four Environment is a project designed for training reinforcement learning models to play the classic Connect4 game. It's compatible with OpenAI Gym / Gymnasium, includes a variety of bots, an Elo leaderboard system, and supports both FCN and CNN policies.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamerv3

Mastering Diverse Domains through World Models

License:MITStargazers:0Issues:0Issues:0

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

License:GPL-3.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

License:Apache-2.0Stargazers:0Issues:0Issues:0

light_on_chatgpt

Good for e-ink monitor user to use ChatGPT. It makes the code blocks white and makes the UI wider.

Language:CSSLicense:MITStargazers:0Issues:1Issues:0

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

maze-transformer

This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

MergeLM

Codebase for Merging Language Models

Stargazers:0Issues:0Issues:0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

License:Apache-2.0Stargazers:0Issues:0Issues:0

multipleWindow3dScene

A quick example of how one can "synchronize" a 3d scene across multiple windows using three.js and localStorage

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

License:MITStargazers:0Issues:0Issues:0

pymdp

A Python implementation of active inference for Markov Decision Processes

License:MITStargazers:0Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

surya

OCR, layout analysis, reading order, line detection in 90+ languages

License:GPL-3.0Stargazers:0Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

License:MITStargazers:0Issues:0Issues:0

VCR-wiki-en-hard-test-500

Raw data for VCR-wiki-en-hard-test-500 from https://huggingface.co/datasets/vcr-org/VCR-wiki-en-hard-test-500

License:CC-BY-SA-4.0Stargazers:0Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Yuan-2.0

Yuan 2.0 Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0