ParadoxZW

Zhenwei's repositories

LLaVA-UHD-Better

A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo

Language:PythonApache-2.031 4 6

NodeGo

A Node.js Web Server for Go Game AI, powered by WGo.js, SabakiHQ/gtp and leela-zero.

Language:JavaScriptAGPL-3.06 20

CosAttention2d

a 2D cosine attention module inspired by cosFormer: Rethinking Softmax in Attention(https://arxiv.org/abs/2202.08791)

Language:PythonApache-2.03 2 1

NERD

NERD: Named Entity Representations for Disambiguation. 2020大学生服务外包大赛国二

Language:Python3 20

SamaritanHDU

A roll-call system using face recognition technique and WeChat App Platform.

Language:Python2 20

Awesome-Multimodal-Large-Language-Models

Latest Papers and Datasets on Multimodal Large Language Models

100

prophet

Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

Language:PythonApache-2.0100

Automate-Anything-is-All-You-Need

010

ChuanhuChatGPT

GUI for ChatGPT API

Language:PythonGPL-3.0000

cosFormer

Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention

Language:PythonApache-2.0010

cosformer-pytorch

Unofficial PyTorch implementation of the paper "cosFormer: Rethinking Softmax In Attention".

Language:Jupyter NotebookMIT010

Dotfile

Language:Shell020

fancy-and-tricky

remarkable snippets!

Language:Python020

hexo-deploy-github-pages-action

🚀 GitHub action for deploying a Hexo project to GitHub pages.

Language:ShellMIT010

image-processing-from-scratch

This project contains some interesting image processing algorithms that were wrote in python and c++ from scratch.

Language:C++MIT010

imp

Powerful multimodal small language models

Language:PythonApache-2.0000

LLaVA

[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.

Language:PythonApache-2.0000

mmnas

Deep Multimodal Neural Architecture Search

Language:PythonApache-2.0010

mySIFT

course project

Language:Python020

openvqa

A lightweight, scalable, and general framework for visual question answering research

Language:PythonApache-2.0010

ParadoxZW.github.io

Language:HTML01 2

PATexercise

Language:C++010

Phi3V-Finetuning

Parameter-efficient finetuning script for Phi-3-vision, the strong multimodal language model by Microsoft.

Language:PythonApache-2.0000

PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Language:PythonApache-2.0000

shell_display.py

Display a image in shell using 20 lines Python code.

Language:Python020

Sketch2Attributes

predict the attributes of a sketch of humans

Language:Jupyter Notebook020

Test

test some GitHub feature

02 1

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0000

Visualize_Tool

Language:Python020

xmchat

Apache-2.0000