xyzjin

xyzjin's starred repositories

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 12977 | Issues: 0

MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Language: Python | License: AGPL-3.0 | Stargazers: 7510 | Issues: 0

everyone-can-use-english

Everyone Can Use English

Language: TypeScript | License: MPL-2.0 | Stargazers: 23605 | Issues: 0

Visual-Instruction-Tuning

SVIT: Scaling up Visual Instruction Tuning

Language: Python | License: MIT | Stargazers: 157 | Issues: 0

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 232 | Issues: 0

coyo-dataset

COYO-700M: Large-scale Image-Text Pair Dataset

Language: Python | Stargazers: 1125 | Issues: 0

HanLP

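Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing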

Language: Python | License: Apache-2.0 | Stargazers: 33270 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 55009 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5442 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9653 | Issues: 0

UFE-AVS

Official code for the CVPR 2024 paper "Audio-Visual Segmentation via Unlabeled Frame Exploitation"

Language: Python | Stargazers: 9 | Issues: 0

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language: Python | License: Apache-2.0 | Stargazers: 1276 | Issues: 0

developer2gwy

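The civil service exam from getting started to passing: the best practical guide to the civil service exam for programmers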

License: NOASSERTION | Stargazers: 6912 | Issues: 0

lectures

Material for cuda-mode lectures

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2107 | Issues: 0

cuda_learning

learning how CUDA works

Language: Cuda | Stargazers: 125 | Issues: 0

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language: Python | License: BSD-3-Clause | Stargazers: 1471 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 262 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 718 | Issues: 0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 10077 | Issues: 0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7890 | Issues: 0

ms-swift

Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language: Python | License: Apache-2.0 | Stargazers: 2826 | Issues: 0

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 1127 | Issues: 0

CUDA-Learn-Notes

🎉 CUDA/C++ notes / hand-written CUDA kernels for large models / tech blog, updated irregularly: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda | License: GPL-3.0 | Stargazers: 995 | Issues: 0

Tianji

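Tianji is a large language model system focused on social etiquette and interpersonal savvy. You can use it for tasks involving traditional social know-how, such as how to give compliments or how to handle social situations tactfully, to improve your "emotional intelligence" and "core competitiveness".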

Language: Python | License: Apache-2.0 | Stargazers: 301 | Issues: 0

ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Language: Python | License: Apache-2.0 | Stargazers: 264 | Issues: 0

dataset

The Open Images dataset

Language: Python | License: Apache-2.0 | Stargazers: 4239 | Issues: 0

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 470 | Issues: 0

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4530 | Issues: 0

Uniaa

Unified Multi-modal IAA Baseline and Benchmark

Stargazers: 68 | Issues: 0

LLaVA-UHD-Better

A bug-free and improved implementation of LLaVA-UHD, based on the code from the official repo

Language: Python | License: Apache-2.0 | Stargazers: 27 | Issues: 0