CauchyFan

Institute of Automation, Chinese Academy of Sciences & PCL

Beijing

Fan's starred repositories

LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook; License: NOASSERTION; 32711 350 102

my-tv

My TV: live TV streaming software, ready to use once installed

Language: C; License: Apache-2.0; 30143 211 900

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python; License: Apache-2.0; 22242 187 504

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript; License: Apache-2.0; 7839 55 66

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

License: Apache-2.0; 5165 83 9

VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python; License: MIT; 4261 116 82

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Telechat

Language: Python; 1793 21 61

CCTV_Viewer

TV browser: a simple TV video viewing app for conveniently watching web videos on a set-top box

Language: Java; 1714 12 96

MaoTai_GUIT

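JD.com flash-purchase tool: a Windows client for snapping up Moutai on JD.com that works out of the box with no environment setup. Development is about to begin (open-sourced under the Apache License). Moutai purchase bot, Moutai script.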

Language: Python; License: Apache-2.0; 1398 470

KnowledgeGraph

Knowledge graph: building a knowledge graph from scratch

Language: Python; 1134 47 3

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Language: Python; License: MIT; 1004 18 70

awesome-gpt

🏆 An awe-inspiring collection of resources, encompassing a wide range of tools, documents, resources, applications, and use cases related to ChatGPT.

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language: Python; 811 10 33

transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Language: Python; License: MIT; 704 33 17

AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language: Jupyter Notebook; License: Apache-2.0; 700 13 53

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language: Python; License: NOASSERTION; 697 19 5

Visual-Chinese-LLaMA-Alpaca

Multimodal Chinese LLaMA & Alpaca large language models (VisualCLA)

Language: Python; License: Apache-2.0; 423 9 13

Awesome-Medical-Dataset

Collection of awesome medical dataset resources.

EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Language: Python; License: MIT; 229 8 16

OpenVid-1M

Language: Python; 192 3 14

Reading_groups

A paper & resource list of large language models, including course, paper, demo, figures

Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

Language: HTML; License: MIT; 94 4 1

CoPrompt

[ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models

Language: Python; License: MIT; 57 2 7

ChatterBox

ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues

Language: Python; License: Apache-2.0; 50 1 6

LLaFS

PaperReading

License: Apache-2.0; 31 30

VideoNIAH

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Language: Python; 25 1 1

Awesome-AI-Environment

A general pytorch environment to follow most up-to-date algorithms.

License: Apache-2.0; 8 20