Fan (CauchyFanUpdate)

Company: Institute of Automation, Chinese Academy of Sciences & PCL

Location: Beijing

Fan's starred repositories

LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 32711 | Issues: 350 | Issues: 102

my-tv

My TV: live TV streaming software, ready to use once installed

Language: C | License: Apache-2.0 | Stargazers: 30143 | Issues: 211 | Issues: 900

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 22242 | Issues: 187 | Issues: 504

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript | License: Apache-2.0 | Stargazers: 7839 | Issues: 55 | Issues: 66

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

VAR

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 4261 | Issues: 116 | Issues: 82

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

CCTV_Viewer

TV browser: a simple TV video viewing app for conveniently watching web videos on a set-top box

MaoTai_GUIT

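JD.com flash-purchase tool and JD.com Moutai purchase tool for Windows; works out of the box with no environment setup. Development is about to begin (open-source license: Apache License). Moutai purchase bot, Moutai script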

Language: Python | License: Apache-2.0 | Stargazers: 1398 | Issues: 47 | Issues: 0

KnowledgeGraph

Knowledge graph: building a knowledge graph from scratch

mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Language: Python | License: MIT | Stargazers: 1004 | Issues: 18 | Issues: 70

awesome-gpt

🏆 An awe-inspiring collection of resources, encompassing a wide range of tools, documents, resources, applications, and use cases related to ChatGPT.

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

transfusion-pytorch

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI

Language: Python | License: MIT | Stargazers: 704 | Issues: 33 | Issues: 17

AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 700 | Issues: 13 | Issues: 53

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language: Python | License: NOASSERTION | Stargazers: 697 | Issues: 19 | Issues: 5

Visual-Chinese-LLaMA-Alpaca

Multimodal Chinese LLaMA & Alpaca large language models (VisualCLA)

Language: Python | License: Apache-2.0 | Stargazers: 423 | Issues: 9 | Issues: 13

Awesome-Medical-Dataset

Collection of awesome medical dataset resources.

EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Language: Python | License: MIT | Stargazers: 229 | Issues: 8 | Issues: 16

Reading_groups

A paper and resource list for large language models, including courses, papers, demos, and figures

Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

Language: HTML | License: MIT | Stargazers: 94 | Issues: 4 | Issues: 1

CoPrompt

[ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models

Language: Python | License: MIT | Stargazers: 57 | Issues: 2 | Issues: 7

ChatterBox

ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues

Language: Python | License: Apache-2.0 | Stargazers: 50 | Issues: 1 | Issues: 6
License: Apache-2.0 | Stargazers: 31 | Issues: 3 | Issues: 0

VideoNIAH

VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs

Awesome-AI-Environment

A general PyTorch environment for following the most up-to-date algorithms.

License: Apache-2.0 | Stargazers: 8 | Issues: 2 | Issues: 0