tensorboy

tensorboy

Geek Repo

Company:TikTok Inc

Home Page:www.wangpengan.com

Github PK Tool:Github PK Tool

tensorboy's starred repositories

openui

OpenUI let's you describe UI using your imagination, then see it rendered live.

Language:HTMLLicense:Apache-2.0Stargazers:16205Issues:105Issues:119

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

pyvideotrans

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音

Language:PythonLicense:GPL-3.0Stargazers:6539Issues:44Issues:401

oneuptime

OneUptime is the complete open-source observability platform.

Language:TypeScriptLicense:Apache-2.0Stargazers:4478Issues:26Issues:322

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:3538Issues:40Issues:43

safetensors

Simple, safe way to store and distribute tensors

Language:PythonLicense:Apache-2.0Stargazers:2538Issues:41Issues:158

TrWebOCR

开源易用的中文离线OCR,识别率媲美大厂,并且提供了易用的web页面及web的接口,方便人类日常工作使用或者其他程序来调用~

Language:PythonLicense:Apache-2.0Stargazers:2531Issues:53Issues:96

text-embeddings-inference

A blazing fast inference solution for text embeddings models

Language:RustLicense:Apache-2.0Stargazers:2148Issues:27Issues:174

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language:PythonLicense:Apache-2.0Stargazers:1496Issues:16Issues:11

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1197Issues:29Issues:87

kan-gpt

The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling

Language:PythonLicense:MITStargazers:628Issues:8Issues:10

Groma

Grounded Multimodal Large Language Model with Localized Visual Tokenization

Language:PythonLicense:Apache-2.0Stargazers:446Issues:36Issues:11

ao

Native PyTorch library for quantization and sparsity

Language:PythonLicense:BSD-3-ClauseStargazers:303Issues:18Issues:46

BLIVA

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Language:PythonLicense:BSD-3-ClauseStargazers:234Issues:12Issues:18

rocketnotes

LLM-powered Markdown editor

Language:TypeScriptLicense:MITStargazers:228Issues:5Issues:28

FILM

Official repo for "Make Your LLM Fully Utilize the Context"

Language:PythonLicense:MITStargazers:218Issues:5Issues:4

smart_classroom_demo

群体课堂专注度分析、考试作弊系统、动态点名功能的Qt Demo,使用多人姿态估计、情绪识别、人脸识别、静默活体检测等技术

Language:PythonLicense:MITStargazers:204Issues:4Issues:46

OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

Language:PythonLicense:MITStargazers:187Issues:2Issues:29

mmdit

Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch

Language:PythonLicense:MITStargazers:161Issues:3Issues:0

WorldGPT

WorldGPT: Empowering LLM as Multimodal World Model

Language:Jupyter NotebookStargazers:107Issues:0Issues:0

CuMo

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Language:PythonLicense:Apache-2.0Stargazers:94Issues:1Issues:9

BunnyVisionPro

Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro

Language:PythonLicense:MITStargazers:93Issues:5Issues:2

MLLM-Tool

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

Language:PythonLicense:MITStargazers:78Issues:2Issues:1
Language:PythonLicense:MITStargazers:56Issues:0Issues:0

Pink

Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs

self-reasoning-tokens-pytorch

Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto

Language:PythonLicense:MITStargazers:49Issues:6Issues:0

POVID

[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Language:PythonLicense:Apache-2.0Stargazers:44Issues:0Issues:0

UCoFiA

Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)

Language:PythonLicense:MITStargazers:44Issues:3Issues:1

youtube_yapper_trapper

CrewAI agents that gather and analyze YouTube comments to generate insights to inform better content creation.