wanng-ide

WANGJUNJIE's starred repositories

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT20046 152 265

weiboSpider

新浪微博爬虫，用python爬取新浪微博数据

Language:Python8354 132 537

FileCentipede

Cross-platform internet upload/download manager for HTTP(S), FTP(S), SSH, magnet-link, BitTorrent, m3u8, ed2k, and online videos. WebDAV client, FTP client, SSH client.

Language:C++7539 68 655

disco-diffusion

Language:Jupyter NotebookNOASSERTION7474 117 80

FastSAM

Fast Segment Anything

Language:PythonAGPL-3.07417 56 204

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT5965 178 15

video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonApache-2.05887 44 273

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

Language:PythonApache-2.04010 57 294

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT3439 64 54

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. （将PaddleOCR模型做了转换，采用ONNXRuntime推理，速度很快）

Language:PythonApache-2.02885 43 119

An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

Language:Jupyter NotebookMIT1847 18 81

Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Language:PythonMIT1304 40 1

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language:PythonApache-2.01121 11 84

FightingCV-Paper-Reading

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀

Language:Shell793 150

BERT-whitening

简单的向量白化改善句向量质量

Language:Python481 6 12

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonMIT361 6 36

RapidVideOCR

Extract video hard subtitles and automatically generate corresponding srt files.

Language:PythonApache-2.0321 3 33

SimDeblur

Simple framework for image and video deblurring, implemented by PyTorch

Language:PythonMIT307 9 19

TANet

[IJCAI 2022, Official Code] for paper "Rethinking Image Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. 首个面向多主题场景的美学评估数据集、算法和benchmark.

Language:PythonApache-2.0191 4 22