WANGJUNJIE (wanng-ide)

wanng-ide

Geek Repo

Company:Waseda University

Location:Japan

Home Page:https://wanng-ide.github.io/

Github PK Tool:Github PK Tool

WANGJUNJIE's starred repositories

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:26651Issues:724Issues:0

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:20046Issues:152Issues:265

weiboSpider

新浪微博爬虫,用python爬取新浪微博数据

FileCentipede

Cross-platform internet upload/download manager for HTTP(S), FTP(S), SSH, magnet-link, BitTorrent, m3u8, ed2k, and online videos. WebDAV client, FTP client, SSH client.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7474Issues:117Issues:80

FastSAM

Fast Segment Anything

Language:PythonLicense:AGPL-3.0Stargazers:7417Issues:56Issues:204

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonLicense:Apache-2.0Stargazers:5887Issues:44Issues:273

Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Language:PythonLicense:Apache-2.0Stargazers:4010Issues:57Issues:294

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

RapidOCR

Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)

Language:PythonLicense:Apache-2.0Stargazers:2885Issues:43Issues:119

Pix2Text

An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.

Language:Jupyter NotebookLicense:MITStargazers:1847Issues:18Issues:81

Awesome-Multimodal-Research

A curated list of Multimodal Related Research.

Language:PythonLicense:MITStargazers:1304Issues:40Issues:1

lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Language:PythonLicense:Apache-2.0Stargazers:1121Issues:11Issues:84

FightingCV-Paper-Reading

⭐⭐⭐FightingCV Paper Reading, which helps you understand the most advanced research work in an easier way 🍀 🍀 🍀

Language:ShellStargazers:793Issues:15Issues:0

BERT-whitening

简单的向量白化改善句向量质量

METER

METER: A Multimodal End-to-end TransformER Framework

Language:PythonLicense:MITStargazers:361Issues:6Issues:36

RapidVideOCR

Extract video hard subtitles and automatically generate corresponding srt files.

Language:PythonLicense:Apache-2.0Stargazers:321Issues:3Issues:33

SimDeblur

Simple framework for image and video deblurring, implemented by PyTorch

Language:PythonLicense:MITStargazers:307Issues:9Issues:19

TANet

[IJCAI 2022, Official Code] for paper "Rethinking Image Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. 首个面向多主题场景的美学评估数据集、算法和benchmark.

Language:PythonLicense:Apache-2.0Stargazers:191Issues:4Issues:22

MTO-Platform

Multitask Optimization Platform (MToP): A MATLAB Optimization Platform for Evolutionary Multitasking

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonStargazers:110Issues:1Issues:0

CoRe

[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models

Language:PythonStargazers:33Issues:0Issues:1

MIRTT

[EMNLP 2021] MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering

Language:PythonStargazers:8Issues:0Issues:0
Language:PythonStargazers:4Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0