Haijunlv's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:253Issues:0Issues:0

LabelLLM

The Open-Source Data Annotation Platform

Language:TypeScriptLicense:Apache-2.0Stargazers:428Issues:0Issues:0

labelU

Data annotation toolbox supports image, audio and video data.

Language:PythonStargazers:652Issues:0Issues:0

WanJuan1.0

万卷1.0多模态语料

License:CC-BY-4.0Stargazers:517Issues:0Issues:0

PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Language:PythonLicense:Apache-2.0Stargazers:4118Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:8473Issues:0Issues:0
Language:PythonStargazers:15Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:14247Issues:0Issues:0

doremi

Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets

Language:HTMLLicense:MITStargazers:281Issues:0Issues:0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonStargazers:9835Issues:0Issues:0

BiLLa

BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability

Language:PythonLicense:Apache-2.0Stargazers:421Issues:0Issues:0

pandallm

Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。

Language:PythonLicense:Apache-2.0Stargazers:1067Issues:0Issues:0

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2544Issues:0Issues:0

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:25729Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10096Issues:0Issues:0

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

Language:CudaStargazers:1339Issues:0Issues:0

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Language:C++License:NOASSERTIONStargazers:2310Issues:0Issues:0

lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

Language:C++License:NOASSERTIONStargazers:3151Issues:0Issues:0

LeetcodeTop

汇总各大互联网公司容易考察的高频leetcode题🔥

Stargazers:18480Issues:0Issues:0

RFNext

RF-Next: Efficient Receptive Field Search for CNN(TPAMI2022, CVPR2021) Try it, you wouldn't regret it!

Language:PythonStargazers:62Issues:0Issues:0

coco-minitrain

a subset of coco dataset for faster experimentation

Language:PythonStargazers:225Issues:0Issues:0

mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:1440Issues:0Issues:0

lightweight-neural-architecture-search

This is a collection of our zero-cost NAS and efficient vision applications.

Language:PythonLicense:Apache-2.0Stargazers:363Issues:0Issues:0

bagua

Bagua Speeds up PyTorch

Language:PythonLicense:MITStargazers:872Issues:0Issues:0

tfeat

TFeat descriptor models for BMVC 2016 paper "Learning local feature descriptors with triplets and shallow convolutional neural networks"

Language:Jupyter NotebookLicense:MITStargazers:148Issues:0Issues:0

pytorch-metric-learning

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Language:PythonLicense:MITStargazers:5921Issues:0Issues:0

AWS-OHL-AutoAug

An integration of several popular automatic augmentation methods, including OHL (Online Hyper-Parameter Learning for Auto-Augmentation Strategy) and AWS (Improving Auto Augment via Augmentation Wise Weight Sharing) by Sensetime Research.

Language:PythonStargazers:47Issues:0Issues:0

neural_network_papers

记录一些读过的论文,给出个人对论文的评分情况并简述论文insight

License:CC0-1.0Stargazers:162Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:3264Issues:0Issues:0