茶豚's repositories
AI-Paper-Collector
Fully-automated scripts for collecting AI-related papers
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.
leetcode
🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解
LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.(用动画的形式呈现解LeetCode题目的思路)
LLM4IR-Survey
This is the repo for the survey of LLM4IR.
cam-notes
My Cambridge Lecture Notes
CLUEDatasetSearch
搜索所有中文NLP数据集,附常用英文NLP数据集
CSL
[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
google-search-results-python
Google Search Results via SERP API pip Python Package
LeetCode-Go
✅ Solutions to LeetCode by Go, 100% test coverage, runtime beats 100% / LeetCode 题解
leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
leetcode_cpp
LeetCode Problems' Solutions
leetcode_java
Leetcode solutions
List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
paper-reading
深度学习经典、新论文逐段精读
PyMuPDF
PyMuPDF is an enhanced Python binding for MuPDF – a lightweight PDF, XPS, and E-book viewer, renderer, and toolkit.
stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
text2vec
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.