mw's repositories
WSDM-Cup-2024
1st-place solution for the Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc
hf-codegen
A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.
DHS-LLM-Workshop
DHS 2023 LLM Workshop
pymc-marketing
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
LLMmergekit
Tools for merging pretrained large language models.
RecAlgorithm
Implementations of mainstream ranking algorithms for recommender systems
qa-lora
PyTorch code for the paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Tencent2020_Rank1st
The code for the 2020 Tencent College Algorithm Contest; the online result ranked 1st.
leetcode
Python data structures and algorithms: LeetCode problems and books. Solving algorithm problems comes down to patterns and summaries! Crack LeetCode, not only how, but also why.
Tencent2019_Preliminary_Rank1st
The code for the 2019 Tencent College Algorithm Contest; the online result ranked 1st in the preliminary round.
Tencent2019_Finals_Rank1st
Complete code for the 2019 Tencent Advertising Algorithm Competition (champion)
recsys
Demo code for recommendation algorithms such as LR, FM, DeepFM, xDeepFM, DIN, and CF. Uses TFRecords as input, which makes it easy to apply in real-world scenarios.
RdfToArangoDBJson
This repository forms part of thesis research. The thesis can be downloaded at https://is.cuni.cz/webapps/zzp/download/120353259 for in-depth details.
Awesome-RecSystem-Models
Implementations of Awesome RecSystem Models with PyTorch/TF2.0
learn_python3_spider
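A Python web-scraping tutorial series for learning Python scraping from 0 to 1. Covers browser and mobile app packet capture (e.g. fiddler, mitmproxy); the modules used in scraping, such as requests, beautifulSoup, selenium, appium, and scrapy; IP proxies; CAPTCHA recognition; using MySQL and MongoDB databases from Python; multithreaded and multiprocess crawlers; reverse engineering of CSS-based anti-scraping encryption; JS reverse engineering for scraping; hands-on scraping project examples; and more.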
python-machine-learning-book-2nd-edition
The "Python Machine Learning (2nd edition)" book code repository and info resource
nickname-and-diminutive-names-lookup
A CSV file containing US given names (first names) and their associated nicknames or diminutive names.