slamandar's repositories
tensorflow
Computation using data flow graphs for scalable machine learning
CodeFuse-Query
QL-Based Code Analysis Engine NOT only for CodeFuse training data quality
Language:Jupyter NotebookApache-2.0000
codellm-data-preprocess-pipeline
代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota
Language:Python000
nsfw_data_scrapper
Collection of scripts to aggregate image data for the purposes of training an NSFW Image Classifier
Language:Jupyter Notebook000
tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Language:Jupyter NotebookApache-2.0000
UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Language:Python000