SivilTaram

Qian's repositories

Persona-Dialogue-Generation

The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"

Language:PythonMIT309 5 34

code-html-to-markdown

A lightweight script for processing HTML page to markdown format with support for code blocks

Language:HTMLMIT81 20

CHASE

Synthetic Data Generation for Evaluation

MIT200

OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Apache-2.0100

santacoder-finetuning-commit

Fine-tune SantaCoder for Code/Text Generation.

Language:PythonApache-2.01 10

simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Language:PythonMIT100

SivilTaram.github.io

Language:HTML1 20

axolotl

Go ahead and axolotl questions

Language:PythonApache-2.0000

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonApache-2.0000

catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Language:PythonApache-2.0010

dclm

DataComp for Language Models

Language:HTMLMIT000

dl4c.github.io

Deep Learning for Code Website

Language:HTMLApache-2.0000

dl4c.github.io-1

✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com

Language:HTMLMIT000

extract-expert

Extract a single expert from an MoE model of Mixtral architecture, using slerp

Language:PythonApache-2.0000

imp

Language:Python000

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonBSD-3-Clause010

Megatron-LLM

distributed trainer for LLMs

Language:PythonNOASSERTION010

mergekit

Tools for merging pretrained large language models.

Language:PythonLGPL-3.0010

oat

🌾 OAT: Online AlignmenT for LLMs

Language:PythonApache-2.0000

OpenAgents

OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonApache-2.0010

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.0010

Precision-RL

Defeating the Training-Inference Mismatch via FP16

MIT000

sailcraft

Data Toolkit for Sailor Language Models

Language:Python000

SivilTaram

020

steplaw

000

surya

Accurate line-level text detection and recognition (OCR) in any language

Language:PythonGPL-3.0010

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.0010

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookApache-2.0010

verl-pipeline

Async pipelined version of Verl

Apache-2.0000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.0000