Qian (SivilTaram)

SivilTaram

User data from Github https://github.com/SivilTaram

Company:Researcher @ TikTok

Location:Singapore

Home Page:http://siviltaram.github.io/

GitHub:@SivilTaram

Twitter:@sivil_taram


Organizations
buaase
MLNLP-World
sail-sg
sea-sailor

Qian's repositories

Persona-Dialogue-Generation

The code of ACL 2020 paper "You Impress Me: Dialogue Generation via Mutual Persona Perception"

Language:PythonLicense:MITStargazers:309Issues:5Issues:34

code-html-to-markdown

A lightweight script for processing HTML page to markdown format with support for code blocks

Language:HTMLLicense:MITStargazers:81Issues:2Issues:0

CHASE

Synthetic Data Generation for Evaluation

License:MITStargazers:2Issues:0Issues:0

OctoThinker

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

License:Apache-2.0Stargazers:1Issues:0Issues:0

santacoder-finetuning-commit

Fine-tune SantaCoder for Code/Text Generation.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

bytepiece

更纯粹、更高压缩率的Tokenizer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

dl4c.github.io

Deep Learning for Code Website

Language:HTMLLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dl4c.github.io-1

✨ Build a beautiful and simple website in literally minutes. Demo at https://beautifuljekyll.com

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

extract-expert

Extract a single expert from an MoE model of Mixtral architecture, using slerp

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

infinigen

Infinite Photorealistic Worlds using Procedural Generation

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

Megatron-LLM

distributed trainer for LLMs

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:0Issues:1Issues:0

oat

🌾 OAT: Online AlignmenT for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OpenAgents

OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Precision-RL

Defeating the Training-Inference Mismatch via FP16

License:MITStargazers:0Issues:0Issues:0

sailcraft

Data Toolkit for Sailor Language Models

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:2Issues:0
Stargazers:0Issues:0Issues:0

surya

Accurate line-level text detection and recognition (OCR) in any language

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Triton-Puzzles

Puzzles for learning Triton

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

verl-pipeline

Async pipelined version of Verl

License:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0