Qian (SivilTaram)

SivilTaram

Geek Repo

Company:Research Scientist @ Sea AI Lab

Location:Singapore

Home Page:http://siviltaram.github.io/

Twitter:@sivil_taram

Github PK Tool:Github PK Tool


Organizations
buaase
MLNLP-World

Qian's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:20582Issues:194Issues:2940

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:12360Issues:116Issues:499

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonLicense:GPL-3.0Stargazers:8402Issues:74Issues:84

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5435Issues:67Issues:389

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:3812Issues:47Issues:235

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

starcoder2

Home of StarCoder2!

Language:PythonLicense:Apache-2.0Stargazers:1533Issues:18Issues:17

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonLicense:MITStargazers:894Issues:13Issues:32

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:890Issues:41Issues:58

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language:PythonLicense:MITStargazers:643Issues:12Issues:22

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language:PythonLicense:NOASSERTIONStargazers:510Issues:17Issues:27

gritlm

Generative Representational Instruction Tuning

Language:Jupyter NotebookLicense:MITStargazers:439Issues:8Issues:32

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language:PythonLicense:NOASSERTIONStargazers:201Issues:5Issues:12

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

LongMamba

Some preliminary explorations of Mamba's context scaling.

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonLicense:MITStargazers:157Issues:7Issues:28

sailor-llm

Sailor: Open Language Models for South-East Asia

Language:PythonLicense:MITStargazers:78Issues:7Issues:1

Agent-Smith

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Language:PythonLicense:MITStargazers:54Issues:6Issues:1

astraios

Astraios: Parameter-Efficient Instruction Tuning Code Language Models

Language:Jupyter NotebookLicense:MITStargazers:52Issues:4Issues:4

weak-to-strong

Weak-to-Strong Jailbreaking on Large Language Models

Language:PythonLicense:MITStargazers:47Issues:0Issues:0

autofd

Automatic Functional Differentiation in JAX

Language:PythonLicense:Apache-2.0Stargazers:44Issues:4Issues:3

AnyDoor

AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models

youtube_subtitle_dataset

YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training

SciTab

The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"

thaimaimee

Scrape, clean and explore ThaiME dataset

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:12Issues:2Issues:0
Language:PythonStargazers:4Issues:0Issues:0