hanker's starred repositories
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
JimuReport
「开源可视化报表,商业BI替代方案」积木报表是一款类似excel操作风格,在线拖拽完成设计的报表工具。低代码产品的臂膀!功能涵盖: 报表设计、图形报表、打印设计、大屏设计等,完全免费!秉承“简单、易用、专业”的产品理念,极大的降低报表开发难度、缩短开发周期、解决各类报表难题。
llm-foundry
LLM training code for Databricks foundation models
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
awesome-data-labeling
A curated list of awesome data labeling tools
synthetic-data-generator
SDG is a specialized framework designed to generate high-quality structured tabular data.
differential-privacy
Google's differential privacy libraries.
waymo-open-dataset
Waymo Open Dataset
tf-encrypted
A Framework for Encrypted Machine Learning in TensorFlow
awesome-he
✨ Awesome - A curated list of amazing Homomorphic Encryption libraries, software and resources
Awesome-3D-Object-Detection-for-Autonomous-Driving
3D Object Detection for Autonomous Driving: A Comprehensive Survey (IJCV 2023)
My_Bibliography_for_Research_on_Autonomous_Driving
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
auto-regex
automatic regex generation tool
awesome-data-synthesis
A curated list of awesome resources for creating synthetic data
spark-privacy-preserver
Anonymizing Library for Apache Spark