David Park's repositories
patent-crawler
Patent Crawler is a python program to crawl patent information from Google Patent with given keywords.
LLM_Mini_Series_Part_II
This is a demo repository for parallel multi-index question answering using streamlit and llama index
awesome-mac
Now we have become very big, Different from the original idea. Collect premium software in various categories.
Machine-Learning-Tutorials
machine learning and deep learning tutorials, articles and other resources
DoctorGPT
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
AI-DeepFakes
This repository contains the source code for the paper First Order Motion Model for Image Animation
Chain-of-Thought
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. Meanwhile, we created a new branch to build a Tabular LLM.(我们分别统一了丰富的IFT数据(如CoT数据,目前仍不断扩充)、多种训练效率方法(如lora,p-tuning)以及多种LLMs,三个层面上的接口,打造方便研究人员上手的LLM-IFT研究平台。同时tabular_llm分支构建了面向表格智能任务的LLM。
US-AI-Patents
Code and Data for Text Classification of AI Related Patents Research Paper
uspto-patent-citation-graph
Graph that downloads patent citation data from USPTO's PatentsView API on-demand and stores it locally in an SQL database (and in memory) for fast access later.
patents-public-data
Patent analysis using the Google Patents Public Datasets on BigQuery
schema-registry
Confluent Schema Registry for Kafka
xlambda
Predicts AWS Lambda demand and keeps a fleet of containers warm to mitigate cold-start latency
awesome-patent-retrieval
Curated list of resources for processing patent data
pke
Python Keyphrase Extraction module
Data-Analysis
Data Science Using Python
spring-data-cassandra
Provides support to increase developer productivity in Java when using Apache Cassandra. Uses familiar Spring concepts such as a template classes for core API usage and lightweight repository style data access.
pygrams
Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
go-control-plane
Go implementation of data-plane-api
Turbo-Boost-Switcher
Turbo Boost disabler / enable app for Mac OS X
eks-tf-gitops
A fully functional and secure EKS cluster provisioned with Terraform and powered by ArgoCD
playbook
Guides for getting things done, programming well, and programming in style.
PatCit
Making Patent Citations Uncool Again
david-blog
🚀⚡️ Blazing fast blog built with Gatsby and the Cosmic Headless CMS 🔥
vim-bootstrap
Vim Bootstrap is a generator that provides a simple method of generating a configuration for vim / neovim.
Chandrahasa
A solid recon tool I use personally.
awesome-ds-setting
A tutorial for setting a new machine with core data science tools