Artur Tanona's repositories
transformers-for-lawyers
AI apps/benchmark for legaltech
alpaca-lora
Finetuning InstructLLaMA on consumer hardware
Blackstone
:black_circle: A spaCy pipeline and model for NLP on unstructured legal text.
dummy_project
spacy drills
finetune-gpt2xl
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
imbalanced-dataset-sampler
A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones
invalid_flow_of_jina
debugging
language-models-are-knowledge-graphs-pytorch
Language models are open knowledge graphs ( non official implementation )
Legal-Text-Analytics
A list of selected resources, methods, and tools dedicated to Legal Text Analytics.
llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
long_sequence_summarization
Side project for learning purposes
ML_Resources
GitHub Repo with various ML/AI/DS resources that I find useful
nlp_paper_summaries
✍️ A carefully curated list of NLP paper summaries
poleval-2018
Code and Data Accompanying the Paper "Approaching nested named entity recognition with parallel LSTM-CRFs"
Semantic-Search
Semantic search using Transformers and others
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
transformer-kernel-ranking
TK & TKL - Efficient Transformer-based neural re-ranking models
TransformerCVAE
Transformer-based Conditional Variational Autoencoder for Controllable Story Generation
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.