Ph03nix's repositories
Maldataset-2021
Maldataset2021 is a malware dataset of 28 classes, where each class represents one malware family and each sample is a 224x224 RGB PNG file. The PNG files are generated from the original malware binaries. The image transformation lets malware be identified from the raw bytes of the entire executable file, rendered as an image, so that deep learning techniques such as CNNs, which have demonstrated strong performance on image classification, can be applied to malware classification. With this in mind, we provide a new dataset that contains the latest malware samples. The PNG files are split 70% for training and the remaining 30% for testing.
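The description above does not say how the binaries are turned into images, so the following is only a minimal sketch of one possible scheme, using the Python standard library. The byte layout (three consecutive bytes per pixel, zero padding, nearest-neighbour resampling) and the helper names `bytes_to_rgb_grid`, `encode_png`, and `split_70_30` are assumptions for illustration, not the dataset's actual pipeline. The split here is a plain shuffled split; whether the real one is stratified per family is not stated.

```python
import math
import random
import struct
import zlib

SIZE = 224  # image width and height stated in the dataset description


def bytes_to_rgb_grid(data: bytes, size: int = SIZE) -> list[list[tuple[int, int, int]]]:
    """Map raw bytes to a size x size grid of (R, G, B) pixels.

    Assumed scheme: three consecutive bytes form one pixel, pixels fill a
    square canvas row by row, and the canvas is resampled to size x size
    with nearest-neighbour lookup.
    """
    if not data:
        raise ValueError("empty input")
    n_pixels = math.ceil(len(data) / 3)
    side = math.ceil(math.sqrt(n_pixels))
    padded = data.ljust(side * side * 3, b"\x00")  # zero-pad the tail

    def pixel(row: int, col: int) -> tuple[int, int, int]:
        i = (row * side + col) * 3
        return (padded[i], padded[i + 1], padded[i + 2])

    return [
        [pixel(r * side // size, c * side // size) for c in range(size)]
        for r in range(size)
    ]


def encode_png(grid: list[list[tuple[int, int, int]]]) -> bytes:
    """Encode a pixel grid as an 8-bit RGB PNG (no interlacing)."""
    height, width = len(grid), len(grid[0])
    # Each scanline starts with filter type 0 (no filtering).
    raw = b"".join(b"\x00" + bytes(v for px in row for v in px) for row in grid)

    def chunk(tag: bytes, body: bytes) -> bytes:
        crc = zlib.crc32(tag + body)
        return struct.pack(">I", len(body)) + tag + body + struct.pack(">I", crc)

    # IHDR: width, height, bit depth 8, colour type 2 (RGB), defaults.
    header = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    return (
        b"\x89PNG\r\n\x1a\n"
        + chunk(b"IHDR", header)
        + chunk(b"IDAT", zlib.compress(raw))
        + chunk(b"IEND", b"")
    )


def split_70_30(items, seed: int = 0):
    """Shuffle with a fixed seed, then split 70% train / 30% test."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 7 // 10
    return shuffled[:cut], shuffled[cut:]
```

For example, `encode_png(bytes_to_rgb_grid(open(path, "rb").read()))` would produce the PNG bytes for one file, where `path` is a placeholder for a sample you are permitted to handle.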
AdalFlow
AdalFlow: The “PyTorch” library to auto-optimize any LLM tasks.
Adversarial_Examples_Papers
A list of recent papers about adversarial learning
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
AutoPoison
The official repository of the paper "On the Exploitability of Instruction Tuning".
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
blogs
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
Blogs_Content
Contains Google Colab or Jupyter notebooks, as well as other associated files for my Medium blogposts.
datasets-Creation
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
deep-learning-containers
AWS Deep Learning Containers (DLCs) are a set of Docker images for training and serving models in TensorFlow, TensorFlow 2, PyTorch, and MXNet.
GML2023
Graph Machine Learning course, Xavier Bresson, 2023
Large-Language-Model-Notebooks-Course
Practical course about Large Language Models.
llama-recipes
Scripts for fine-tuning Llama 2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Llama 2 for WhatsApp & Messenger.
llm-attacks
Universal and Transferable Attacks on Aligned Language Models
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
machine-learning-articles
🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.
MathForDeepLearning
Source code for the book "Math for Deep Learning" (No Starch Press)
microagents
Agents Capable of Self-Editing Their Prompts / Python Code
nebula
AI-Powered Ethical Hacking Assistant
Ollama-Colab-Integration
Jupyter Notebooks for Ollama integration
Red_Reaper_v2
Stage 1: Sensitive Email/Chat Classification for Adversary Agent Emulation (espionage). This project is meant to extend Red Reaper v1 which was presented at RSA San Francisco 2024.
Training-Small-Language-Model
Training Small Language Model
TrainingLLMs-LaTeX-MathematicalExpression-To-ValidPythonCode
Given a LaTeX expression, train an LLM to generate valid Python code.
unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory