npuichigo / pytorch_lmdb_dataset

pytorch lmdb dataset with protobuf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Pytorch Lmdb Dataloader

Use lmdb with protobuf to efficiently read big data for pytorch training.

Getting Started

  1. Install python and protobuf. It's convinient to get protoc in grpc_tools.
pip install grpcio grpcio-tools
  1. Generate proto.
python -m grpc_tools.protoc -I./proto --python_out=./proto ./proto/tensor.proto
  1. Create dummy training data.
python create_lmdb.py --output_file train_lmdb
  1. Run the unit testing.
python dataset_test.py

Reference

About

pytorch lmdb dataset with protobuf


Languages

Language:Python 100.0%