Jravvit / serverless-libreoffice

Run LibreOffice in AWS Lambda to create PDFs & convert documents

Home Page:https://vladholubiev.com/serverless-libreoffice

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Serverless LibreOffice

πŸ‘‰πŸ» Read the blog post on Medium: How to Run LibreOffice in AWS Lambda for Dirty-Cheap PDFs at Scale πŸ‘ˆπŸ»

Show Me the Code

This repo contains code used to run the online demo.

β”œβ”€β”€ compile.sh  <-- commands used to compile LibreOffice for Lambda
β”œβ”€β”€ infra       <-- terraform config to deploy example Lambda
β”‚   β”œβ”€β”€ iam.tf
β”‚   β”œβ”€β”€ lambda.tf
β”‚   β”œβ”€β”€ main.tf
β”‚   β”œβ”€β”€ s3.tf
β”‚   └── vars.tf
└── src         <-- example Lambda function node in Node.js used for website demo
    β”œβ”€β”€ handler.js
    β”œβ”€β”€ libreoffice.js
    β”œβ”€β”€ logic.js
    β”œβ”€β”€ package.json <-- put lo.tar.gz in this folder to deploy. Download it below
    └── s3.js

Compiled and ready to use archive can be downloaded under Releases section. Also check out NPM package with bundled LibreOffice for Lambda (85 MB).

✨ Check out a new Lambda Layer with LibreOffice!

How to compile by yourself

Check out a comprehensive step-by-step tutorial from 0 to deployed function.

  1. Go to Lambda Execution Environment and Available Libraries page to get the latest AMI id
  2. Click on this link to get AMI id for your region
  3. Spin up a c5.2xlarge spot instance with ~ 100 GB of storage attached
  4. Follow the steps in compile.sh file in the repo

Help

Related Projects

How To Help

Reduce Cold Start Time

Currently Ζ› unpacks 109 MB .tar.gz to /tmp folder which takes ~1-2 seconds on cold start.

Would be nice to create a single compressed executable to save unpack time and increase portability. I tried using Ermine packager and it works!! But unfortunately this is commercial software. Similar open-source analogue Statifier produces broken binaries.

Maybe someone has another idea how to create a single executable from a folder full of shared objects.

UPD: TODO: Check out node-packer and libsquash (no FUSE required!)

Further Size Reduction

I am not a Linux or C++ expert, so for sure I missed some easy "hacks" to reduce size of compiled LibreOffice.

Mostly I just excluded from compilation as much unrelated stuff as possible. And stripped symbols from shared objects.

Here is the list of: available RPM packages and libraries available in AWS Lambda Environment, which can be helpful.

You can also use multi compression level, with upx and then decompress after brotli.

Testing

Update repo for testing. Return before S3 for example, hardcode or generate files to convert and setup variables. Then simply run:

docker run \
 -v "\$PWD":/var/task \
 lambci/lambda:nodejs12.x src/handler.handler

After successful execution, get the resulted files to check the pdfs.

docker ps -a

Find exect container id.

Then execute

docker cp containerId:/tmp/filename.pdf ./filename.pdf

Then check your results locally

License

MIT Β© Vlad Holubiev

About

Run LibreOffice in AWS Lambda to create PDFs & convert documents

https://vladholubiev.com/serverless-libreoffice


Languages

Language:JavaScript 37.6%Language:Shell 35.2%Language:HCL 27.2%