allow api calls using a file transfer pipe
- should the caller be notified about the request status as it happens
- eg. when server 2 acks
- how to encode response body if it's not json (i.e. binary or non-utf8 text)
- include content type and whether it's base64-encoded? or just store as latin1 and rely on escapes?
- support for polling and pulling instead of callbacks
- requires much more state tracking
- can be a caching wrapper (with ttl) built over the callback system, but vice-versa is also true
- could also be an event-based system that we build an http layer over
- checksum or sign?
- checksum - no need for secrets
- sign - safe against spoofed files dropped by 3rd parties
- encryption optional?
- encrypted - protects http body, like tls should, but needs shared secrets
- plaintext - allows virus scans to do their job
- symmetric or public key
- logging / auditing / metrics
- metrics / statistics
- usage / users / byte size histogram
- speed / throughput / latency
- error percentages
- ignore auditing for now
- dedupe?
- requires stored state on server 2
- how to ack the ack?
- or send with a sequence id and ack and increment last seen
- or send the ack 3 times (in 3 separate files) and accept the overhead of resends
- or use a crdt / vector clock / ratchet-like algo to count acks seen on both sides
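A minimal sketch of the sequence-id variant (names hypothetical): server 2 tracks the highest sequence id it has processed per sender, so duplicates and resends are dropped without acking the ack. This assumes sequential ids and would need the tracked state persisted on server 2, as noted above.

```python
# Dedupe sketch: server 2 keeps the highest sequence id seen per sender;
# anything at or below it is a duplicate and gets dropped (but can be re-acked).
last_seen: dict[str, int] = {}  # sender id -> highest processed sequence id

def accept(sender: str, seq: int) -> bool:
    """Return True if this message is new; False if it's a duplicate/resend."""
    if seq <= last_seen.get(sender, 0):
        return False  # already processed; safe to ack again without reprocessing
    last_seen[sender] = seq
    return True
```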
- heartbeat?
- what if we want to run multiple instances in parallel over the same folders?
- need to identify sender/recipient pairs
- can also allow for any-to-any messaging, as long as it's whitelisted?
- collation / fragmentation (handle on layer 2)
- files > 1gb tend to be truncated
- 500mb was the sweet spot for the mirroring service
- files < 1mb create overhead, and < 10kb create fairly significant overhead (by proportion)
- corruption is rare, truncation is the most common issue by far
- uuid or sequential id?
- uuid - more state, but easier to implement
- sequential - less state stored (single int64), more bandwidth efficient, lower latency
- sign and encrypt?
- probably start with json, stuffed into a jwt, and nested into a jwe (sign-then-encrypt)
- signing needs to be done separately for each message, and for the packet header and control
- to allow partial success for truncated packets
- or maybe use hmac instead of hash? blake allows this
- make sure the sender and recipient are in the signed portion (as well as the filename)?
- maybe not yet
- or use a streaming encryption wrapper under the gzipfile?
- also needs a key encapsulation header
- maybe use encapsulated random key for hmac signing?
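The keyed-hash idea above can be sketched with stdlib BLAKE2, which supports keyed mode directly (so no separate HMAC construction is needed). The per-packet random key would then be encapsulated for the recipient; that encryption step is not shown here.

```python
import hashlib, os

def mac_packet(payload: bytes, key: bytes) -> bytes:
    # BLAKE2b in keyed mode acts as a MAC on its own.
    return hashlib.blake2b(payload, key=key, digest_size=32).digest()

# Encapsulated-key idea: fresh random key per packet, MAC the payload with it,
# then encrypt the key for the recipient (key encapsulation not shown).
key = os.urandom(32)
tag = mac_packet(b"packet header + body", key)
```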
- subfolders?
- server-2-uuid/server-2--server-1--sequence-id.json.jwt.jwe
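A tiny helper matching the naming scheme above (recipient's uuid as subfolder, both endpoints plus the sequence id in the filename); the argument names are illustrative. Zero-padding the sequence id would additionally make directory listings sort chronologically.

```python
def outbox_path(recipient_uuid: str, sender: str, recipient: str, seq: int) -> str:
    # One subfolder per recipient uuid; filename carries recipient, sender,
    # and sequence id, mirroring server-2-uuid/server-2--server-1--seq.json.jwt.jwe
    return f"{recipient_uuid}/{recipient}--{sender}--{seq}.json.jwt.jwe"
```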
- retry partial message success?
- this is not the same as out-of-order acks (see also tcp sack)
- may be useful because the most common error is truncation
- rate limiting
- maximum messages simultaneously in transit?
- maximum bandwidth?
- maximum calls from some callee / bytes per day or something
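All three limits above could share one mechanism; a minimal token-bucket sketch (names hypothetical), where `cost` is 1 per message for the in-transit/call limits or the byte size for the bandwidth limit:

```python
class TokenBucket:
    """Minimal token bucket: refills `rate` tokens/sec, bursts up to `capacity`."""

    def __init__(self, rate: float, capacity: float):
        self.rate, self.capacity = rate, capacity
        self.tokens = capacity
        self.last = 0.0

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill based on elapsed time, then try to spend `cost` tokens.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```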
- transfer optimization by calculating bandwidth (based on throughput / latency)
- see also TCP Vegas, which attempts something similar, meaning it's not entirely unreasonable to attempt
- maybe also calculate error rate?
- truncation stats and sizes to optimize packet size?
- priority queue?
- how to handle nacks?
- not stateful and may need to be retransmitted
- but we don't want to cause an avalanche of nacks, which could happen if a nack message is nacked
- also it's optional since we'll eventually hit retransmit timeout
- congestion control? <- no, assume the channel has an infinite queue
- maybe ignore fairness for now, just hog the channel
- consider a model based approach like bbr
- but since we can get the actual network diagram maybe just optimize it for that?
- might be easier to send an ECN flag from the side receiving errors
- pessimistic retransmits? <- no
- with enough nack stats we can determine if we should pessimistically assume packet truncation/loss
- meaning we retransmit data multiple times by default, without waiting for timeout
- can also optimize the nack retransmit number so they're at least 99% likely to be received
- and optimize the packet size to reduce truncation / maximize goodput
- not necessary (yet)
- how to estimate bandwidth delay product
- need to transmit last sent message?
- should clock_self be the latest packet in flight?
- or should that be a different stat?
- also need to guess rtt, meaning either we want synced clocks on both ends or we really take the full rtt
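The bandwidth delay product estimate is just arithmetic once throughput and rtt are measured; a back-of-envelope sketch (the 10 MB/s and 30 s numbers are illustrative assumptions, the 500 MB packet size is the mirroring-service sweet spot noted above):

```python
def window_size(bandwidth_bytes_per_s: float, rtt_s: float, packet_bytes: int) -> int:
    # BDP = bandwidth * RTT; keep roughly that many bytes in flight to fill the pipe.
    bdp = bandwidth_bytes_per_s * rtt_s
    return max(1, round(bdp / packet_bytes))

# e.g. 10 MB/s effective throughput with a 30 s folder-sync rtt gives a
# 300 MB BDP; at 500 MB packets that's ~1 packet in flight.
```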
- sliding window of transmissions?
- congestion window
- receive window
- splitting up large files?
- easier to handle by splitting up messages instead
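A minimal sketch of splitting at the message level (fragment shape is an assumption): each fragment carries its index and the total count so the receiver knows when the message is complete.

```python
def fragment(message: bytes, max_size: int) -> list[tuple[int, int, bytes]]:
    # Split one logical message into (index, total, chunk) fragments so each
    # file stays under the size where truncation becomes likely.
    chunks = [message[i:i + max_size] for i in range(0, len(message), max_size)]
    return [(i, len(chunks), c) for i, c in enumerate(chunks)]

def reassemble(frags: list[tuple[int, int, bytes]]) -> bytes:
    assert len(frags) == frags[0][1], "missing fragments"
    return b"".join(c for _, _, c in sorted(frags))
```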
- compressing files?
- gzip?
- error correction?
- nope not for now, just accept the data loss and retransmit
- housekeeping: message processing and removal?
- can remove acked messages from outbox
- can remove from sent when other lamport clock exceeds it
- can remove double-acked from inbox
- (extension) use "processing start timestamp" flag to multithread processing of received messages wi`th timeout
- (extension) use "processed" flag or clock to determine which messages can be removed
- layer 0 - unreliable file transfer
- some folder that sometimes pushes files into the other folder
- assume the folder is shared among multiple tenants, so use subfolders with uuid names
- also speeds up file listing
- only ways to organize data are by subfolder and filename
- error correction & encryption & completeness check & compression should be handled here
- layer 1 - reliable secure message log replication (rediscovering tcp from first principles)
- bounded message size, maybe up to 100mb
- maybe send it in a framed format so we can concat multiple short messages using a greedy algorithm
- base the replication algo on a lamport clock since it's easier to reason about
- use keyed hashes with a random key? store the encapsulated key in the packet?
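The framed format mentioned above can be as simple as a length prefix per message; a sketch with a 4-byte big-endian prefix (the greedy packing decision of which messages to batch together is not shown):

```python
import struct

def pack_frames(messages: list[bytes]) -> bytes:
    # Each frame is a 4-byte big-endian length prefix followed by the payload,
    # so several short messages can ride in one file.
    return b"".join(struct.pack(">I", len(m)) + m for m in messages)

def unpack_frames(blob: bytes) -> list[bytes]:
    out, i = [], 0
    while i < len(blob):
        (n,) = struct.unpack_from(">I", blob, i)
        out.append(blob[i + 4:i + 4 + n])
        i += 4 + n
    return out
```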
- layer 2 - http proxy
- allows http requests to be split into multiple messages if they're too large
- requires callback url
- include full schema+creds+url+query+params, headers, timeout?, verb, cookie
- optional frontend layer - ttl cache
- allow user to poll and pull instead
- optional backend layer - oauth cache
- cookies or client id/credentials
- store and refresh tokens
- data (request) (can be compressed?)
- complete http request details, including files
- caller's callback url
- caller's ip? (for x-forwarded-for)
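An illustrative shape for the request message as json (every field name here is an assumption, not a fixed schema; the body is base64-encoded to cover the non-utf8 case raised at the top):

```python
import json

request_msg = {
    "verb": "POST",
    "url": "https://internal.example/api/v1/items",
    "headers": {"content-type": "application/json"},
    "timeout_s": 30,
    "callback_url": "https://server-1.example/callbacks/42",
    "caller_ip": "10.0.0.5",     # re-emitted as x-forwarded-for on the far side
    "body_b64": "eyJrIjoidiJ9",  # base64 when the body isn't utf-8 json
}
encoded = json.dumps(request_msg).encode()
```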
- data (response) (can be compressed?)
- complete response details, including files
- caller's callback url
- callee's ip or other details?
- round trip time?
we're currently at step 2 of this process:
- encrypt, sign
- jwcrypto
- python-jose
- streaming encryption layer
- reference for http proxy
- alternative: use a custom binary format, handle signing and encryption manually
- maybe use a known format?
- protobuf / flatbuffers
- cbor / messagepack
- avro / parquet / pickle / ion (amazon) / thrift
- message format a bit like jwe / jwt / jws (jose)
- header
- data (signed and encrypted with random key and iv)
- encrypted random key, iv
- hmac with random key
- error correction codes
- raptorq
- par2cmdline
- reed-solomon
- just append nulls (after a 0xFF end flag) since we only really get truncation errors
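A sketch of the null-padding trick (block size is an arbitrary choice here): truncation chops the trailing nulls first, and a missing 0xFF end flag reveals that the payload itself was cut. A payload whose last surviving byte happens to be 0xFF would slip through, so this is a cheap heuristic, not a checksum.

```python
def seal(payload: bytes, block: int = 4096) -> bytes:
    # Append a 0xFF end flag, then null-pad to a block boundary; truncation
    # eats the padding before it reaches the payload.
    framed = payload + b"\xff"
    pad = (-len(framed)) % block
    return framed + b"\x00" * pad

def unseal(blob: bytes) -> bytes:
    stripped = blob.rstrip(b"\x00")
    if not stripped.endswith(b"\xff"):
        raise ValueError("truncated: end flag missing")
    return stripped[:-1]
```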
- binary encoding - maybe try base85? slower but more space efficient, and we're probably network limited
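The overhead difference is easy to check with the stdlib: base85 encodes 4 bytes into 5 characters (25% overhead) versus base64's 3-into-4 (~33%).

```python
import base64, os

raw = os.urandom(60)
b64 = base64.b64encode(raw)  # 4 chars per 3 bytes -> ~33% overhead
b85 = base64.b85encode(raw)  # 5 chars per 4 bytes -> 25% overhead
```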
- uuid7 instead of timestamp + packet sequential id? requires 16 bytes instead of 4 + 4 though
- see also BBRv2 paper