A wrapper over Caffeine cache library supporting suspending functions with respect for structured concurrency

This library is a wrapper over Caffeine, a popular cache library. It adds support for suspending functions, so that you can use it with coroutines.

Usage

To create a cache, use Caffeine builder, but then create it using buildSuspending method.

val cache: SuspendingCache<String, String> = Caffeine.newBuilder()
    // ...
    .buildSuspending<String, String>()

Then you can use it as a regular cache, but with suspending function get. It takes a key and a function that will be executed, if the key is not present in the cache. The function will be executed only once, even if there are multiple calls to get for the same key.

var calls = 0
suspend fun request(key: String): String {
    calls++
    delay(1000)
    return "Result for $key"
}

val result1 = cache.get("ABC", ::request)
// (1 sec)
println(result1) // Result for ABC

val result2 = cache.get("ABC", ::request)
println(result2) // Result for ABC

val result3 = cache.get("DEF", ::request)
// (1 sec)
println(result3) // Result for DEF

println(calls) // 2

Library also supports all the features of Caffeine, like expiration time, refresh, etc.

Dependency

Add the dependency in your module build.gradle(.kts):

// build.gradle / build.gradle.kts
dependencies {
    implementation("com.github.MarcinMoskala:caffeine-suspending:<version>")
}

Add it in your root build.gradle(.kts) at the repositories block:

// build.gradle
repositories {
    // ...
    maven { url 'https://jitpack.io' }
}

// build.gradle.kts
repositories {
    // ...
    maven("https://jitpack.io")
}

This library intentionally do not expose Caffeine as a transitive dependency, so each user can choose the version of Caffeine that he wants to use. That is why to use this library, you also need to add Caffeine dependency to your project:

dependencies {
    implementation("com.github.ben-manes.caffeine:caffeine:3.1.5")
}

Why this library?

There are no well implemented libraries that support suspending functions in cache. The most popular one is Aedile, but is has essential design flaws. It does not support structured concurrency and cancellation, it starts its calls on an external scope instead of using caller scope. It is also bound to specific Caffeine version.

Design choices

It is not an easy task to make proper design choices for a cache. My observation is, that most of the creators do not even consider most of the problems. I assume, that the typical use case is to have a cache for a remote resource (API or DB). Here are the most important problems worth considering, and my answers to them, that are implemented in this library:

What scope should be used for suspending requests?

Request should be executed on the suspending function caller's scope. Thanks to that, scope specificities set for caller are preserved for the request. For example, if the caller is includes a name, or a context keeping some data, the request will also have it.

fun request(key: String): String {
    delay(1000)
    return "Result(${coroutineContext[CoroutineName]?.name})})"
}

withContext(CoroutineName("Request1")) {
    val result = cache.get("1", ::request)
    // (1 sec)
    println(result) // Result(Request1)
}

withContext(CoroutineName("Request2")) {
    val result = cache.get("1", ::request)
    println(result) // Result(Request1)
}

cache.invalidate("1")

withContext(CoroutineName("Request3")) {
    val result = cache.get("1", ::request)
    // (1 sec)
    println(result) // Result(Request3)
}

withContext(CoroutineName("Request4")) {
    val result = cache.get("2", ::request)
    // (1 sec)
    println(result) // Result(Request4)
}

However, it is essential, that a cancellation of the caller that started a request should cancel this request, but the first caller waiting for the result should start the request with its context.

suspend fun request(key: String): String {
    delay(1000)
    return "Result(${coroutineContext[CoroutineName]?.name})})"
}

val job = launch(CoroutineName("Request1")) {
    cache.get("1", ::request)
}

val res = async(CoroutineName("Request2")) {
    delay(10)
    cache.get("1", ::request)
}

delay(500)
job.cancel()
val result = res.await()
// (1.5 sec) <- We waited 0.5 for the first request, until it was cancelled, and the second request started, and then we waited 1 sec for the second request to finish.
println(result) // Result(Request2)

What should be the result of a call to get method, when its "request" function throws an exception?

When a request fails, the caller should be acknowledged about it, to show appropriate information to user.

suspend fun request(key: String): String {
    delay(1000)
    throw RuntimeException()
}

val result = runCatching { cache.get(key, ::request) }
println(result) // Failure(java.lang.RuntimeException)

Consider two requests for the same key. The second one will wait to the result of the first one. The first one sends a request that fails. What should be the result of the second call?

If two requests for the same key are made, the second one will wait for the result of the first one. If the first one fails, the second one should also fail.

var isFirst = true
fun request(key: String): String {
    delay(1000)
    if (isFirst) {
        isFirst = false
        throw RuntimeException()
    } else {
        return "Result"
    }
}

launch {
    val result1 = runCatching { cache.get(key, ::request) } // (1 sec)
    println(result1) // Failure(java.lang.RuntimeException)
}
launch {
    delay(500) // (0.5 sec)
    val result2 = runCatching { cache.get(key, ::request) }
    // (0.5 sec)
    println(result2) // Failure(java.lang.RuntimeException)
}

The reason behind it is simple: Consider that 10 clients are requesting the same resource at the same time. If the first one fails, the second one should also fail, because the resource is not available. It would make little sense to make next calls request one after another. If we wanted such behavior, we should use a retry mechanism. We want to make a new request when a cororoutine asks for the resource after the first request is finished, what is explained in the next point.

Consider you have a read that starts a request that fails. Then you have another read before value expire time. Should it automatically fail, or should it start the request again?

A cache might keep values for a long time, and we do not want to treat a failing request as a permanent failure. We want to retry the request, when a coroutine asks for the resource after the first request is finished. In other words, a failing request should not be treated as a valid cache entry.

var isFirst = true
fun request(key: String): String {
    delay(1000)
    if (isFirst) {
        isFirst = false
        throw RuntimeException()
    } else {
        return "Result"
    }
}

val result1 = runCatching { cache.get(key, ::request) }
// (1 sec)
println(result1) // Failure(java.lang.RuntimeException)
val result2 = runCatching { cache.get(key, ::request) }
// (1 sec)
println(result2) // Result

griffio / caffeine-suspending

A wrapper over Caffeine cache library supporting suspending functions with respect for structured concurrency

Usage

Dependency

Why this library?

Design choices

About

Languages