ScalaZ and Cats

ScalaZ and Cats are libraries which provide Functional Programming constructs for Scala (i.e. Monad n' friends).

This repository is a comparison of these two libraries by someone who isn't predisposed to either one. If you're a contributor to either library and notice a discrepancy here, please let me know!

We seek to answer the following question:

Should I use ScalaZ or Cats?

The answer is, of course, "it depends". What's your use-case?

Use Cases
Benchmarks
Usage Considerations
Library Health and Ecosystems
- Project Pulses
- Sub-libraries
  - Shims
- Resources
  - ScalaZ
  - Cats

Use Cases

I want to train my Scala team in Functional Programming fundamentals

Functional Programming in Scala, a book by Runar Bjarnson and Paul Chiusano, is a valuable book for learning about Functional Programming in general. Otherwise, both ScalaZ and Cats have specific books and professional training available.

ScalaZ:

Functional Programming for Mortals by Sam Halliday (for FP beginners - assumes an OO Scala background)
Training by the Fantasyland Institute

Cats:

Scala with Cats by Noel Welsh and Dave Gurnell (assumes 1+ years of Scala experience)
Training by Underscore Consultants

I'm writing a performance-sensitive application

Lean toward Cats, it tends to be faster in aggregate. Are you using a database? Consider Doobie, which uses Cats.

I want to improve quality-of-life for my Scala devs

Any dedicated application of FP concepts will help you organize and simplify your code. Libaries like simulacrum, decline and circe can provide immediate wins by drastically cutting down boilerplate. The latter two can be used natively with Cats, or via ScalaZ with the shims library.

I'm writing a library/application that works with a complex DSL

You probably need Recursion Schemes, which are supplied by the Matryoshka library in ScalaZ-land.

I want to port a well-known, general-purpose Haskell library to Scala

You'd be a champ to write a backend for both ScalaZ and Cats, but know that Cats has a head start and has a nice set of ported libraries already.

I care about which stays truer to Haskell

ScalaZ does. Its core has a larger API, provides more features up-front, and tends to keep Haskell function names and operators (e.g. <*>).

I care about which has more industry backing

According to this survey, ScalaZ does.

I hear the `IO` Monad can help me logically organize my code

Both ScalaZ 7 and Cats have a effects subpackage which provides an IO type. They both help you contain "real world" side-effects into smaller areas of your code base, freeing the rest of it to purity (referential transparency). They also help you wrangle IO-based Exceptions.

Cats' IO is currently faster in aggregate. However, an overhaul of scalaz-effects with many orders of magnitude of improvement in performance is promised for ScalaZ 8, so you may want to wait for that if IO is a great concern to you.

Futures suck and I hate JVM thread pools. Help?

Wait for ScalaZ 8.

Just gimme Monads

Then either is fine, you can flip a coin.

Benchmarks

Benchmarks were performed using the JMH plugin for SBT. Vanilla Scala and Haskell results are also included where applicable.

Results

All times are in nanoseconds. Kittens and scalaz-deriving were used to derive Eq instances.

scalaz-deriving v0.9.1-SNAPSHOT
kittens 1.0.0-RC1

Benchmark	ScalaZ 7.2.16	ScalaZ 7.2.17	Cats 1.0.0-RC1	Vanilla Scala	Haskell 8.0.2
`Eq` - same `[Int]`	78,653	11.5	2.5	2.4	3,974
`Eq` - different `[Int]`	80,508	5,753	3,983	5,180
`Eq` - `while` w/ `Int`	3,226	3,223	199	198
`Eq` (derived) - same `[Foo]`	79,150	10.2	2.8	2.5
`Eq` (derived) - different `[Foo]`	80,737	2,945	38,630	2,071
`Eq` (derived) - `while` w/ `Foo`	470,323	463,595	40,113	5,335
`Eq` (hand-written) - same `[Foo]`	26,673	10.1	2.8	2.5
`Eq` (hand-written) - different `[Foo]`	26,638	2,962	7,835	2,071
`Eq` (hand-written) - `while` w/ `Foo`	10,771	3,156	5,341	5,335
`Show` - `[Int]`	1,000,757		43,633	41,079	46,540
`Show` - `String`	216.6		3.2	2.8	199.4
`Foldable.fold` on `[Int]`	3,355		5,026	7,939	3,330
`Foldable.fold` on `[Maybe Int]`	10,740		12,506		15,440
`State` - `get`	17.9		33.3		4.1
`State` - `>>=`	90		139.1		10.43
`State` - `flatMap`	63.9		133.3
`State` - countdown	4,259,320		2,071,480		6,069
`StateT` - countdown			4,572,499		24,070
`Applicative` - sum `(<*>)`	31,709		32,132		22,140
`Applicative` - sum (cartesian)	50,431		33,638
`IO` - recurse 1000	117,569		48,558		907.7
`IO` - recurse 10000	1,183,352		503,889		9,095
`IO` - recurse 100000	11,671,581		5,167,355		89,860

Observations

Cats' type-safe equality checking is faster than Vanilla Scala. So, there seems to be no reason not to use Cats' === in all cases.
Cats' type-safe String rendering via Show is as fast as Vanilla toString. So .toString should be avoided.
At the small scale (i.e. a single >>=), ScalaZ tends to be faster.
At aggregate scale, Cats tends to be faster.
Neither library performs well on recursive Monadic operations. Haskell is two to three orders of magnitude faster in this regard. In particular, GHC heavily optimizes both IO and State operations.

Caveat

As of this writing (2017 November), ScalaZ 8 is still under development but promises significant performance improvements for their IO Monad. The benchmarks above will have to be reran when it is released.

Usage Considerations

API Accessibility

Up front, Cats has much more documentation and usage examples. Their website is good for this. However, given that they both have blog posts and books written about them, overall the availability of resources should be about equal between the two libraries.

The Cats import story is consistent - for most tasks you only need:

import cats._            /* To refer to top-level symbols like Monad */
import cats.implicits._  /* To get typeclass instances and operators */

ScalaZ has a bit more flexibility with their imports, but honestly you can just avoid that and do:

import scalaz._
import Scalaz._

and you'll get all data types, typeclasses, instances, and operators. If you're willing to do that, then the import experience for both libraries is the same.

Features

ScalaZ: `IList`

From its Scaladocs:

Safe, invariant alternative to stdlib List. Most methods on List have a sensible equivalent here, either on the IList interface itself or via typeclass instances (which are the same as those defined for stdlib List). All methods are total and stack-safe.

Between being invariant and avoiding connection to Scala's enormous Collections API, IList manages to be the fastest general-purpose Scala container type to iterate over. Specifically, it handles tail-recursive algorithms with pattern matching (thus mimicking .map and .foldLeft) twice as fast as vanilla List. Only an Array of Int or Double via a while loop can iterate faster.

ScalaZ: `Maybe`

From its Scaladocs:

Maybe[A] is isomorphic to Option[A], however there are some differences between the two. Maybe is invariant in A while Option is covariant. Maybe[A] does not expose an unsafe get operation to access the underlying A value (that may not exist) like Option[A] does. Maybe[A] does not come with an implicit conversion to Iterable[A] (a trait with over a dozen super types).

The implication is that Maybe should be safer and slightly more performant than Option. Ironically, many ScalaZ methods that yield an "optional" value use Option and not Maybe.

Where Monad Transformers and concerned, ScalaZ provides both MaybeT and OptionT.

ScalaZ: `EphemeralStream`

From its Scaladocs:

Like scala.collection.immutable.Stream, but doesn't save computed values. As such, it can be used to represent similar things, but without the space leak problem frequently encountered using that type.

The dream of lazy Haskell lists realized? Maybe. With EphemeralStream (or EStream as the cool kids call it), even the "head" value is lazy. So one would use EStream when there's no guarantee that even the first value might be used.

How does it perform?

All times are in microseconds.

Benchmark	List	IList	Vector	Array	Stream	EphemeralStream	Iterator
`foldLeft`	33.3	31.3	68.9	56.4	56.9	163.1	55.4
`foldRight`	69.2	89.5	228.39	55.1	Stack Overflow	Stack Overflow	147.6
Tail Recursion	45.9	24.1			69.8

We see similar slowdowns for chained higher-order ops as well. Looks like building in the laziness has its cost.

Typeclasses

Typeclasses are a powerful programming construct to relate data types that have common behaviour. They describe how a type should behave, as opposed to what a data type is (re: Object Oriented programming).

Both ScalaZ and Cats provide the "standard" typeclasses, namely Monoid, Functor, Applicative, and Monad, as well as a wealth of others for more specialized work. In general, the ScalaZ typeclass hierarchy is larger than the Cats' one.

Custom Typeclasses

Scala doesn't yet have first-class support for typeclasses. While it's very possible to create trait/object structures that represent a typeclass, there is no built-in syntax for it. The library simulacrum helps greatly with this:

package mylib

import simulacrum._

@typeclass trait Semigroup[A] {
  @op("<>") def combine(x: A, y: A): A
}

This significantly reduces boilerplate. At compile time, this tiny definition is expanded into everything necessary to use .combine (or its optional operator <>!) as an injected method on your A type. Here's how to write an instance:

case class Pair(n: Int, m: Int)

object Pair {
  implicit val pairSemi: Semigroup[Pair] = new Semigroup[Pair] {
    def combine(x: Pair, y: Pair): Pair = Pair(x.n + y.n, x.m + y.m)
  }
}

This way, whenever Pair is in scope, its Semigroup instance will also be automatically visible. Defining the Semigroup[Pair] somewhere else makes it an Orphan Instance, which runs the risk of burdening your users with confusing imports.

Now extend some top-level package object of yours like:

package object mylib extends Semigroup.ToSemigroupOps

And then full use of your typeclass is just one import away!

import mylib._

scala> Pair(1, 2) <> Pair(3, 4)
res0: Pair = Pair(4, 6)

Instance Derivation

In Haskell, automatic typeclass instance derivation is frequent:

-- The usuals - many more can be derived.
data User = User { age  :: Int
                 , name :: Text
                 } deriving (Eq, Ord, Show, NFData, Generic, ToJSON, FromJSON)

Fortunately, both ScalaZ and Cats provide a similar mechanism. Nobody wants to write boilerplate!

scalaz-deriving exposes the @deriving macro for ScalaZ typeclasses:

@deriving(Equal, Show, Encoder, Decoder)
case class User(age: Int, name: String)

Where Encoder and Decoder are from play.json.

Kittens provides shapeless-based "semi-auto" derivation for Cats:

case class User(age: Int, name: String)

object User {
  implicit val userEq: Eq[User] = cats.derive.eq[User]
  implicit val userShow: Show[User] = cats.derive.show[User]
}

Which requires more typing, but has more features, like auto-derivation of higher-kinded things like Functor.

For Circe Encoder and Decoder instances specifically, the following was already possible:

import io.circe.generic.JsonCodec

@JsonCodec
case class User(age: Int, name: String)

Monadic Recursion

If you're not careful, Monadic Recursion with ScalaZ can blow the JVM stack. For instance, the following will "just work" with Cats:

def countdown: State[Int, Int] = State.get.flatMap { n =>
  if (n <= 0) State.pure(n) else State.set(n - 1) *> countdown
}

Which in ScalaZ would blow the stack for n greater than a few thousand. The proper ScalaZ equivalent is:

def trampolineCountdown: StateT[Trampoline, Int, Int] = State.get.lift[Trampoline].flatMap{ n =>
  if (n <= 0) StateT(_ => Trampoline.done((n,n)))
  else State.put(n - 1).lift[Trampoline] >> trampolineCountdown
}

Trampoline seems like an implementation detail, but it's exposed to the user here.

A quote from Cats:

Because monadic recursion is so common in functional programming but is not stack safe on the JVM, Cats has chosen to require tailRecM of all monad implementations as opposed to just a subset.

So tailRecM gets us stack safety - if you can figure out how to implement it correctly. I tried for Tree and was not successful.

John de Goes on ScalaZ 8:

tailRecM will not be a function on Monad, because not all monads can implement it in constant stack space.

So ScalaZ chooses lawfulness over convenience in this case.

Library Health and Ecosystems

Project Pulses

As of 2017 November 6.

Project	Releases	Watchers	Stars	Forks	Commits	Prev. Month Commits	ScalaJS	Scala Native
ScalaZ	106	257	3312	534	6101	45	Yes	Yes
Cats	22	174	2118	493	3280	51	Yes	No

ScalaZ's numbers are higher, but that's to be expected as it's an older project. Otherwise the projects seem to be about equally active. Notably missing is the lack of Scala Native support in Cats.

Sub-libraries

The diagram below looks one-sided, but must be taken with a grain of salt. As projects, Cats and ScalaZ have different aims. Cats has a small, tight core and espouses modularity. ScalaZ frames itself as a batteries-included standard library for FP in Scala. ScalaZ certainly has a larger and more featureful API than Cats at current. This will be increasingly true for the up-coming ScalaZ 8, which aims to provide the equivalent functionality of Dogs, Monocle, and Matryoshka directly. It also plans to provide low-level concurrency primitives which see no analogue in Cats or Vanilla Scala.

That in mind, here is a simplified view of their library ecosystems:

Notes:

Origami is a port of Haskell's foldl library
Atto is a port of Haskell's attoparsec library
Decline is a port of Haskell's optparse-applicative library
Refined is a port of Haskell's refined library
Monocle is a port of Haskell's lens library

Shims

Libraries like circe, atto and decline are immense standard-of-living improvements for Scala developers. Luckily, the shims library allows us to use them via ScalaZ, too. Likewise, Matryoshka becomes usable via Cats. From the shims project:

Shims aims to provide a convenient, bidirectional, and transparent set of conversions between scalaz and cats, covering typeclasses (e.g. Monad) and data types (e.g. \/). By that I mean, with shims, anything that has a cats.Functor instance also has a scalaz.Functor instance, and vice versa.

Here is a working example:

package shimmy

import scalaz._
import Scalaz._
import shims._
import com.monovore.decline._  /* Depends on Cats */

object Shimmy extends CommandApp(
  name = "shimmy",
  header = "Demonstrate how shims works.",
  main = {
    /* These are `decline` data types with `Applicative` instances from Cats */
    val foo = Opts.option[String]("foo", help = "Foo")
    val bar = Opts.option[Int]("bar", help = "Bar")
    val baz = Opts.flag("baz", help = "Baz").orFalse

    /* These are ScalaZ operators that use ScalaZ's `Applicative` */
    (foo |@| bar |@| baz) { (_, _, _) => println("It worked!") }
  }
)

Resources

The tendency is for Cats to have better documentation and examples up-front, while ScalaZ has an extensive examples subpackage.

ScalaZ

Functional Programming for Mortals by Sam Halliday (book)
Learning ScalaZ by Eugene Yokota (blog series)
Cheatsheet (typeclass usage and imports)
ScalaZ README
Scaladocs
ScalaZ Gitter

Cats

Cats Website
Scala with Cats by Noel Walsh and Dave Gurnell (book)
Scaladocs
Herding Cats by Eugene Yokota (blog series)
Cats Gitter

nmeln / scalaz-and-cats

ScalaZ and Cats

Use Cases

I want to train my Scala team in Functional Programming fundamentals

I'm writing a performance-sensitive application

I want to improve quality-of-life for my Scala devs

I'm writing a library/application that works with a complex DSL

I want to port a well-known, general-purpose Haskell library to Scala

I care about which stays truer to Haskell

I care about which has more industry backing

I hear the `IO` Monad can help me logically organize my code

Futures suck and I hate JVM thread pools. Help?

Just gimme Monads

Benchmarks

Results

Observations

Caveat

Usage Considerations

API Accessibility

Features

ScalaZ: `IList`

ScalaZ: `Maybe`

ScalaZ: `EphemeralStream`

Typeclasses

Custom Typeclasses

Instance Derivation

Monadic Recursion

Library Health and Ecosystems

Project Pulses

Sub-libraries

Shims

Resources

ScalaZ

Cats

About

Languages

ScalaZ and Cats

Use Cases

I want to train my Scala team in Functional Programming fundamentals

I'm writing a performance-sensitive application

I want to improve quality-of-life for my Scala devs

I'm writing a library/application that works with a complex DSL

I want to port a well-known, general-purpose Haskell library to Scala

I care about which stays truer to Haskell

I care about which has more industry backing

I hear the IO Monad can help me logically organize my code

Futures suck and I hate JVM thread pools. Help?

Just gimme Monads

Benchmarks

Results

Observations

Caveat

Usage Considerations

API Accessibility

Features

ScalaZ: IList

ScalaZ: Maybe

ScalaZ: EphemeralStream

Typeclasses

Custom Typeclasses

Instance Derivation

Monadic Recursion

Library Health and Ecosystems

Project Pulses

Sub-libraries

Shims

Resources

ScalaZ

Cats

About

Languages

I hear the `IO` Monad can help me logically organize my code

ScalaZ: `IList`

ScalaZ: `Maybe`

ScalaZ: `EphemeralStream`