EXE — DepTypes Language with Encodings

This document should drop you to EXE/OM stack immediately. Groupoid Infinity creates a new programming language with dependent types called EXE with small provable dependent core called OM. OM is an implementation of Calculus of Constructions (CoC), the pure lambda calculus with dependent types. It can be compiled (code extraction) to bytecode of Erlang virtual machines BEAM and LING. EXE is an implementation of Calculus of Inductive Constructions (CiC) that lives on top of CoC OM model. You may think of EXE as AST transformation of higher language (with HITs) to OM.

Why new Dependent Language?

No Fixpoint and Induction in Core. We came up with pure CoC core having predicative and impredicative universe hierarchies and macro extensions. Other MLTT cores have additional axioms like Fixpoint and Induction (and even more) — something we strive to escape because it leads to the complex core. No, we don't have Fixpoint, and yes, we implemented Induction principle in pure CoC.

Extensible Language Design. Encoding of inductive types is based on categorical semantic of compilation to CoC. All other syntax constructions are inductive definitions, plugged into the stream parser. AST of the CoC language is also defined in terms of inductive constructions and thus allowed in the macros. The language of polynomial functors (data and record) and core language of the process calculus (spawn, receive and send) is just macrosystem over CoC language, its syntax extensions.

Changeable Encodings. In pure CoC we have only arrows, so all inductive type encodings would be Church-encoding variations. Most extended nowadays is Church-Boehm-Berrarducci the encoding which dedicated to inductive types. Another well known are Scott (laziness), Parigot (laziness and constant-time iterators) and CPS (continuations) encodings.

Proved Categorical Semantic. There was modeled a math model (using higher-order categorical logic) of encoding, which calculates (co)limits in a category of (co)algebras built with given set of (de)constructors. We call such encoding in honor of Lambek lemma that leads us to the equality of (co)initial object and (co)limit in the categories of (co)algebras. Such encoding works with dependent types and its consistency is proved in Lean model.

Native Erlang Language with own shell

You can switch with no stress between Erlang and EXE shell in the same user_drv process. EXE shell is implememted as exe_repl in 120 LOC using io:request API and {get_until,unicode,_,_,_,_} message of Erlang IO protocol. No custom user_drv or group process is used. Color aware pretty printing exe_pretty took same size.

OM — Compact Core of CoC

In repository OM you may found following parts of core:

Parser
Typechecker
Eraser (could be improved)
Code Extractor

OM ships with different modes (spaces of types with own encodings), or preludes you may find in priv directory. They are selectable with om:mode("normal"), e.g.

General Preludes

normal

This is minimal practical prelude similar to Morte's base library of Gabriel Gonzalez. For modeling inductive constructions we use here plain Church (or Boehm-Berrarducci if you wish), we have here basic I/O monads: IO (free monad, for limited I/O) and IOI (free comonad, for infinitary I/O, long-term processes). The generated code is being sewed with Erlang effects that are passed as parameters to pure functions.

new-setoids

This is an implementation of Setoid structure, that provides us Equality. However we switched lately to more compact poset encoding.

posets

This is implementation of non-reflexive partial order sets which has more compact representation than setoids for our needs. It has only Bool, Empty and Unit encoded just to show the general idea. Dependent eliminator of Bool you can found here https://github.com/groupoid/om/tree/master/priv/posets/Data/Bool/

Note: all these folders (modules) are encoded in plain CoC in OM repository to demonstrate you the basic principles how things work. Later all these should be written in EXE languages and translated to OM automatically. You may think of OM as the low-level typed assembler of type theory.

Paradox Preludes

girard, hurkens, russell are preludes dedicated to demonstration of appropriate models of paradoxes which arise in type theory and show the inconsistency of type theory with impredicative hierarhy of universes. You also can play with it in pure CoC and its variations.

EXE — Expressive Language of CiC

EXE is top level, user language with CiC semantics and minimal yet useful syntax wich is subject to change. EXE parser is implemented as LALR-(1) grammar using Erlang's leex and yecc applications, and translates to OM terms, each term is a file on a filesystem and folder is a module or compilation unit (BEAM files).

Technically it is useful to distinguish different layers of EXE:

Internal Core with record used as structures, modules, and namespaces for the entire language. It transparently translates to OM as Contexts (named types and terms).
Metaprogramming Encodings layer which allows expanding macros to Internal Core. Macros are used to encoding programming that hides implementation details of CPS, Parigot or Church encodings. However, our EXE encodings are a bit different encodings with categorical semantics, proven in Lean model.
End-user Top-level EXE Language with powerful primitives data and codata that uses underlying encoding macros. In the future with inductive-inductive types and HITs. Likely Top Language would be a superset of the Internal Core sharing most of the parser.

Macrosystems

macro.new

Proptypes of general macros built using poset approach. Here you can find different encodings for basic types Nat, Bool, List, Prod, Sum along with packing macros and even Free Monad.

smart-simpleton

Most compact, final model of encodings with recursor and induction, the result of all theoretical EXE findings.

How it works?

Recursor

There is a belief that recursor (non-dependent eliminator) in Type Theory is a weaker property than induction principle (dependent eliminator). At the same time from category theory, we know that Universal Property defines the object uniquely. In the case of an initial object in the category of algebras, the initiality could be defined by recursor. That means that all properties of algebra follow from its initiality, as a case, it is possible to get the recursor from induction. There is a sensitive moment here, all categorical constructions are being formulated with defined equality on morphisms, in type theory the equality is the built-in type that could have extended properties. Simplify we could say that we can get recursor from induction without equality, and with proper equality, we could get induction from recursor.

Fibrations

Mechanism of getting induction principle from equality is based on the presentation of dependent types through fibrations. Hereby dependent type (D: B → Type) is defined as (p: Sigma B P → B) which projects dependent pair to the first field. In topology, such approach is called fibration. To the other direction for a given morphism (p: E → B) which we understands as fibration with projection p, we could get its dependent type as (D: B → Type) by calculation in every point (b: B) its image of projection p by using equality on elements of B. In type theory besides dependent pair Sigma also used the dependent product Pi. In encoding of dependent types with fibrations there is a correspondence between elements of dependent and morphism-fibrations for projection p: such (s: B → E) that s * p = I. The example of this implementation could be seen in

EXE as Macro
OM as expanded term Sectioning

Induction

The input for induction is a predicate — dependent type encoded with (p: E → B). Induction needed additional information for the predicate. The type of induction is defined by set of inductive constructors. Induction is just a statement that on E we have the structure of F-algebra of the inductive type. Now we could apply recursor to E getting the map (I → E) from the initial object which in fact the section (fiber bundle) of fibration and thus defines the dependent function which is a proved value of induction principle. The example for Bool could be found in

EXE Bool
OM Induction

Summarizing we encode types of source lambda calculus with objects of selected category, dependent types with fibrations, dependent function as fibrations, inductive types as limits of identity functors on the category of F-algebras.

OM A HUM

zaoqi-unsafe / exe