Example parsers

Geal opened this issue 9 years ago

We currently have a few example parsers. In order to test the project and make it useful, other formats can be implemented. Here is a list, if anyone wants to try it:

text file formats:
- INI
- FASTQ
- libconfig-like configuration file format
- torrc configuration file
- ISO 8601 dates
- Web archive
- TOML
- bencode
- CSV
- YAML
- CommonMark
audio, video and image formats:
- MP4 (partial implementation)
- GIF
- FLAC
- FLV
- MKV
- OGG
- MPEG TS
- AVI
- PNG
- JPEG
- EXIF
- MP3
document formats:
- torrent files
- TAR
- PDF
- MS-CFB (compound format, used in doc, xls, ppt, cab, msi files)
- GZ
- ZIP
- RAR
- binary PLIST
database formats:
- Redis database files
- Ceph crush maps
network protocol formats:
- IRC
- Pcap-NG
- IP
- Ethernet
- PCAP
- NTP
- SNMP
- TLS
- TCP
- UDP
- DNS
executable formats:
- Portable executables (PE)
- ELF
- GameBoy ROM
crypto related:
- ASN.1
- X.509 certificates
- DER public and private keys
- SSL/TLS packets
- OpenPGP
programming languages:
- Rust
- Lua
- Python
- C
interface definition formats:
- Thrift
- Protobuf
- AIDL

Daniel Fagnan · Answer 1 · Fri Apr 03 2015 12:02:41 GMT+0800 (China Standard Time)

I'm writing a Thrift library for Rust that'll use Nom for both their IDL and the network protocol, so that can be another example (although in a different repo).

Geoffroy Couprie · Answer 2 · Fri Apr 03 2015 15:36:32 GMT+0800 (China Standard Time)

Nice idea, that will be useful! Please notify me when it is done, I will add a link in this list.

Filipe Gonçalves · Answer 3 · Mon Apr 27 2015 18:45:04 GMT+0800 (China Standard Time)

This looks interesting. Is anyone actively working on any of these parsers? I'd like to work on a few of these.

Geoffroy Couprie · Answer 4 · Mon Apr 27 2015 20:54:43 GMT+0800 (China Standard Time)

I have some code for a GIF one at https://github.com/Geal/gif.rs but it is hard to test, since the graphical tools in Piston change a lot.

You can pick any of them. Network packets may be the easiest, since they don't require a decompression phase.

I am using the gif example to see what kind of API can be built over nom. Most of the parsing examples are done as one pass over the data, but often there is some logic on the side, and it is not easy to encode correctly.
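
To illustrate why fixed-layout network headers are a gentle starting point, here is a minimal std-only Rust sketch that reads the four big-endian 16-bit fields of a UDP header in one pass. It does not use nom's API: `UdpHeader`, `be_u16` and `udp_header` are hypothetical names chosen for this example, and the `(rest, value)` return shape only imitates the style of a parser combinator.

```rust
/// The four fixed fields of a UDP header (8 bytes in total).
#[derive(Debug, PartialEq, Eq)]
struct UdpHeader {
    source_port: u16,
    dest_port: u16,
    length: u16,
    checksum: u16,
}

/// Reads one big-endian u16 and returns (remaining input, value).
/// Hypothetical helper, not a nom function.
fn be_u16(input: &[u8]) -> Option<(&[u8], u16)> {
    if input.len() < 2 {
        return None;
    }
    let (head, rest) = input.split_at(2);
    Some((rest, u16::from_be_bytes([head[0], head[1]])))
}

/// Parses the header fields in order and returns (payload, header).
/// Returns None when fewer than 8 bytes are available.
fn udp_header(input: &[u8]) -> Option<(&[u8], UdpHeader)> {
    let (rest, source_port) = be_u16(input)?;
    let (rest, dest_port) = be_u16(rest)?;
    let (rest, length) = be_u16(rest)?;
    let (rest, checksum) = be_u16(rest)?;
    Some((
        rest,
        UdpHeader {
            source_port,
            dest_port,
            length,
            checksum,
        },
    ))
}
```

There is no decompression step and no side state: each field is read in sequence and the unread bytes are handed back to the caller as the payload.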

Elijah Charles · Answer 5 · Fri May 01 2015 21:14:10 GMT+0800 (China Standard Time)

I've started a fastq (http://en.wikipedia.org/wiki/FASTQ_format) parser https://github.com/elij/fastq.rs

Geoffroy Couprie · Answer 6 · Tue May 05 2015 17:12:37 GMT+0800 (China Standard Time)

@elij this is a great idea! Was it easy to do?

Elijah Charles · Answer 7 · Tue May 05 2015 21:43:14 GMT+0800 (China Standard Time)

yup it's a great framework -- though I struggled a bit with eof so I borrowed some code from rust-config (https://github.com/elij/fastq.rs/blob/master/src/parser.rs#L69) -- is there a better solution?

Geoffroy Couprie · Answer 8 · Tue May 05 2015 22:37:44 GMT+0800 (China Standard Time)

yes, eof should be a parser provided by nom, I am just waiting for @filipegoncalves to send a PR 😉

Filipe Gonçalves · Answer 9 · Tue May 05 2015 23:01:26 GMT+0800 (China Standard Time)

Hah, sorry for my silence. I've been busy lately. I just sent a PR (#31).

I will be working on one of these example parsers as soon as I get some spare time. There are some great ideas in here!

Marc-Antoine Perennou · Answer 10 · Sat May 30 2015 05:03:25 GMT+0800 (China Standard Time)

I might give tar a try

Nelson Chen · Answer 11 · Fri Jun 19 2015 10:25:20 GMT+0800 (China Standard Time)

Does this check off PCAP?

https://github.com/richo/pcapng-rs

Geoffroy Couprie · Answer 12 · Fri Jun 19 2015 17:14:35 GMT+0800 (China Standard Time)

pcap-ng and pcap are two different formats, right? It seems the consensus now is to move everything to pcap-ng, though.

OmniTechnoMancer · Answer 13 · Fri Jul 17 2015 17:36:36 GMT+0800 (China Standard Time)

I will try a FLAC parser, need to add quite a few things for it though.

Jan-Erik Rediger · Answer 14 · Fri Jul 17 2015 18:07:15 GMT+0800 (China Standard Time)

ISO8601 is done in https://github.com/badboy/iso8601 (I hope it's mostly correct.)

Geoffroy Couprie · Answer 15 · Fri Jul 17 2015 19:51:32 GMT+0800 (China Standard Time)

ok, it should be up to date. More to come 😄

Stephen Becker IV · Answer 16 · Mon Aug 24 2015 05:49:58 GMT+0800 (China Standard Time)

WARC file format released. https://crates.io/crates/warc_parser

Geoffroy Couprie · Answer 17 · Tue Aug 25 2015 05:53:06 GMT+0800 (China Standard Time)

@sbeckeriv great, thanks!

Cassie Jones · Answer 18 · Tue Sep 15 2015 03:51:29 GMT+0800 (China Standard Time)

It might be informative to try parsing the rust grammar with nom, if nobody has yet. In any case, I'd like to see a few programming languages on that list, since that's my use case.

Geoffroy Couprie · Answer 19 · Tue Sep 15 2015 15:48:24 GMT+0800 (China Standard Time)

@porglezomp programming languages examples would definitely be useful, but the Rust grammar might be a bit too much for the first attempt. Which other languages would you like to handle?

Cassie Jones · Answer 20 · Tue Sep 15 2015 19:11:02 GMT+0800 (China Standard Time)

Yeah, I'm aware of the scale problem of Rust. I don't want to write that one, but I think it's a good holy grail for any parser library written in Rust. I'd like to try parsing the Lua grammar first, I think.

I recommend adding to the list:

Programming Languages
- Rust
- Lua (I'll do this)
- Python (or some other whitespace significant language)
- C

Geoffroy Couprie · Answer 21 · Tue Sep 15 2015 21:28:41 GMT+0800 (China Standard Time)

ok, I added them to the list :)

Chris Krycho · Answer 22 · Tue Nov 17 2015 03:39:30 GMT+0800 (China Standard Time)

You have INI marked as done; do you have a link to it? (I'd love to use this for some tooling I'm hoping to build in 2016; need a good non-trivial example for it, though.)

Jan-Erik Rediger · Answer 23 · Tue Nov 17 2015 03:48:51 GMT+0800 (China Standard Time)

@chriskrycho: https://github.com/Geal/nom/blob/master/tests/ini.rs

Chris Krycho · Answer 24 · Tue Nov 17 2015 03:49:16 GMT+0800 (China Standard Time)

Thanks very much, @badboy!

François Bernier · Answer 25 · Tue Nov 17 2015 06:55:46 GMT+0800 (China Standard Time)

I'll try to make the TOML parser very soon.

Geoffroy Couprie · Answer 26 · Tue Nov 17 2015 07:12:23 GMT+0800 (China Standard Time)

Actually, I think I should rewrite that INI parser, now that more convenient combinators are available.
Also, I should really work on that combinator for space separated stuff

Geoffroy Couprie · Answer 27 · Tue Nov 17 2015 07:12:47 GMT+0800 (China Standard Time)

@fbernier great! Please keep me posted!

l0calh05t · Answer 28 · Tue Nov 17 2015 07:52:23 GMT+0800 (China Standard Time)

Maybe add a simple example for trailing commas in lists? Python has those, but is quite complex. Can't think of a simple example though.

Johannes Hoff · Answer 29 · Tue Nov 17 2015 14:31:55 GMT+0800 (China Standard Time)

That IRC example is no longer using nom. The parser was moved into its own repository: https://github.com/Detegr/RBot-parser

Geoffroy Couprie · Answer 30 · Tue Nov 17 2015 16:30:22 GMT+0800 (China Standard Time)

@l0calh05t to parse something like [a,b,c,] or [a,b,c] ?
@johshoff fixed, thanks

l0calh05t · Answer 31 · Wed Nov 18 2015 00:06:36 GMT+0800 (China Standard Time)

@Geal yes, exactly

Geoffroy Couprie · Answer 32 · Sun Nov 22 2015 23:01:27 GMT+0800 (China Standard Time)

@l0calh05t for [a,b,c], you can parse with delimited!(char!('['), separated_list!( char!(','), alphabetic), char!(']')).
For [a,b,c,], you can have delimited!(char!('['), many0!(terminated!(alphabetic, char!(','))), char!(']')).

A parser that would handle both cases is much trickier.

l0calh05t · Answer 33 · Sun Nov 22 2015 23:20:46 GMT+0800 (China Standard Time)

Handling both is really the more interesting case, and it is what Python needs, for example.

Marc-Antoine Perennou · Answer 34 · Mon Nov 23 2015 01:54:31 GMT+0800 (China Standard Time)

Could there be something like maybe_char!(',') which would read a char, consume it if it's ',' or backtrack if it isn't?

EDIT: actually that's probably what opt!(char!(',')) would do, so you just have to take the one that parses [a,b,c] and stick that before the ']' or am I missing something?

l0calh05t · Answer 35 · Mon Nov 23 2015 02:14:09 GMT+0800 (China Standard Time)

Problem is that won't work unless a lookahead of more than one character is added automatically.

Geoffroy Couprie · Answer 36 · Mon Nov 23 2015 02:49:03 GMT+0800 (China Standard Time)

In fact, it is easier than I thought, but requires some work:

preceded!(
  char!('['),
  terminated!(
    separated_list!(
      char!(','),
      alphabetic
    ),
    terminated!(
      opt!(char!(',')),
      char!(']')
    )
  )
)

opt! will return an option of the result of its child parser (Some if success, None if failure), so it will accept the trailing comma.
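
The same shape can be sketched without macros. The following is a std-only Rust illustration, not nom's API: `alphabetic`, `tag_char` and `list` are hypothetical helpers written for this example. It parses a separated list, then an optional trailing comma, then the closing bracket, and it only commits to a comma inside the list when another item follows it.

```rust
/// Parses one or more ASCII letters; returns (rest, matched).
fn alphabetic(input: &str) -> Option<(&str, &str)> {
    let end = input
        .find(|c: char| !c.is_ascii_alphabetic())
        .unwrap_or(input.len());
    if end == 0 {
        None
    } else {
        Some((&input[end..], &input[..end]))
    }
}

/// Consumes `expected` if it is the next character; returns the rest.
fn tag_char(input: &str, expected: char) -> Option<&str> {
    input.strip_prefix(expected)
}

/// Parses `[a,b,c]` or `[a,b,c,]`; returns (rest, items).
fn list(input: &str) -> Option<(&str, Vec<&str>)> {
    let mut rest = tag_char(input, '[')?;
    let mut items = Vec::new();

    // Separated list: item (',' item)*
    if let Some((r, item)) = alphabetic(rest) {
        items.push(item);
        rest = r;
        // A comma is consumed here only when an item follows it;
        // otherwise it is left for the optional step below.
        while let Some((r, item)) = tag_char(rest, ',').and_then(alphabetic) {
            items.push(item);
            rest = r;
        }
    }

    // Optional trailing comma: keep the input unchanged if absent.
    rest = tag_char(rest, ',').unwrap_or(rest);

    let rest = tag_char(rest, ']')?;
    Some((rest, items))
}
```

With this sketch, `list("[a,b,c]")` and `list("[a,b,c,]")` both produce the items `a`, `b`, `c`, while `list("[a,,b]")` is rejected. As written it also accepts `[]` and `[,]`; a stricter grammar would need an extra check.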

Jan-Erik Rediger · Answer 37 · Mon Nov 23 2015 21:09:07 GMT+0800 (China Standard Time)

I wrote a simplistic bencode parser: nom-bencode.

Not sure if it covers everything (yet).

Félix Saparelli · Answer 38 · Sun Nov 29 2015 07:43:34 GMT+0800 (China Standard Time)

I've started a TOML parser as a learning project: https://github.com/passcod/noml

Jeremy Hull · Answer 39 · Sat Jan 09 2016 11:11:47 GMT+0800 (China Standard Time)

At this point I'm ready to share my flac implementation as an example parser.

jow blew · Answer 40 · Sat Jan 23 2016 20:13:27 GMT+0800 (China Standard Time)

Hey sourrust. Nom looks like a great way to do this.
I am interested in parsing different video formats with nom. If anyone knows of existing Rust libs in this space, I could start porting some to nom. Worth a crack to see how it goes.

I am very curious about using the streaming capabilities of nom. For my use case I want to stream data between servers, manipulate frames, and then fan it back into the main stream.
I would love to get some feedback on some potential gotchas.
Doing this type of work should ultimately feed back into making nom better.

Tom Jakubowski · Answer 41 · Mon Mar 21 2016 11:00:26 GMT+0800 (China Standard Time)

Correct me if I'm wrong, but the linked Redis project doesn't seem to use nom.

Cassie Jones · Answer 42 · Mon Mar 21 2016 12:03:00 GMT+0800 (China Standard Time)

I agree, I checked the history of its Cargo.toml and at no point was nom listed as a dependency. I'm not sure how it ended up on the list, but it looks like it should be taken off.

Jan-Erik Rediger · Answer 43 · Mon Mar 21 2016 17:30:03 GMT+0800 (China Standard Time)

It does in another branch, which is still not merged because time.

Daniel Fagnan · Answer 44 · Tue Apr 05 2016 07:09:52 GMT+0800 (China Standard Time)

@Geal you can remove my Thrift library as an example as I'm no longer using Nom in it.

Joel · Answer 45 · Mon Apr 11 2016 20:56:17 GMT+0800 (China Standard Time)

I've released a TOML parser. It doesn't let you modify everything possible in the document or create documents from scratch, but does correctly parse TOML, report errors, allow some modification and then output the document with comments and whitespace intact.

Nathan Moos · Answer 46 · Wed May 18 2016 01:09:52 GMT+0800 (China Standard Time)

I've started working on a parser for IP, TCP, UDP, and Ethernet headers. It is located at https://github.com/moosingin3space/pktparse-rs.
Warning: there is little to no documentation right now!

David Xu · Answer 47 · Tue Jun 07 2016 07:49:13 GMT+0800 (China Standard Time)

Java class file parser! It is part of a larger class project.

The parser uses helper macros based on #160 to get more backtracking support.

Paul Woolcock · Answer 48 · Fri Jun 17 2016 21:27:15 GMT+0800 (China Standard Time)

Not sure if it's worth putting here or not, but I'm using nom to parse strings for the tracery library I am writing for rust: https://github.com/pwoolcoc/tracery-rs

jethrogb · Answer 49 · Fri Jul 15 2016 07:38:33 GMT+0800 (China Standard Time)

A subset of C, namely C literals and expressions: https://crates.io/crates/cexpr

Gerd Zellweger · Answer 50 · Thu Sep 08 2016 00:44:39 GMT+0800 (China Standard Time)

FYI I used nom to parse the linux perf data format (https://github.com/gz/rust-perfcnt/blob/master/src/linux/parser.rs) in case you want to add it. In comparison to most examples listed here it parses binary data.

Also, it's roughly 25x faster than an equivalent parser written in python ;)

Nick Babcock · Answer 51 · Thu Sep 15 2016 22:00:33 GMT+0800 (China Standard Time)

Boxcars is an example of a Rocket League replay parser with serde serialization. Let boxcars be a good example of Rust code using nom and serde, as extensive examples are hard to come by. While it lacks user-friendly error messages, among other issues, its tests and documentation strive to be thorough.

David Tolnay · Answer 52 · Mon Oct 31 2016 12:12:45 GMT+0800 (China Standard Time)

> Yeah, I'm aware of the scale problem of Rust. I don't want to write that one, but I think it's a good holy grail for any parser library written in Rust.

As of version 0.10.0, syn is now able to parse practically all of Rust syntax. One of my test cases is to parse the entire github.com/rust-lang/rust repo into an AST and print it back out, asserting that the output is identical to the original.

I am technically not using nom but instead a fork which removes the IResult::Incomplete variant. I found that the extra macro code generated to handle Incomplete was more than doubling the compile time for something that I didn't even want. Nevertheless, the code is enough like nom that I think we can check off the box.

Example snippet to parse one arm of a match expression:

named!(match_arm -> Arm, do_parse!(
    attrs: many0!(outer_attr) >>
    pats: separated_nonempty_list!(punct!("|"), pat) >>
    guard: option!(preceded!(keyword!("if"), expr)) >>
    punct!("=>") >>
    body: alt!(
        map!(block, |blk| ExprKind::Block(BlockCheckMode::Default, blk).into())
        |
        expr
    ) >>
    (Arm {
        attrs: attrs,
        pats: pats,
        guard: guard.map(Box::new),
        body: Box::new(body),
    })
));

Geoffroy Couprie · Answer 53 · Tue Nov 01 2016 19:39:01 GMT+0800 (China Standard Time)

@dtolnay syn is an amazing example, thanks for your hard work :)

Geoffroy Couprie · Answer 54 · Tue Nov 01 2016 21:40:02 GMT+0800 (China Standard Time)

@dtolnay could I get your input on #356? It might fix your issues with compile times, so I'd like to get your thoughts on this.

Junfeng Liu · Answer 55 · Fri Dec 23 2016 15:27:19 GMT+0800 (China Standard Time)

I am writing a PDF library using nom to parse PDF syntax. Released v0.1.0 just now.
https://github.com/J-F-Liu/lopdf

Cody Laeder · Answer 56 · Sat Dec 24 2016 03:23:26 GMT+0800 (China Standard Time)

So I've implemented an EDI parser for the ANS standard EDI for work with this. Awesome library, really useful. Sadly that's owned by my employer.

I've started implementing an x64 assembler with nom. I'm really struggling with writing the parser. The main reason is register names have a lot of overlap, and are very short. For example r8, r8w, r11, and r12d. Ideally I want to map these to an enum. map!() makes this easy, but how can I match those terms in nom?

Marc-Antoine Perennou · Answer 57 · Sat Dec 24 2016 16:17:33 GMT+0800 (China Standard Time)

I converted several "keys" to enum values in my brainfuck parser, might or might not be relevant to your needs. See the first parsers defined with "named!" https://github.com/Keruspe/brainfuck.rs/blob/master/src/parser.rs
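
For overlapping names such as r8 and r8w, one common approach is to prefer the longest name that matches, so that r8 never shadows r8w. Below is a std-only Rust sketch of that idea, not nom's API: the `Register` enum, the `REGISTERS` table and the `register` function are hypothetical, and the table only contains the four names from the question. With an ordered-choice combinator, a similar effect is usually obtained by listing the longer names before the shorter ones.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Register {
    R8,
    R8w,
    R11,
    R12d,
}

/// Hypothetical lookup table; a real assembler would list every register.
const REGISTERS: &[(&str, Register)] = &[
    ("r8", Register::R8),
    ("r8w", Register::R8w),
    ("r11", Register::R11),
    ("r12d", Register::R12d),
];

/// Returns (rest, register) for the longest table entry that is a
/// prefix of `input`, or None when no entry matches.
fn register(input: &str) -> Option<(&str, Register)> {
    REGISTERS
        .iter()
        .filter(|(name, _)| input.starts_with(*name))
        .max_by_key(|(name, _)| name.len())
        .map(|(name, reg)| (&input[name.len()..], *reg))
}
```

Here `register("r8w, rax")` picks `Register::R8w` rather than stopping at `r8`. The sketch does not check for a word boundary after the name, so an identifier that merely starts with a register name would still match; that check is left out for brevity.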

Tony Przygienda · Answer 58 · Thu Mar 09 2017 15:04:31 GMT+0800 (China Standard Time)

is there a way (or it would be great if it's possible) to generate EBNF from this? Great package BTW ...

Wilfried Chauveau · Answer 59 · Sat Apr 15 2017 07:53:14 GMT+0800 (China Standard Time)

Hi,
I just pushed a pcap parser : https://github.com/ithinuel/pcap-rs.
It still needs PR #492 to be merged so it can use the official nom crate.

Any feedback is welcome.

Brendan Molloy · Answer 60 · Wed Apr 19 2017 16:31:59 GMT+0800 (China Standard Time)

A parser for the MediaWiki format would be quite useful.

Daniel N. Werner · Answer 61 · Sun Jun 04 2017 05:32:04 GMT+0800 (China Standard Time)

@Geal thanks for an awesome library! I wrote a Wavefront OBJ/MTL 3D mesh parser using it, nom-obj, which I published to crates.io.

Olivier Renaud · Answer 62 · Thu Aug 10 2017 21:20:59 GMT+0800 (China Standard Time)

I wrote a parser for the simple key/value text format .properties, which is a standard for Java configuration files. It uses nom 3.1. Can it be added to the list?

This is the first parser I wrote using a Parser Combinator library. If anyone can review my code I would be delighted. Also, I tried to add error reporting to my code, but I gave up after I tried to insert add_return_error and return_error calls all over the place to no avail (in the branch "error-reporting"). Is there an example of a text parser that reports parsing errors?

Edit: I rewrote my library using Pest instead of Nom, as I find it more suited to parsing a text format. I will definitely use nom if I need to parse a binary format, though.

Henrik Jürges · Answer 63 · Tue Sep 05 2017 18:06:29 GMT+0800 (China Standard Time)

@Geal thanks for this library.
I've implemented a parser for URIs which is part of a larger side project for RDF (n3, ttl,...) parsers. The full ABNF of RFC 3986 is implemented but the pct-encoding is still a bit messy.

Danilo Bargen · Answer 64 · Fri Sep 22 2017 16:44:24 GMT+0800 (China Standard Time)

Here's a parser for ICE candidates SDP (RFC 5245), used for example in WebRTC: https://github.com/dbrgn/candidateparser

Kamil Markiewicz · Answer 65 · Fri Sep 22 2017 17:03:29 GMT+0800 (China Standard Time)

I wrote a Session Initiation Protocol (RFC3261) low-level push parser with API inspired by seanmonstar/httparse (hyper's HTTP parser):
https://github.com/kamarkiewicz/parsip

Jonathan 'theJPster' Pallant · Answer 66 · Thu Jan 11 2018 17:40:23 GMT+0800 (China Standard Time)

I'd be interested in something that could parse SNMP MIB and YANG.

https://en.wikipedia.org/wiki/YANG

Fredrick Brennan · Answer 67 · Sat Jun 02 2018 14:40:22 GMT+0800 (China Standard Time)

The BitTorrent example has been deleted, it seems.

Nicolas Delsaux · Answer 68 · Thu Jun 07 2018 03:42:56 GMT+0800 (China Standard Time)

As a beginner in the Rust world, I'm quite sure I will say something horribly wrong, but is there any planned support for some XML dialects (typically RSS/Atom)?

Daniel N. Werner · Answer 69 · Thu Jun 07 2018 04:24:36 GMT+0800 (China Standard Time)

Nothing at all wrong with asking, and I'm sure someone might want to implement one at some point, but this is a list of example parsers written using nom, rather than a list of formats "supported" by nom. An xml parser would be an excellent idea for learning nom, imo.

Cassie Jones · Answer 70 · Fri Jun 08 2018 02:38:08 GMT+0800 (China Standard Time)

@Riduidel if you're specifically interested in just having parsers for those formats, look at https://github.com/rust-syndication. I don't think there's any nom involved there though.

Mitchell Tannenbaum · Answer 71 · Fri Jun 08 2018 15:43:38 GMT+0800 (China Standard Time)

HTTP: https://github.com/hjr3/weldr/blob/00481f80ae60bd6b312805245c126c168ab77b36/src/http/parser.rs

vandenoever · Answer 72 · Wed Jun 13 2018 14:27:03 GMT+0800 (China Standard Time)

A parser for Turtle. It passes the test suite in 15ms.

https://github.com/vandenoever/rome/tree/master/src/io/turtle

Val Lorentz · Answer 73 · Tue Jun 19 2018 00:21:37 GMT+0800 (China Standard Time)

I wrote a Python parser: https://docs.rs/python-parser/

ibrahim dursun · Answer 74 · Fri Jan 04 2019 18:09:16 GMT+0800 (China Standard Time)

I think Redis database file format parser is not using nom at all. I couldn't find any reference to nom anywhere.

Nelson Chen · Answer 75 · Fri Jan 04 2019 18:17:27 GMT+0800 (China Standard Time)

@idursun Maybe it refers to this old branch from a year before the last update to master. https://github.com/badboy/rdb-rs/tree/nom-parser

saggit · Answer 76 · Tue Mar 12 2019 20:33:24 GMT+0800 (China Standard Time)

Is there any SQL parser?

Mitchell Tannenbaum · Answer 77 · Wed Mar 13 2019 00:37:16 GMT+0800 (China Standard Time)

> Is there any SQL parser?

it'd seem better to me to import it to an sql engine and interact with that data using Diesel. parsing flat sql files seems very limited.

instead of writing a one-off Rust app to do this, you could add diesel bindings to Torchbear, see jazzdotdev/jazz#85 , then make a Speakeasy library for transporting data from your schema using content model in ContentDB.

then, you could develop a lot further beyond.

Wilfried Chauveau · Answer 78 · Tue Mar 19 2019 22:30:34 GMT+0800 (China Standard Time)

@naturallymitchell maybe @saggit was simply looking for something to extract some data from a raw sql dump. Like a one-off log analysis tool. :D

Mark McCaskey · Answer 79 · Wed Jun 19 2019 11:12:33 GMT+0800 (China Standard Time)

I made a GameBoy ROM parser with nom5!
https://github.com/MarkMcCaskey/gameboy-rom-parser
https://crates.io/crates/gameboy-rom

It's extremely simple and doesn't do much, but the crate provides a useful abstraction over the metadata of GameBoy ROMs.

I'll add more optional validation functions to it and refactor my emulator's ROM code to use it soon.

edit:
this post is what inspired me to make this

Mitchell Tannenbaum · Answer 80 · Fri Jun 21 2019 05:14:52 GMT+0800 (China Standard Time)

> It's extremely simple and doesn't do much, but the crate provides a useful abstraction over the metadata of GameBoy ROMs.

@MarkMcCaskey It could even make sense to refactor it then into a generalized library with config files (like, TOML and YAML, and now SANE). Do you think that'd be too much more work?

Daniel N. Werner · Answer 81 · Thu Jun 27 2019 03:09:36 GMT+0800 (China Standard Time)

@Geal - I wanted to post my public suffix domain list parser that I wrote a few months back. I couldn't find a performant library that did what I needed, so I grabbed nom and went to work. https://github.com/dwerner/nom-psl

Mark McCaskey · Answer 82 · Thu Jun 27 2019 03:32:51 GMT+0800 (China Standard Time)

@naturallymitchell

Do you mean specifying the layout of the bytes as data and creating a dynamic data structure from it? That's an interesting idea, but I don't think it'd be too helpful for my use case -- as I see it, the primary value-add of the gameboy rom parser is the data layer that it exposes, which lets the user get things like the game's title as a string or the exact cartridge type and how much ROM and RAM it has as well-named, plain Rust values.

The parser may be implementable with serde deserialize on a repr(C) struct though, which is kind of the reverse of what you're saying, I think... I'm not familiar enough with how serde-derive handles errors though.
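
As a rough idea of what such a data layer looks like, here is a std-only Rust sketch that is not taken from the gameboy-rom crate. `RomHeader` and `rom_header` are hypothetical names, and the offsets used (title at 0x0134..0x0144, cartridge type at 0x0147, ROM size code at 0x0148) are assumptions based on the commonly documented cartridge header layout.

```rust
/// A few header fields exposed as plain Rust values.
#[derive(Debug, PartialEq, Eq)]
struct RomHeader {
    title: String,
    /// Raw cartridge type byte; mapping it to an enum is left out here.
    cartridge_type: u8,
    /// ROM size in bytes, or None for a size code this sketch does not know.
    rom_size_bytes: Option<usize>,
}

/// Extracts the fields above; returns None if the ROM is too short.
fn rom_header(rom: &[u8]) -> Option<RomHeader> {
    // Assumed offsets, see the note above the code block.
    let title_bytes = rom.get(0x0134..0x0144)?;
    let cartridge_type = *rom.get(0x0147)?;
    let rom_size_code = *rom.get(0x0148)?;

    // The title is treated as NUL-padded text. Later cartridges reuse the
    // tail of this area for other flags, which this sketch ignores.
    let end = title_bytes
        .iter()
        .position(|&b| b == 0)
        .unwrap_or(title_bytes.len());
    let title = String::from_utf8_lossy(&title_bytes[..end]).into_owned();

    // Assumed encoding: size = 32 KiB shifted left by the code, for 0..=8.
    let rom_size_bytes = if rom_size_code <= 8 {
        Some((32 * 1024usize) << rom_size_code)
    } else {
        None
    };

    Some(RomHeader {
        title,
        cartridge_type,
        rom_size_bytes,
    })
}
```

The caller gets a `String` and sized integers instead of raw offsets, which is the kind of abstraction described above.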

Jeremy Lempereur · Answer 83 · Sat Jun 29 2019 00:08:19 GMT+0800 (China Standard Time)

Just got a 0.0.1 version of an NMEA-0183 parser using nom 5 https://github.com/YellowInnovation/nmea-0183 . I need to have a look at the docs and guidelines (the code is ugly for now) and refactor it :) I hope to submit a pull request adding a clean version of it to the parsers list soon ! :)

Anatolii Kurotych · Answer 84 · Sat Oct 17 2020 21:48:45 GMT+0800 (China Standard Time)

This is a SIP parser:
https://github.com/armatusmiles/sipcore/tree/master/crates/sipmsg

Torture test: kurotych/sipcore@32040e5 ( https://tools.ietf.org/html/rfc4475#section-3.1.1.1 )

Geoffroy Couprie · Answer 85 · Sat Oct 24 2020 17:35:22 GMT+0800 (China Standard Time)

@armatusmiles thanks, I added it to the list in 2e58a2c

bion howard · Answer 86 · Fri Nov 27 2020 02:12:23 GMT+0800 (China Standard Time)

Please add OpenCypher to the list... a nice way to parse Graph DB queries could enable a wave of innovation in databases. There are zero legit serverless / autoscaling or decentralized graph databases (like you'd get with a CRDT/ORDT backend for an OpenCypher parser). GunJS is fairly close but JavaScript is not ideal for storage IMHO

Mathieu Amiot · Answer 87 · Mon Feb 15 2021 19:59:22 GMT+0800 (China Standard Time)

Wrote a UBJSON parser w/ nom
Pretty early version with just parsing, but it does the job.

https://github.com/OtaK/ubjson

https://crates.io/crates/ubjson

Nils · Answer 88 · Sun Aug 15 2021 01:19:16 GMT+0800 (China Standard Time)

There is a PDF parser here: https://github.com/J-F-Liu/lopdf (it requires using the nom_parser feature).

FWIW, with lopdf, the nom parser is much faster than the default parser

Atmaram Naik · Answer 89 · Thu Dec 23 2021 18:34:49 GMT+0800 (China Standard Time)

I wrote a tool with its own programming language using nom. here is source repo.

Eric · Answer 90 · Fri Feb 18 2022 16:20:41 GMT+0800 (China Standard Time)

The gds2-parser is released at https://crates.io/crates/gds2_io. BTW, my pull request is #1497.

Manu Schiller · Answer 91 · Mon Feb 21 2022 21:05:47 GMT+0800 (China Standard Time)

would it be feasible to write an ecmascript/typescript parser with nom as well? Or would the scope be too big for that?

Alexander Sagen · Answer 92 · Sun Apr 03 2022 06:54:20 GMT+0800 (China Standard Time)

I have written 2 (public) parsers using nom which may be used as examples:

- BitTorrent/bencoding: https://github.com/alexrsagen/rustorrent/blob/4076d0ea689a950021164d2fdd412021519e7c68/src/bencode.rs
- BitTorrent/torrent files (metainfo) parsing using bencode.rs: https://github.com/alexrsagen/rustorrent/blob/4076d0ea689a950021164d2fdd412021519e7c68/src/torrent/metainfo.rs
- MS-CFB: https://github.com/alexrsagen/rs-nomcfb (probably not a perfect implementation, but usable to parse Outlook .msg files and as a basic example for how to write parsers using nom)

Edgar · Answer 93 · Tue Jul 05 2022 17:48:05 GMT+0800 (China Standard Time)

I made a bencode parser (the format used by .torrent files), https://github.com/edg-l/nom-bencode/

MichiRecRoom · Answer 94 · Wed Jul 13 2022 16:42:23 GMT+0800 (China Standard Time)

Hey there! I was wondering why the Rust parser on this list is syn? From what I can tell, syn does not use nom (although it might have in the past).

Since this is a list of examples of parsers built with nom, I don't see why we should be linking to syn here.

Mathieu Amiot · Answer 95 · Wed Jul 13 2022 18:06:21 GMT+0800 (China Standard Time)

@LikeLakers2

> Hey there! I was wondering why the Rust parser on this list is syn? From what I can tell, syn does not use nom (although it might have in the past).
>
> Since this is a list of examples of parsers built with nom, I don't see why we should be linking to syn here.

dtolnay/syn#476

syn was using nom until v0.15, this issue was created 3 years before syn dropped its usage of nom. That's why it's still linked here.

You're absolutely correct that it should be removed though.

Eatgrass · Answer 96 · Tue Feb 14 2023 13:54:15 GMT+0800 (China Standard Time)

mdict-parser is a parser library for the .mdx dictionary file format:
https://github.com/eatgrass/mdict-parser

WeiWenjie · Answer 97 · Thu Apr 13 2023 11:39:01 GMT+0800 (China Standard Time)

crussmap is a parser library and tool for the .chain file format:

https://github.com/wjwei-handsome/crussmap