By Jochen Voss, last updated 2014-03-15
When writing a computer program, implementing methods to read data from input files with a complex structure can be surprisingly difficult. For example, if the input data comes from an untrusted source, errors in the input file often need to be dealt with very carefully. If your program is written in Python and if the input data is sufficiently structured (i.e., if the format can be described by a context free grammar), Wisent can help you to implement parts of the input processing of your program.
A cave painting from the cave of Altamira, showing a wisent. The photo was taken from the Wikimedia Commons and is in the public domain.
The parser generator has the following features:
?
(optional elements), *
(zero or more copies) and
+
(one or more copies) operators.The generated parsers have the following features:
uninterestingnodes from the generated tree.
More information can be found in the Wisent Users' Manual.
I called the program Wisent
because the first parser generator I
encountered was Bison and
the Wisent is the European variant of the Bison. Unfortunately, I learned
later that there are at least two other parser generators which use the
name Wisent
:
Semanticpackage for emacs.
allow '-' in symbol names, minor fixes
Comments and multi-line strings in grammar files were broken since version 0.6. Release 0.6.1 fixes this problem.
The source code for more recent, experimental versions of wisent may (or may not) be available on github.com.
Generic installation instructions are in the file INSTALL. On most systems, the following commands should be sufficient:
./configure make make install
Alternatively you can omit the make install
and run Wisent
directly in the build directory.
Please send any suggestions and bug reports to
Jochen Voss. Your message should include the Wisent
version number, as obtained by the command wisent -V
.
Copyright © 2014 Jochen Voss. All content on this website (including text, pictures, and any other original works), unless otherwise noted, is licensed under the CC BY-SA 4.0 license.