2013-12-23 12:44:16 +08:00
# ANTLR v4
2012-02-01 11:28:15 +08:00
2013-12-23 12:44:16 +08:00
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees.
2012-02-01 11:28:15 +08:00
2013-12-23 12:44:16 +08:00
[Wikipedia ](https://en.wikipedia.org/wiki/ANTLR )
[Official site ](http://www.antlr3.org/ )
[ANTLR v3 ](http://www.antlr3.org/ )?
## INTRODUCTION
2012-02-01 11:28:15 +08:00
2013-06-30 03:15:27 +08:00
Hi and welcome to the Honey Badger 4.1 release of ANTLR!
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
## INSTALLATION
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
### UNIX
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
0. [Install Java ](http://www.oracle.com/technetwork/java/javase/downloads/index.html ) (version 1.6 or higher)
2012-09-19 02:25:06 +08:00
2013-01-02 05:43:15 +08:00
1. Download
2013-12-23 12:44:16 +08:00
```sh
2013-01-02 05:43:15 +08:00
$ cd /usr/local/lib
2013-06-30 03:15:27 +08:00
$ curl -O http://www.antlr4.org/download/antlr-4.1-complete.jar
2013-12-23 12:44:16 +08:00
```
Or just [download in browser ](http://www.antlr4.org/download/antlr-4.1-complete.jar ) and put it somewhere rational like `/usr/local/lib` .
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
2. Add `antlr-4.1-complete.jar` to your `CLASSPATH` :
```sh
2013-06-30 03:15:27 +08:00
$ export CLASSPATH=".:/usr/local/lib/antlr-4.1-complete.jar:$CLASSPATH"
2013-12-23 12:44:16 +08:00
```
2013-01-02 05:43:15 +08:00
Is also a good idea to put this in your .bash_profile or whatever your
startup script is.
3. Create aliases for the ANTLR Tool, and TestRig.
2013-12-23 12:44:16 +08:00
```sh
2013-06-30 03:15:27 +08:00
$ alias antlr4='java -jar /usr/local/lib/antlr-4.1-complete.jar'
2013-01-26 03:20:29 +08:00
$ alias grun='java org.antlr.v4.runtime.misc.TestRig'
2013-12-23 12:44:16 +08:00
```
2013-01-02 05:43:15 +08:00
2013-12-23 12:44:16 +08:00
### Windows (Thanks to Graham Wideman)
2013-01-02 05:43:15 +08:00
2013-12-23 12:44:16 +08:00
0. [Install Java ](http://www.oracle.com/technetwork/java/javase/downloads/index.html ) (version 1.6 or higher)
2013-01-02 05:43:15 +08:00
2013-12-23 12:44:16 +08:00
1. Download [antlr-4.1-complete.jar ](http://antlr.org/download/antlr-4.1-complete.jar )
Save to your directory for 3rd party Java libraries, say `C:\Javalib`
2013-01-02 05:43:15 +08:00
2013-12-23 12:44:16 +08:00
2. Add `antlr-4.1-complete.jar` to `CLASSPATH` , either:
2013-01-02 05:43:15 +08:00
* Permanently: Using System Properties dialog > Environment variables >
2013-12-23 12:44:16 +08:00
Create or append to `CLASSPATH` variable
2013-01-02 05:43:15 +08:00
* Temporarily, at command line:
2013-12-23 12:44:16 +08:00
`SET CLASSPATH=C:\Javalib\antlr-4.1-complete.jar;%CLASSPATH%`
2013-01-02 05:43:15 +08:00
3. Create short convenient commands for the ANTLR Tool, and TestRig,
using batch files or doskey commands:
2013-12-23 12:44:16 +08:00
* Batch files (in directory in system `PATH` )
```
2013-01-02 05:43:15 +08:00
antlr4.bat: java org.antlr.v4.Tool %*
run.bat: java org.antlr.v4.runtime.misc.TestRig %*
2013-12-23 12:44:16 +08:00
```
2013-01-02 05:43:15 +08:00
* Or, use doskey commands:
2013-12-23 12:44:16 +08:00
```bat
2013-01-02 05:43:15 +08:00
doskey antlr4=java org.antlr.v4.Tool $*
doskey grun =java org.antlr.v4.runtime.misc.TestRig $*
2013-12-23 12:44:16 +08:00
```
2013-01-02 05:43:15 +08:00
2013-12-23 12:44:16 +08:00
## Testing installation
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
Either launch `org.antlr.v4.Tool` directly:
```sh
2012-09-19 02:25:06 +08:00
$ java org.antlr.v4.Tool
2013-06-30 03:15:27 +08:00
ANTLR Parser Generator Version 4.1
2012-09-19 02:25:06 +08:00
-o ___ specify output directory where all output is generated
-lib ___ specify location of .tokens files
...
2013-12-23 12:44:16 +08:00
```
or use `-jar` option on java:
```sh
2013-06-30 03:15:27 +08:00
$ java -jar /usr/local/lib/antlr-4.1-complete.jar
ANTLR Parser Generator Version 4.1
2012-09-19 02:25:06 +08:00
-o ___ specify output directory where all output is generated
-lib ___ specify location of .tokens files
...
2013-12-23 12:44:16 +08:00
```
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
## Example
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
In a temporary directory, put the following grammar inside file `Hello.g4` :
```g
2012-09-19 02:25:06 +08:00
// Define a grammar called Hello
// match keyword hello followed by an identifier
// match lower-case identifiers
grammar Hello;
r : 'hello' ID ;
ID : [a-z]+ ;
WS : [ \t\n]+ -> skip ; // skip spaces, tabs, newlines
2013-12-23 12:44:16 +08:00
```
2012-09-19 02:25:06 +08:00
Then run ANTLR the tool on it:
2013-12-23 12:44:16 +08:00
```sh
2012-09-19 02:25:06 +08:00
$ cd /tmp
$ antlr4 Hello.g4
$ javac Hello*.java
2013-12-23 12:44:16 +08:00
```
2012-09-19 02:25:06 +08:00
Now test it:
2013-12-23 12:44:16 +08:00
```sh
2012-09-19 02:25:06 +08:00
$ grun Hello r -tree
hello parrt
^D
(r hello parrt)
2013-12-23 12:44:16 +08:00
```
(That `^D` means `EOF` on unix; it's `^Z` in Windows.) The `-tree` option prints the parse tree in LISP notation.
## The Definitive ANTLR 4 Reference
You can by a book [The Definitive ANTLR 4 Reference ](http://pragprog.com/book/tpantlr2/the-definitive-antlr-4-reference )
[bquote]
Programmers run into parsing problems all the time. Whether it’ s a data format like JSON, a network protocol like SMTP, a server configuration file for Apache, a PostScript/PDF file, or a simple spreadsheet macro language—ANTLR v4 and this book will demystify the process. ANTLR v4 has been rewritten from scratch to make it easier than ever to build parsers and the language applications built on top. This completely rewritten new edition of the bestselling Definitive ANTLR Reference shows you how to take advantage of these new features.
[/bquote]
[Book source code ](http://pragprog.com/titles/tpantlr2/source_code )
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
## Addinitional grammars
[This repository ](https://github.com/antlr/grammars-v4 ) is a collection of grammars without actions where the
root directory name is the all-lowercase name of the language parsed
by the grammar. For example, java, cpp, csharp, c, etc...
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
## Author
2012-09-19 02:25:06 +08:00
2013-12-23 12:44:16 +08:00
[Terence Parr ](http://www.cs.usfca.edu/~parrt/ ), parrt@cs.usfca.edu < br />
ANTLR project lead and supreme dictator for life< br / >
[University of San Francisco ](http://www.usfca.edu/ )
2013-06-30 03:15:27 +08:00