While working on my JavaCC book I came across a JavaCC grammar for parsing Cobol programs. It's a pretty hefty grammar file - over 50 KB - with a ton of productions. I ran it through JJDoc and you can see that report as well.
I thought a couple of things were interesting:
- The global lookahead setting is 4, which is a lot more than you usually see (the default is 1).
- There are no lexical states (other than DEFAULT, of course).
- Whitespace seems to be handled entirely through MORE. I think that results in a lot of extra Token creation... but maybe that sort of thing is not a big deal these days.
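To make those three observations concrete, here is a tiny JavaCC sketch of the same setup. It is not taken from the Cobol grammar: the parser name WhitespaceDemo, the WORD and DOT tokens, and the Sentence production are all made up for illustration. Only the overall shape (a global LOOKAHEAD of 4, everything in DEFAULT, whitespace matched with MORE) mirrors what I described above.

```javacc
options {
  LOOKAHEAD = 4;   // global lookahead; JavaCC's default is 1
}

PARSER_BEGIN(WhitespaceDemo)   // hypothetical parser name
public class WhitespaceDemo {}
PARSER_END(WhitespaceDemo)

// None of the regular-expression productions below carry a <STATE> prefix,
// so they all belong to the DEFAULT lexical state.

// Whitespace matched with MORE: the matched text is not discarded but is
// carried over and becomes a prefix of the next token's image.
MORE : { " " | "\t" | "\r" | "\n" }

// The more common alternative would be to discard whitespace outright:
// SKIP : { " " | "\t" | "\r" | "\n" }

// Hypothetical tokens, just enough to make the sketch a complete grammar.
TOKEN : {
  < WORD : (["A"-"Z", "a"-"z", "0"-"9", "-"])+ >
| < DOT  : "." >
}

// Hypothetical production: one or more words followed by a period.
void Sentence() : {}
{
  ( <WORD> )+ <DOT> <EOF>
}
```

With SKIP the whitespace never reaches the parser at all, whereas with MORE it rides along in the following token's image, so a grammar built this way does a bit more work per token.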
I tried to contact the author to see if he would mind me putting the grammar on the JavaCC grammars page, but the email bounced. Bernard Pinon, if you see this, nice work, please drop me a line and let me know if this grammar can live on the JavaCC site :-)