“Chat with Brian Regarding Lexers and Parser Generators”

From: Seth Dillingham
In Response To: Top of Thread
Date Posted: Friday, February 2, 2007 1:10:26 PM
Replies: 0
   

(I'm sure with that subject, everybody's just dying to read this one...)

I had a (third) chat with Brian Andresen today about Lexers and Parser Generators, a type of software that's working overtime to suck worse than most email clients.

Brian Andresen 11:06:49 Brian Andresen
remember we were looking for Unicode-savvy parsing tools a while back?
Seth Dillingham 11:06:59 Seth Dillingham
I do
Brian Andresen 11:07:04 Brian Andresen
I'm looking for new parser-generator tools for my project at work
Seth Dillingham 11:09:23 Seth Dillingham
oh man, very interesting
Brian Andresen 11:09:51 Brian Andresen
I suspect that the author is more mindful of capability than performance
still, I plan to download it and try a small grammar with GOLD
I'll also try out ANTLR (again) and ACCENT (http://accent.compilertools.net/)
(why do all of these tools use all-caps names? dunno.)
Seth Dillingham 11:11:18 Seth Dillingham
ANTLR is an acronym, I don't know about the rest
not that it matters much
Brian Andresen 11:11:40 Brian Andresen
yeah
Seth Dillingham 11:12:00 Seth Dillingham
I know ANTLR made some more progress, but I haven't played with it.
heh. Check out the first News Item on the ANTLR home page
Brian Andresen 11:12:59 Brian Andresen
bah, GOLD is Win32-only.
Seth Dillingham 11:13:07 Seth Dillingham
Grr.
This whole class of software is a joke.
Brian Andresen 11:13:24 Brian Andresen
2009?! we're going to have to wait a long time for that next beta.
Seth Dillingham 11:14:09 Seth Dillingham
But the news is in past tense! I think they're probably just thousands of timezones ahead of us. That would explain it.
Brian Andresen 11:14:17 Brian Andresen
heh
yeah, there's not much out there that's free. I've found a bunch of commercial tools, but none of them have even inspired me to request a trial version
Seth Dillingham 11:24:57 Seth Dillingham
Brian Andresen 11:25:11 Brian Andresen
that's very relevant, thank you
Seth Dillingham 11:25:32 Seth Dillingham
Looking at this stuff never makes me happy. :-(
Brian Andresen 11:25:40 Brian Andresen
no kiddin'
well, that just left me with a grand total of zero tools to investigate. lame.
Seth Dillingham 11:26:18 Seth Dillingham
I think, "I could devote a ton of my time to learning the issues and helping to fix these problems. Or I could make a living and have a life, and make do with what I already have."
Brian Andresen 11:26:31 Brian Andresen
yep
Seth Dillingham 11:26:58 Seth Dillingham
But it still makes me nuts. My work could really benefit from a good, Unicode-savvy parser generator.
with a C++ target
Brian Andresen 11:28:01 Brian Andresen
yeah. the thing that got me started on this (for Agilent) was how poorly designed lex/yacc (and flex/bison) are for providing code to be part of a larger project
they were designed for making a standalone executable that doesn't need to do much more beyond the parsing, it seems
Seth Dillingham 11:28:56 Seth Dillingham
they're specifically for feeding a compiler, right?
Brian Andresen 11:29:22 Brian Andresen
our simulator already has six lex/yacc-based parsers in the code, and we end up having to mangle the yy___ symbols and other globals to even just make it link
Seth Dillingham 11:29:57 Seth Dillingham
(a particular kind of compiler/builder, I mean)
Brian Andresen 11:30:06 Brian Andresen
yeah
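
The linking problem Brian describes comes from generated parsers keeping their state and entry points in fixed global names (the yy___ symbols), so two parsers in one program collide unless the names are rewritten. Here is a minimal sketch of the alternative he seems to want, assuming hand-written stand-ins rather than real lex/yacc output: each parser keeps its state in members and lives in its own namespace. The namespaces `sum_grammar` and `word_grammar`, the `Parser` classes, and both toy grammars are hypothetical and only illustrate the idea.

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <utility>

namespace sum_grammar {
// Hypothetical parser #1: accepts "1,2,3" and produces the sum.
class Parser {
public:
    explicit Parser(std::string text) : text_(std::move(text)) {}

    // Returns true on success and stores the sum in `result`.
    bool parse(long& result) {
        long total = 0;
        do {
            long value = 0;
            if (!number(value)) return false;   // expected a number here
            total += value;
        } while (accept(','));
        if (pos_ != text_.size()) return false; // trailing garbage
        result = total;
        return true;
    }

private:
    bool accept(char c) {
        if (pos_ < text_.size() && text_[pos_] == c) { ++pos_; return true; }
        return false;
    }
    bool number(long& value) {
        const std::size_t start = pos_;
        while (pos_ < text_.size() &&
               std::isdigit(static_cast<unsigned char>(text_[pos_]))) {
            value = value * 10 + (text_[pos_] - '0');
            ++pos_;
        }
        return pos_ > start;
    }
    std::string text_;     // plays the role of a global input buffer
    std::size_t pos_ = 0;  // plays the role of a global scanner position
};
}  // namespace sum_grammar

namespace word_grammar {
// Hypothetical parser #2: same class and method names, different language.
// It counts whitespace-separated words.
class Parser {
public:
    explicit Parser(std::string text) : text_(std::move(text)) {}

    // Returns true if at least one word was found; stores the count in `result`.
    bool parse(long& result) {
        long words = 0;
        bool inWord = false;
        for (char c : text_) {
            const bool space = std::isspace(static_cast<unsigned char>(c)) != 0;
            if (!space && !inWord) ++words;
            inWord = !space;
        }
        result = words;
        return words > 0;
    }

private:
    std::string text_;
};
}  // namespace word_grammar
```

Both classes can be linked into one executable, and several instances of each can run side by side, because nothing is shared through global symbols. For generated code, the usual workaround is the renaming Brian mentions: bison's `-p` and flex's `-P` options replace the `yy` prefix with one you choose.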
Seth Dillingham 11:30:17 Seth Dillingham
Hey, do you mind if I post this conversation on [tw]? I'll hide your handle.
Brian Andresen 11:31:36 Brian Andresen
oh right, and my other gripe was memory management. Suppose we're parsing through a line and allocating memory for various things as we go. Then we hit a syntax error. There are ways to design the rules to do error recovery, but designing the rules to allow error recovery to clean up all allocated memory is not obvious at all.
go for it
Seth Dillingham 11:32:48 Seth Dillingham
thanks
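
Brian's memory-management gripe is about allocations made partway through a line that nobody owns once a syntax error interrupts the parse. One common way to sidestep the problem in C++ is to give every allocation an owner from the moment it is created, so that abandoning the parse frees everything automatically. The sketch below assumes a hand-written parser, not yacc-generated code; `Node`, `SyntaxError`, `parseLine`, and `tryParse` are hypothetical names, and the `live` counter exists only to make leaks observable.

```cpp
#include <cctype>
#include <cstddef>
#include <memory>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical AST node. The static counter tracks how many nodes are alive.
struct Node {
    static int live;
    long value;
    explicit Node(long v) : value(v) { ++live; }
    ~Node() { --live; }
};
int Node::live = 0;

struct SyntaxError : std::runtime_error {
    using std::runtime_error::runtime_error;
};

// Parses a line such as "1 2 3" into owned nodes.
// Throws SyntaxError on any character that is neither a digit nor whitespace.
std::vector<std::unique_ptr<Node>> parseLine(const std::string& line) {
    std::vector<std::unique_ptr<Node>> nodes;
    std::size_t pos = 0;
    while (pos < line.size()) {
        const unsigned char c = static_cast<unsigned char>(line[pos]);
        if (std::isspace(c)) { ++pos; continue; }
        if (!std::isdigit(c)) {
            // Stack unwinding destroys `nodes`, which frees every Node
            // allocated earlier on this line.
            throw SyntaxError("unexpected character at offset " +
                              std::to_string(pos));
        }
        long value = 0;
        while (pos < line.size() &&
               std::isdigit(static_cast<unsigned char>(line[pos]))) {
            value = value * 10 + (line[pos] - '0');
            ++pos;
        }
        nodes.push_back(std::make_unique<Node>(value));
    }
    return nodes;
}

// Returns the number of nodes parsed, or -1 after a syntax error.
int tryParse(const std::string& line) {
    try {
        return static_cast<int>(parseLine(line).size());
    } catch (const SyntaxError&) {
        return -1;
    }
}
```

After `tryParse("1 2 x 3")` returns -1, `Node::live` is back to zero even though two nodes had been built before the error. Generated parsers need a different mechanism because the values sit on the parser's own stack; bison's `%destructor` directive is aimed at that case.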


