antlr

Commit Graph

Author	SHA1	Message	Date
Mike Lischke	b2daab9477	antlr4cpp on Windows - Solved all compilation issues. Updated the antlr4cpp library project. At the moment we only build a static lib. Need to add exports for a DLL. Since we want this library to be compatible with VS 2013 still, we cannot use std::rethrow_with_nested(), hence we do a simple unnested throw in such cases. Starting with VS 2015 this works fully then. - Added demo application. - Added parser generation script.	2016-04-23 11:03:30 +02:00
Mike Lischke	e85385785e	Simplified use of the demo project. - The needed ANTLR jar is provided now, so it's not needed to build it yourself. - The generate.sh script has been updated to use the new jar. - Small update of the readme too.	2016-04-21 17:57:33 +02:00
Mike Lischke	e8325623d9	Finished checking all memory allocations. - All allocations are now checked for proper deallocation. - Ran LLVM analyzer over the runtime but it found mostly valid stuff and did not find non-freed allocations I left undeleted by intention. So it's not worth much. - Added move and copy assignment operator overloading, as well as a copy c-tor to ATN class to avoid a copy (and to be able to free content properly) after deserialization. - Some clean up.	2016-04-20 17:51:24 +02:00
Mike Lischke	f292d14abc	The parser is parsing for the first time. - Removed ultra simple test grammar + parser. No longer needed. - Removed long list of keywords from (regular) test grammar. - Fixed a number of toString() methods to get better debug output. - Moved Ref typedefs from Declarations.h to the individual classes as defining them on the forward declarations totally confuses the XCode debugger. - Removed reference to the owning ATN in an ATNState. We cannot guarantee to have the correct address there due to the way the states are created. The reference is not needed anyway. - ATNDeserializationOptions now has verifyATN set by default (as in the Java target). - Had to add a workaround for a weird situation: static initialization in ATNDeseralizer stopped working for no apparent reason. Need to investigate this. - Added a few support methods to the CPPUtils, mostly to ease debug output creation. - Added console listener by default to the listeners list (as done in the Java target). - Fixed translation mistakes in the CommonTokenStream class. - Fixed some memory leaks and exception handling bugs.	2016-04-17 13:13:15 +02:00
Mike Lischke	eb0241f767	Another refactoring round. - Removed a few unused classes. - More raw pointers to smart pointers conversion: RuleContext, ParserRuleContext, ParseTree, Token, ParseTreeWalker, Tree... - BitSet is now used directly instead of all those dynamic allocations and is a derived class instead of a composite. - Replced ATNState equals with == operator overload. - Correct wrong iterator over ATNConfigsets. - Added utilitiy function that mimics Java's generic toString().	2016-04-13 19:05:56 +02:00
Mike Lischke	6f344b376b	Fixed all ATNConfigSet memory leaks.	2016-04-09 17:39:38 +02:00
Mike Lischke	7f8ad7bd2d	Applied consistent exception model and fixed is<> helper. - Exceptions are now consistently thrown by value and captured by reference. C++11 exception_ptr and nested_exception are used when exception references are neeeded or when implementing the equivalent of Java's nesting. - The is<> helper didn't handle properly (const) references, which is now explicitly handled. Added new unit tests for that. - Fixed a number of places where a catch all was used to implement a "finally" (which hides exceptions). - Changed exceptions to hold (temporary) raw pointers instead of shared pointers, as otherwise it is tried to free wrapped pointers which might just be references to static objects. Might later be updated again when we continue with removing raw pointers. - Some smaller fixes. - The generated simple parser now runs through without any error (yet, it doesn't do anything useful). - ANTLR C++ target template: - Added getListener and genVisitor bool members to ANTLR's LexerFile + ParserFile classes, so can use them in the template. - Made addition of listener #include dependent on the new genListener member, which allows to run parser generation without listeners/visitors.	2016-04-09 16:36:52 +02:00
Mike Lischke	091c40899c	Finished fixing all C++ TODOs and Java to C++ converter warnings. Fixed again a few mem leaks.	2016-04-06 18:30:27 +02:00
Mike Lischke	8f2e95516b	Removed StringBuilder + stringconverter code. Both can easily be implement using STL code.	2016-04-06 16:28:05 +02:00
Mike Lischke	3d17066e0c	Removed a few unnecessary pointer casts.	2016-04-06 15:12:47 +02:00
Mike Lischke	d1b59ca5af	Next overhaul (more TODOs fixed) - Added an even simpler grammar to ease debugging while getting the lib into a working state. - Added helper template is<> to ease frequent type checks (for value types, ref types and shared_ptr). Added some unit tests for that as well. - Changed the MurmurHash::hashCode() function to take shared_ptr as this is the only variant we need. Had to change the MurmurHash unit tests for that. - Removed conflicting IntStream::_EOF (and other variants). We use the C runtime EOF value instead. - Changed all references to semantic contexts, prediction context and the prediction context cache to use shared_ptr<>. Created *Ref typedefs to simplify usage. - Adjusted the C++ string templates for that. - Fixed a number of memory leaks + some cleanup.	2016-04-06 15:07:25 +02:00
Mike Lischke	1ca5b38868	Fixed some TODOs + exceptions. - Reworked the exception hierarchy to conform with the Java hierarchy (where we mimic that). Ultimative base class is std::exception, which uses std::string (char* actually) for messages, so all exceptions use std::string for that as well. Consider that as first step to rework the entire lib to use std::string instead of std::wstring (with utf-8 for full Unicode support). - Removed ASSERTException + TODOException and fixed the places where they were used. - Removed ANTLRException, which was only an intermediate layer without an equivalent on Java side. - Replaced some equals() calls by == (with defined operator overloading). - Enhanced Arrays::equals() to ensure it compiles only if the actual types being compared support the != operator (both value + reference types). - Made the Recognizer class template free by using plain polymorphism. Some adjustments were need also in the Cpp template to support that. Could convert the .inl file to .cpp then.	2016-03-31 18:32:44 +02:00
Mike Lischke	29dedd17c4	New unit tests + template enhancements + more memory handling work - Added IntervalSet unit tests. Fixed a few bugs found by that. - Enhanced the demo grammar so that we use as many as possible template rules from Cpp.stg. Still not fully done it seems. - Fixed bugs in size determination for arrays (vectors now). - Simplified PredictionContext and SemanticContext (one template parameter less). - Removed no longer used Utils.h/cpp. Fixed CPPUtils.h/cpp. - Extended LexerXXCommand template rules to take a new grammar parameter (code gen has been updated) as we need this context in the Cpp target. This change requires to update all existing templates! Cannot do here as this is an old revision. - Some cleanup.	2016-03-30 20:11:05 +02:00
Mike Lischke	baef9b0b32	New unit tests for Interval + MurmurHash. While testing Interval() and Interval::of() I found that the latter is twice as slow as the normal object creation. Seems caching single element intervals doesn't have the same impact as in Java (quite the opposite), so I removed Interval::off and the interval cache. The MurmurHash implementation was actually for a 32bit platform, so I added a 64 bit version too (stripped down from 128 bit MurmurHash3). Tests cannot directly check the correctness of the algorithm, but duplicate checks over 300K hashs (for short input, which is more prone to duplicates than longer input) showed there are no duplicates. So I take it that the code is good. Fixed a hash creation bug in PredictionContext.cpp.	2016-03-28 18:15:50 +02:00
Mike Lischke	3f78367457	Next overhaul - Added first real unit test set and enable code coverage collection in XCode (for ANTLRInputStream). - Reworked ANTLRFileStream::load, which is now more flexible (supports Unicode BOM + 3 possible encodings), can load from Unicode file names and has almost no platform code. - Enabled strict data size and sign checks in XCode (clang) and fixed a million places... - Started converting int to size_t where it makes more sense. - Started working on const correctness. - Fixed a ton of memory leaks. - The ATN and ATNConfigSet classes now entirely work as value types. Same for Interval(Set). These seem to be the most critical data structures (ATNConfig + ATNState are pending). - The abstract IntSet class is gone now. - Murmur hash code now works with size_t instead of int (need to add unit tests for that). - Fixed a number of TODOs and other smaller things. - The Cpp template now properly handles grammar rule return values.	2016-03-27 22:01:03 +02:00
Mike Lischke	bc81acba06	An attempt to cut the Gordian knot called Java generics, in C++. - Reworked the ATNConfigSet + the config lookup implementation it uses. The new implementation no longer needs the hand written Array2DHashSet class but instead relies now on std::unordered_set with custom hasher and comparer classes. - Fixed a bug where the ATNConfigSet was deriving from std::set while in the original Java code it only implements the Set interface (not the config set itself is a set but the config lookup is). As a consequence all iterations over ATNConfigSet now iterate over ATNConfigSet->configLookup. - Removed the Any class as it didn't solve the problems we had mind. - Removed the no longer necessary Array2DHashSet, AbstractEqualityComparer and ObjectEqualityComparer classes. - Instead there is a new ConfigLookup implementation with a templated config lookup implementation. - Removed ATNConfig::equals, as this is already implement in the == operator overloading. So the operator is used instead where needed.	2016-03-26 11:01:51 +01:00
Mike Lischke	e97820a27e	Reworked handling of ATN instances throught the code + definition of IRecognizer. ATNs are top level structures, which are created and kept by parser/lexer classes (or their simulator equivalents). Hence there are now value types in their controlling class and passed around as const &. IRecognizer was a template class without real need, which has been changed to make it a simple interface easily usable without having to find C++ hacks for fancy Java wildcard generics.	2016-03-24 11:49:44 +01:00
Mike Lischke	2aa40c779e	Removed the need for a separate VectorHelper class + other improvements. Some cleanup too.	2016-03-22 17:55:57 +01:00
Mike Lischke	9006d241fa	Introduced the Any class (based on some public domain code). The Any class is loosly modelled after boost::Any and allows us to use equals() and hashCode() functions to be used where we have no common base class (like Java's Object class). By introducing this class we can replace all void* occurances that would otherwise not work.	2016-03-21 20:50:36 +01:00
Mike Lischke	6df5d025bf	Complete formatting overhaul. - Reformatted every single file to have a consistent indentation style using only space chars, with 2 chars per indentation. Reduced huge indentation due to deep namespace nesting by not indenting namespaces. - Reduced #include usage to a minimum. - Made copyright header the first entry in all files. - Moved the previously mac-only prefix file (antlrcpp-Prefix.h) to the runtime. It can now be used by all platforms and includes all necessary standard headers. - Removed a number of unused files.	2016-03-20 17:21:46 +01:00
Mike Lischke	fec65477d8	ATN deserialization + some initialization and memory leak fixes - ATN deserialization finally works. - Changed a number of pointer to STL classes to just the STL classes and pass them around by const & where necessary.	2016-03-19 17:18:06 +01:00
Mike Lischke	11571fa092	A few more adjustments to make the merged changes work with this revision of antlr.	2016-03-18 15:27:17 +01:00
Mike Lischke	1672bc0739	Added all changes done so far. Since we are here on a very old revision I cannot simply merge, so all files have manually been copied and we have no history for these changes.	2016-03-18 14:22:42 +01:00
Terence Parr	6f48625618	Merge pull request #380 from parrt/master get last not first when get() finds multiple matching nodes.	2013-12-20 12:48:23 -08:00
Terence Parr	2d7b0b4178	intellij git missed these files	2013-12-20 12:47:58 -08:00
Terence Parr	6b2817f8bb	get last not first when get() finds multiple matching nodes.	2013-12-20 12:47:19 -08:00
Terence Parr	64d10cd52f	Merge pull request #379 from parrt/master update change list	2013-12-20 12:37:33 -08:00
Terence Parr	2ff3bb6f52	update change list	2013-12-20 12:37:08 -08:00
Terence Parr	8c5d088eb7	Merge pull request #378 from sharwell/polish Tree patterns polish	2013-12-19 17:24:42 -08:00
Terence Parr	74f9745265	Merge pull request #377 from sharwell/atn-serializer ATN serializer	2013-12-19 17:16:20 -08:00
Sam Harwell	a2ba59d0ac	Use ATNDeserializer methods instead of deprecated ATNSimulator methods	2013-12-19 19:07:25 -06:00
Sam Harwell	fb1880d82c	Move ATNSerializer to runtime	2013-12-19 19:07:24 -06:00
Sam Harwell	bc59f30857	Use ATNDeserializer methods instead of deprecated ATNSimulator methods	2013-12-19 19:07:23 -06:00
Sam Harwell	7f15889d92	Make utility methods in ATNDeserializer static	2013-12-19 19:07:22 -06:00
Sam Harwell	5710eff8f8	Fix small warnings in XPath	2013-12-19 19:06:24 -06:00
Sam Harwell	8449b9258f	Updated documentation and API encapsulation for tree patterns	2013-12-19 19:06:23 -06:00
Sam Harwell	40bbd66231	Updated documentation for Token and TokenSource	2013-12-19 19:06:22 -06:00
Sam Harwell	2a9a716c53	Remove unnecessary methods ParseTreeMatch.getText() and failed() (use getTree().getText() and !succeeded() instead)	2013-12-19 19:06:22 -06:00
Sam Harwell	72675075cf	Remove unnecessary testing constructor	2013-12-19 19:06:21 -06:00
Sam Harwell	45fd53bf2c	Remove unused method Lexer.nextTokenOrRuleToken	2013-12-19 19:06:20 -06:00
Sam Harwell	75b8174dc8	Clean up the result caching for getTokenTypeMap and getRuleIndexMap	2013-12-19 19:06:20 -06:00
Sam Harwell	df61690758	Clean up the caching of ATN instances with bypass alternatives	2013-12-19 19:06:19 -06:00
Terence Parr	2618aa335a	Merge pull request #376 from parrt/master fix null pointer bug with rule "a : a;"	2013-12-19 16:37:43 -08:00
Terence Parr	9ca6bf9bd3	fix null pointer bug with rule "a : a;"	2013-12-19 16:35:37 -08:00
Terence Parr	4e8353dea4	Merge pull request #365 from parrt/master convert toMap usage to parser method calls	2013-11-25 09:41:11 -08:00
Terence Parr	4b5cb78716	convert toMap usage to parser method calls	2013-11-25 09:40:05 -08:00
Terence Parr	0992aa856d	Merge pull request #362 from parrt/tree-patterns Add tree patterns	2013-11-25 09:30:14 -08:00
Terence Parr	bd91dc166d	add getTokenTypeMap(), getRuleIndexMap() to recognizer. Gen new fields for that an ATN with bypass alts. Then methods for that: getATNWithBypassAlts(). Big changes to interface for ParseTreeMatch; create Parser.compileParseTreePattern() method. Convert rule names to rule indexes.	2013-11-24 14:04:46 -08:00
Terence Parr	4c52a103e1	cleanup	2013-11-22 11:31:59 -08:00
Terence Parr	b2ec85d14d	updated comments, cleaned up the API, made helper routines.	2013-11-22 11:08:16 -08:00

1 2 3 4 5 ...

2475 Commits All Branches Search

2475 Commits

All Branches