simdjson

Commit Graph

Author	SHA1	Message	Date
Daniel Lemire	696b0e29e4	Fixing issue 961	2020-06-23 10:47:32 -04:00
Daniel Lemire	dada5090b0	These compilers are insane.	2020-06-22 20:25:55 -04:00
Daniel Lemire	1c4593c648	These compilers are really pedantic.	2020-06-22 20:04:37 -04:00
Daniel Lemire	e7004cef76	Removing a test so that it is all ASCII.	2020-06-22 16:55:16 -04:00
Daniel Lemire	2bb101bd19	Code reformatting.	2020-06-22 16:50:57 -04:00
Daniel Lemire	26baf70912	Pedantic compiler	2020-06-22 16:45:32 -04:00
Daniel Lemire	69a247d500	Adding tests.	2020-06-22 16:12:37 -04:00
Daniel Lemire	a76c67c19f	Fixing...	2020-06-22 15:57:54 -04:00
John Keiser	0c9dc11550	Use really_inline to help g++ detect initialized variable	2020-06-21 16:27:05 -07:00
John Keiser	1ff55c2729	Replace auto [x,error] with .get() everywhere	2020-06-21 16:26:59 -07:00
Daniel Lemire	38bb08778a	With an example.	2020-06-21 17:57:22 -04:00
Daniel Lemire	5dbcdf1484	Ok	2020-06-21 17:52:30 -04:00
John Keiser	6fa5abcd7e	Replace x.get<T>() with x.get(v) or T(x)	2020-06-21 14:36:38 -07:00
John Keiser	1b1a122b1f	Fix copy constructor issue on older gcc	2020-06-21 12:06:14 -07:00
John Keiser	ae1bd891e7	Remove deprecated uses of parse_many	2020-06-21 11:19:06 -07:00
John Keiser	9899e5021d	Allow use of document_stream with tie()	2020-06-20 21:15:05 -07:00
John Keiser	a7fc7d4ffb	Switch from get(v,e) to e = get(v)	2020-06-20 17:57:09 -07:00
John Keiser	f336103f63	Convert tools/docs/benchmarks to bool get() idiom	2020-06-20 17:55:46 -07:00
John Keiser	56e2b38048	Add bool result from tie()/get(), get<T>(T&,error_code&)	2020-06-20 17:55:46 -07:00
John Keiser	0b8c357eff	Add get_X and is_X methods	2020-06-19 13:27:33 -07:00
John Keiser	efc168f473	Make test changes only	2020-06-19 13:27:33 -07:00
John Keiser	d8428f98d9	Add cast_tester.h	2020-06-19 13:27:33 -07:00
John Keiser	60f17d26a3	Move test macros to a header	2020-06-19 13:27:00 -07:00
Daniel Lemire	5ccdbef7d5	Merge pull request #936 from simdjson/dlemire/new_examples New examples.	2020-06-18 18:29:06 -04:00
Daniel Lemire	c13c2650a2	Merge pull request #940 from simdjson/issue938 Verifying (and fixing) issue 938	2020-06-18 18:25:31 -04:00
John Keiser	f632e7c043	Put C++11 capable version back, change name to readme style	2020-06-18 12:50:49 -07:00
Daniel Lemire	04a19f9813	Fixes https://github.com/simdjson/simdjson/issues/937	2020-06-17 18:06:13 -04:00
Daniel Lemire	0655a135e6	Reverting.	2020-06-17 17:52:07 +00:00
Daniel Lemire	4474f8ef18	Cleaning a bit the examples.	2020-06-17 16:24:55 +00:00
Daniel Lemire	6537d0dc76	Avoiding the unused errors.	2020-06-17 14:19:58 +00:00
Daniel Lemire	8d609607e2	Verifying the bug.	2020-06-16 20:04:09 -04:00
Daniel Lemire	27a75a9085	Tweaking.	2020-06-15 17:54:34 -04:00
Daniel Lemire	954d6c326d	New examples.	2020-06-15 17:45:15 -04:00
John Keiser	fd44c2a2ff	Merge pull request #927 from simdjson/dlemire/exposingthestringminifier Exposing the string minifier.	2020-06-13 07:47:20 -07:00
John Keiser	a86a82b39c	Rename minify class to minifier so the minify() method is cleared up	2020-06-12 17:05:25 -07:00
Daniel Lemire	89b059b1ea	Testing with GCC 10 and clang 10 (#926 ) * Testing with GCC 10 and clang 10 * Fixing spurious space * gcc10 does not need the cmake installation. * We don't want to run the perf test on ARM. I ignore them systematically. ARM performance should be assessed manually. * Switching to GCC 10 and Clang 10 * Disabling some tests under sanitizers when they involve rapidjson or other parsers. Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-12 17:58:53 -04:00
Daniel Lemire	4dfbf98e4e	Using a worker instead of a thread per batch (#920 ) In the parse_many function, we have one thread doing the stage 1, while the main thread does stage 2. So if stage 1 and stage 2 take half the time, the parse_many could run at twice the speed. It is unlikely to do so. Still, we see benefits of about 40% due to threading. To achieve this interleaving, we load the data in batches (blocks) of some size. In the current code (master), we create a new thread for each batch. Thread creation is expensive so our approach only works over sizeable batches. This PR improves things and makes parse_many faster when using small batches. This fixes our parse_stream benchmark which is just busted. This replaces the one-thread per batch routine by a worker object that reuses the same thread. In benchmarks, this allows us to get the same maximal speed, but with smaller processing blocks. It does not help much with larger blocks because the cost of the thread create gets amortized efficiently. This PR makes parse_many beneficial over small datasets. It also makes us less dependent on the thread creation time. Unfortunately, it is going to be difficult to say anything definitive in general. The cost of creating a thread varies widely depending on the OS. On some systems, it might be cheap, in others very expensive. It should be expected that the new code will depend less drastically on the performances of the underlying system, since we create juste one thread. Co-authored-by: John Keiser <john@johnkeiser.com> Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-12 16:51:18 -04:00
Daniel Lemire	45e2178ada	Duh.	2020-06-11 17:20:28 +00:00
Daniel Lemire	a6e4933d93	Exposing the string minifier.	2020-06-11 13:07:18 -04:00
John Keiser	fe01da077e	Make threaded version work again	2020-06-07 16:21:00 -07:00
John Keiser	3e226795f0	Run all passing json against parse_many. Empty documents pass, too.	2020-06-07 16:20:51 -07:00
John Keiser	c4a0fe1606	Add tests for parse_many() errors	2020-06-07 16:20:46 -07:00
John Keiser	ef63a84a3e	Move document stream state to implementation	2020-06-07 16:20:44 -07:00
Daniel Lemire	7a69da16e4	Fixing issue 906 (#912 ) * Fixing issue 906 * Safe patching. * Now with explanations. * Bumping up memory allocation. * Putting the patch back. * fallback fixes. Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-05 15:37:09 -04:00
Daniel Lemire	12150baa5e	Using just ASCII. (#899 ) * Using just ASCII. * Let us prune checkperf. * Moving the description of lookup2 to the HACKING.md file.	2020-05-21 21:59:06 -04:00
Daniel Lemire	d2c9ea8a9a	Detect bash instead of relying on MSVC detection. (#894 )	2020-05-20 12:13:14 -04:00
John Keiser	5312fd30e5	Fix CRT_SECURE warnings in clang	2020-05-04 11:36:00 -07:00
John Keiser	1d06624d38	Unset /D_CRT_SECURE_NO_WARNINGS - Also localize DISABLE_DEPRECATED_WARNING so that we catch other deprecations	2020-05-04 11:35:05 -07:00
Furkan Usta	064eb0b24f	CMake: Make simdjson-internal-flags subsume simdjson-flags	2020-05-03 02:48:29 +03:00
Furkan Usta	af968c5b44	Merge branch 'master' of github.com:simdjson/simdjson into cmake-flags	2020-05-03 02:12:23 +03:00
Furkan Usta	1e9488d4a6	Remove Microsoft comment regarding dirent in parsingchecks	2020-05-02 16:01:30 +03:00
Furkan Usta	ff1d77ead9	Add NOMINMAX to parsingchecks	2020-05-02 15:33:53 +03:00
Furkan Usta	977e1a94b2	Use dirent_portable.h only in MSVC	2020-05-02 15:16:50 +03:00
Furkan Usta	60ee5fc844	Enable numberparsingcheck and stringparsingcheck on MSVC	2020-05-02 15:12:30 +03:00
Furkan Usta	293c104cc4	CMake: Separate public and private compilation flags simdjson-internal-flags for macros and warnings simdjson-flags for pthread, sanitizer, and libcpp	2020-05-02 04:08:47 +03:00
Daniel Lemire	fa4ce6a8bc	There is confusion between gigabytes and gigibytes. Let us standardize throughout. (#838 ) * There is confusion between gigabytes and gigibytes. * Trying to be consistent.	2020-05-01 12:16:18 -04:00
John Keiser	0e6ea76e88	Make checkperf work on Windows (#799 ) * Make command line arguments work for Windows * Run checkperf on Windows	2020-04-27 14:20:05 -04:00
Daniel Lemire	f397b6fedf	Another example. (#790 ) * Another example. * Adding a reference to error chaining.	2020-04-23 21:48:41 -04:00
Daniel Lemire	4f72d5cfac	This adds another example (#785 )	2020-04-23 18:29:28 -04:00
Daniel Lemire	e030f02776	Merge branch 'master' into jkeiser/wconversion	2020-04-22 22:03:34 -04:00
Daniel Lemire	f0ac55ec0c	testing on freebsd (#768 ) * Adding cirrus tests * Adding cirrus badge.	2020-04-22 21:22:09 -04:00
John Keiser	d4a37f6ef5	Enable conversion warnings on Linux and Windows	2020-04-22 14:21:30 -07:00
John Keiser	d3e44b1108	Add amalgamation support to cmake	2020-04-20 19:50:51 -07:00
John Keiser	53d28a713c	Fix cmake error when SIMDJSON_COMPETITION=OFF	2020-04-20 10:49:40 -07:00
John Keiser	e5e6a46c37	Consolidate multi-implementation tests Uses SIMDJSON_FORCE_IMPLEMENTATION to switch the implementation at test time.	2020-04-19 09:59:49 -07:00
John Keiser	22b9a53bef	Add SIMDJSON_FORCE_IMPLEMENTATION	2020-04-18 18:21:56 -07:00
John Keiser	ff09b6c824	Run fewer redundant steps and configs in CI	2020-04-17 12:23:05 -07:00
John Keiser	289cc3e7a0	Treat warnings as errors during compilation	2020-04-15 19:59:38 -07:00
John Keiser	fd418f568c	Fix c++11 warnings on clang - namespace x::y is C++17 - static_assert requires message in C++11	2020-04-15 17:27:48 -07:00
John Keiser	09cf18a646	Add C++11 tests to cmake - Add simdjson-flags target so callers don't have flags forced on them	2020-04-15 17:26:25 -07:00
Daniel Lemire	6d7c77ddc1	Let us try to check with the exceptions disabled. (#707 ) * Tweaking code so that we can run all tests with exceptions off. * Removing SIMDJSON_DISABLE_EXCEPTIONS	2020-04-15 16:45:36 -04:00
Daniel Lemire	efd706528b	Minor tweaks to the CMake.	2020-04-15 10:19:05 -04:00
Daniel Lemire	b523c43927	Can we provide a size() function to arrays and objects? (eager approach) [TO BE MERGED] (#690 ) * This is an implementation of "size()" for arrays and objects. * Adding benchmark * Adding a size() remark in the documentation. * Extending size() to result types.	2020-04-15 10:15:48 -04:00
Paul Dreik	75545ff70d	ref qualify parser methods to avoid use of dangling objects (#703 ) To avoid using data belonging to a temporary, the parse functions are ref qualified to get a compile error if used on an rvalue. See https://github.com/simdjson/simdjson/issues/696 Compilation tests are also added, to make sure bad usage fails to compile. Reviewed by jkeiser.	2020-04-15 09:57:52 +02:00
Daniel Lemire	3c6ef83046	Trying to correct the documentation so that it actually describes how the code behaves. (Attempt two) (#712 ) * Trying to correct the documentation so that it actually describes how the code behaves. * tweaking the wording. * Improving. * Removing confusing sentence. * Fixing formatting. * Now with working example, tested. * Added a smaller piece of code	2020-04-14 22:31:21 -04:00
John Keiser	b9ac0a79f1	Merge pull request #715 from simdjson/jkeiser/thorough-type-tests Test more variants of cast, get, etc.	2020-04-14 16:08:36 -07:00
Daniel Lemire	8539896f3d	It is inconvenient to be unable to print a padded_string. (#713 ) * It is inconvenient to be unable to print a padded_string. * Allows us to print the padded_string even when it is embedded in result object when exceptions are enabled.	2020-04-14 19:07:32 -04:00
John Keiser	a3b508ceff	Test get<>(), exception vs. no exception, explicit vs. implicit cast	2020-04-14 13:18:42 -07:00
John Keiser	1ff22c78b3	Add quickstart to cmake	2020-04-09 14:56:54 -07:00
John Keiser	ceb1def55c	Add quicktests, slowtests to cmake - Also add testjson2json.sh - Move test scripts to tests directory to consolidate concerns	2020-04-09 14:21:45 -07:00
John Keiser	7317fe1440	Don't reinitialize submodules Add ability to turn competitive benchmarks off (no need for submodules)	2020-04-09 08:52:29 -07:00
John Keiser	6dabfa176a	Add competition libraries	2020-04-09 08:52:29 -07:00
John Keiser	218c867f46	Disable failing VS2017 tests in cmake	2020-04-08 14:58:28 -07:00
John Keiser	beaa6a9a7a	Create simdjson-windows-headers interface library	2020-04-08 14:52:56 -07:00
John Keiser	a9c8224f40	Add numberparsingcheck and stringparsingcheck tests	2020-04-08 14:52:56 -07:00
John Keiser	3dcc188d93	Add more tests to cmake	2020-04-08 14:52:56 -07:00
John Keiser	10b7556a37	Specify cmake tests, benchmarks and tools idiomatically	2020-04-08 14:52:56 -07:00
John Keiser	54b7291c34	Reference simdjson by name, don't specify include files individually	2020-04-08 14:52:55 -07:00
John Keiser	1e30b6e334	Compile under C++ 11	2020-04-08 14:00:13 -07:00
John Keiser	406240bae3	Support C++ 14	2020-04-08 14:00:13 -07:00
John Keiser	6eec2d6b4f	Simplify cars example	2020-04-05 09:15:20 -07:00
Daniel Lemire	5731c5437a	Sanity test. (#675 )	2020-04-04 16:39:37 -04:00
Daniel Lemire	04f14ec026	This adds a test for std::ignore (#674 )	2020-04-04 11:53:03 -04:00
John Keiser	13aee51011	Add element.type() for type switching	2020-04-02 14:07:19 -07:00
John Keiser	d93af1161d	Remove set_capacity, replace with allocate Makes allocation point more predictable	2020-03-30 13:49:54 -07:00
John Keiser	434776db1a	Deprecate more things	2020-03-30 13:48:43 -07:00
John Keiser	2115596ed3	Compile performance.md examples in tests	2020-03-29 16:28:34 -07:00
John Keiser	0e3453f7c2	Compile examples from implementation-selection.md	2020-03-29 16:28:34 -07:00
John Keiser	7ed65e42d7	Add actual examples from basics.md to readme_examples	2020-03-29 16:28:29 -07:00
John Keiser	ea8a5020e2	Remove array indexer, make object indexer key lookup	2020-03-28 15:56:43 -07:00
John Keiser	622d9c9480	Replace as_X and is_X with get<T> and is<T>	2020-03-28 15:29:53 -07:00
John Keiser	62da98aef6	Rename dom::stream to dom::document_stream	2020-03-28 13:42:24 -07:00
John Keiser	03746b966b	Move document/element/etc. under dom	2020-03-28 13:42:21 -07:00
John Keiser	e836c28008	Deprecate parser error code methods - Also make competitions compile without warnings	2020-03-28 10:13:20 -07:00
John Keiser	5ad405006c	Return document::element from parse, load, parse_many, load_many	2020-03-27 12:24:41 -07:00
John Keiser	90a7503181	Rename pj -> doc, fix a few other idioms	2020-03-27 09:22:46 -07:00
John Keiser	c14b2fb36c	Remove const char* variants for at_key() - Remove const char * variants for at_key(), string_view covers them - Add at_key_case_insensitive variants on *_result - Add at(), at_key(), at_key_case_insensitive() tests	2020-03-27 09:09:08 -07:00
John Keiser	f0f111b387	Make ParsedJson::Iterator backcompat test	2020-03-27 09:07:39 -07:00
Daniel Lemire	6a8ec95a46	Various fixes.	2020-03-26 20:08:54 -04:00
Daniel Lemire	b6c6680add	Ported jsoncheck.	2020-03-26 19:56:04 -04:00
Daniel Lemire	5fb149f833	Converted inter_tests...	2020-03-26 19:52:17 -04:00
Daniel Lemire	abb0bf9247	Fixed basictests	2020-03-26 19:40:29 -04:00
Daniel Lemire	8f3ddd3a73	Updating allparserscheckfile	2020-03-26 17:15:33 -04:00
John Keiser	2e420169c3	Remove document::parse and document::load	2020-03-26 10:13:09 -07:00
John Keiser	5aec2671ea	Remove JsonStream. Use parse_many() instead.	2020-03-26 09:25:07 -07:00
John Keiser	a0bce440a6	Remove document_iterator, document::iterator, ParsedJsonIterator Keep ParsedJson::Iterator only, without template, in same form as it was in 0.2	2020-03-25 18:26:51 -07:00
John Keiser	e1b1500e3b	Make _padded available without using namespace simdjson	2020-03-25 09:37:18 -07:00
John Keiser	b28cafc1d1	Remove backslash unescaping from JSON pointer impl Also speed up non-escaped key lookup	2020-03-25 08:56:40 -07:00
John Keiser	0bcda5e384	Support JSON pointer in DOM navigation model	2020-03-23 15:05:20 -07:00
John Keiser	c34b1a1b2a	Organize basic tests to make easier to turn on/off	2020-03-21 18:12:16 -07:00
John Keiser	e4df0ca368	Add parse, parse_many, load, load_many tests	2020-03-21 18:12:16 -07:00
Daniel Lemire	8a91cecf41	testing only with ok documents.	2020-03-21 18:12:16 -07:00
Daniel Lemire	04e8710cf5	Testing issue 570	2020-03-21 18:12:16 -07:00
John Keiser	e8b3f9eaad	Support document::parse("[1,2,3]"_padded)	2020-03-21 11:15:20 -07:00
Daniel Lemire	5d1e3efce8	faster minifier (#568 ) * Fallback should use our scalar code. * parse should have a nicer error message. * Making it so that "minify" can use different architectures. * Let us change the minifier competition so that it tests all implementations. * Documenting the untaken optimization opportunity. Co-authored-by: John Keiser <john@johnkeiser.com>	2020-03-20 16:14:47 -04:00
John Keiser	7cf3a7511b	Add fallback implementation to CI - Also add SIMDJSON_IMPLEMENTATION_HASWELL/WESTMERE/ARM64/FALLBACK=1/0 to enable/disable various implemnentations	2020-03-17 14:59:47 -07:00
John Keiser	af203aaf86	Add fallback parser for pre-SSE4.2 machines	2020-03-17 14:59:47 -07:00
John Keiser	8e2c06cb0e	Compile with -fno-exceptions	2020-03-17 13:54:37 -07:00
John Keiser	1a5d8f1957	Add tests for SIMDJSON_EXCEPTIONS=0, add `tie()` support	2020-03-17 13:54:37 -07:00
Daniel Lemire	317fc6ba0e	accurate number parsing (#558 )	2020-03-15 22:30:21 -04:00
Daniel Lemire	d9a9fd387d	Adding a stress test.	2020-03-13 18:59:15 -07:00
John Keiser	acc7bd79b0	Support cout << json, cout << minify(json)	2020-03-13 18:59:15 -07:00
Daniel Lemire	12e6611ba4	Fix for printf.	2020-03-13 14:44:21 -04:00
Daniel Lemire	06c1dc3a29	Adding NDEBUG to release (#557 ) * Adding NDEBUG to release * Asserts are deleted with NDEBUG. We want hard asserts.	2020-03-13 14:37:02 -04:00
Daniel Lemire	89d9de2353	Adding a check to see whether document::stream copy constructor and assignment actually compile (#556 ) * Currently, document::stream contains an attribute that is a reference: ``` document::parser &parser; ``` Yet we try to have it default on the move operator: ``` stream &operator=(document::stream &&other) = default; stream &operator=(const document::stream &) = delete; // Disallow copying ``` ``` stream(document::stream &&other) = default; stream(const document::stream &) = delete; // Disallow copying ``` I am not sure what the move is supposed to do with the reference. I cannot find where we test the copy constructor and assignment. This has been concerned that it is either dead code or buggy code. * Remove non-working, unnecessary move constructors * We still want to disallow copies. Co-authored-by: John Keiser <john@johnkeiser.com>	2020-03-13 12:53:42 -04:00
John Keiser	ac0899c043	Add error tests, doc_ref_result[] chaining	2020-03-11 17:19:41 -07:00
John Keiser	40c6213d7e	Add parser.load() and load_many() to load files	2020-03-11 17:19:41 -07:00
John Keiser	d140bc23f5	Automatically allocate memory as needed in parse	2020-03-11 16:14:54 -07:00
John Keiser	3bdfe167de	Support cout << error	2020-03-06 15:41:51 -08:00
John Keiser	31e8a12e88	Make error_message(error_code) return C string - Also move all error message logic to include inline	2020-03-06 15:41:51 -08:00
John Keiser	9a7c8fb5be	Use parse_many in examples/tests/docs	2020-03-05 12:04:45 -08:00
John Keiser	cfef4ff2ad	Create parser.parse_many() API	2020-03-05 12:04:45 -08:00
John Keiser	b3ea8c406e	Add simdjson.cpp for unified use (#515 )	2020-03-04 10:12:27 -08:00
John Keiser	99667f7c55	Create top level simdjson.h (#515 ) - Allows everyone to #include the same way, singleheader or not.	2020-03-04 10:12:27 -08:00
John Keiser	0b21203141	Document navigation API	2020-03-02 14:49:03 -08:00
Daniel Lemire	68670301e3	Adding instructions regarding how to check for an unsupported CPU (#508 ) * Adding instructions. * Slighty more documentation.	2020-02-25 11:09:51 -05:00
John Keiser	910f272467	Add parser implementation interface and selection API (#501 ) * Make architecture implementations virtual functions - Easier to add new architectures (add implementation to implementation.cpp) - Easier to add new algorithms / functions to architecture selection (add to implementation.h, implement) - Automatically select best implementation in static initialization - Allow user to explicitly select implementation with a string (i.e. parameter) - Allow user to inspect current implementation name/description - Allow user to list available implementations - Eliminate architecture enum and architecture-based templating - Add noexcept in non-inline functions * Move implementation static methods to their own classes * Detect best supported implementation on first use * available_implementationsI() -> available_implementations	2020-02-21 16:34:27 -05:00
John Keiser	4dc2adf7f8	Update README, add README examples	2020-02-18 08:37:07 -08:00
John Keiser	8e7d1a5f09	Separate document state from ParsedJson This creates a "document" class with only user-facing document state (no parser internals). - document: user-facing document state - document::iterator: iterator (equivalent of ParsedJsonIterator) - document::parser: parser state plus a "docked" document we parse into (equivalent of ParsedJson) Usage: ```c++ auto doc = simdjson::document::parse(buf, len); // less efficient but simplest ``` ```c++ simdjson::document::parser parser; // reusable parser parser.allocate_capacity(len); simdjson::document* doc = parser.parse(buf, len); // pointer to doc inside parser doc = parser.parse(buf2, len); // reuses all buffers and overwrites doc; more efficient ```	2020-02-07 10:02:36 -08:00
Daniel Lemire	c924aaede9	Fix issue472: make JsonStream a template. (#473 ) * Fix issue472: make JsonStream a template. * Adding missing include. * Tweaking headers and some minor formatting. * Removing file from aggregation. * Moving jsoncharutils * Adding new header. * Trying another header. * Let us try to route around Visual Studio's nonesense.	2020-01-30 17:16:41 -05:00

1 2 3 4 5 ...

324 Commits