simdjson

Commit Graph

Author	SHA1	Message	Date
Convery	e6258377d9	Additional define to silence the 32-bit portability warning. (#1279 ) Under compilers like MSVC the `#pragma message` about portability will be issued for each translation module; regardless of whether or not `SIMDJSON_PORTABILITY_H` has already been defined. As such a new define `SIMDJSON_NO_PORTABILITY_WARNING`, can be defined prior to the inclusion of `<simdjson.h>` to silence it.	2020-11-02 08:37:13 -05:00
Paul Dreik	54ffbbe7db	remove asserts from compute_float_64 (#1276 )	2020-11-01 18:34:42 +01:00
Paul Dreik	0b82f07115	fix segfault in numberparsing #1273 (#1274 ) This was a read overflow.	2020-11-01 18:27:21 +01:00
Paul Dreik	265db2e533	fix non ascii sources (#1275 ) Master does not build because of non-ascii sources, merging without waiting for CI.	2020-11-01 11:14:01 +01:00
Paul Dreik	f93fb21c95	optionally disable deprecated apis (#1271 ) Introduce cmake option SIMDJSON_DISABLE_DEPRECATED_API (default Off) which turns off deprecated simdjson api functions by setting the macro SIMDJSON_DISABLE_DEPRECATED_API. For non-cmake users, users will have to set SIMDJSON_DISABLE_DEPRECATED_API by some other means to disable the api. Closes #1264	2020-11-01 06:38:52 +01:00
Daniel Lemire	b1444b4dfb	Mostly tiny changes, with one optimization to fallback for number parsing. (#1265 ) * Mostly tiny changes, with one optimization to fallback for number parsing. * Missed an update.	2020-10-29 11:18:11 -04:00
Danila Kutenin	f46a0f64f2	PPC64 support (#1254 ) * Initial PPC64 support * Add travis CI * Fix outdated cmake version for travis * Fix indentation * Try another workaround for outdated cmake in travis * Try beta cmake * Add dash before beta * Use builtin snaps * Use cmake as rocksdb * Test cmake on bionic * Remove unnecessary things from travis * Remove unnecessary things from travis * Another try of compiler install * Add all major compilers * Add all major compilers * Add all major compilers * Tweak travis a bit * Typo * More robust travis * Typos typos typos * Add fewer compilers, add non specific build for clang and gcc, should be the final config * CMAKE_FLAGS is in incorrect place * Remove default implementation * Limit build thread number * Fall back prefix_xor to a usual implementation, no performance boost is noticed * Test for power9 as it is the main architecture for OpenPOWER right now * Add to documentation to build with power9 as the implementation is compatible but compiler optimizations are not * Replace ARM with PPC in the comment	2020-10-27 18:43:39 -04:00
Daniel Lemire	a75c07065f	Fix for issue 1246. We document the relationship between parser instances and elements (#1250 ) * Fix for issue 1246. * Adopting John's wording.	2020-10-26 08:40:45 -04:00
Daniel Lemire	6a86ef5a7d	Issue release	2020-10-23 09:32:25 -04:00
Daniel Lemire	0d6919dd99	Reenable the on-demand tests and allows us to convert a raw string into a C++ string. (#1232 ) * Reenable the on-demand tests and allows us to convert a raw string into a C++ string. * Fixing a 1-byte buffer overrun. * More documentation. * Adding more tests. * Enabling the new tests * Committing a nicer example. * Not yet happy but this should fix our failures. * Duh. * Ok. Making it easier to get string_view instances from field instances. * It is a struct. * Trying to satisfy VS. * Adopting John's name.	2020-10-19 20:22:24 -04:00
Paul Dreik	f1b4a54991	add fuzz element (#1204 ) * add definitions for is_number and tie (by lemire) * add fuzzer for element * update fuzz documentation * fix UB in creating an empty padded string * don't bother null terminating padded_string, it is done by the std::memset already * refactor fuzz data splitting into a separate class	2020-10-17 05:48:50 +02:00
Paul Dreik	58e7106df1	remove unused function parse_unsigned	2020-10-16 22:17:11 +02:00
Paul Dreik	7bf391c54a	fix potential use of uninitialized value warning, avoid casting away const This fixes a "potentially use of uninitialized value" warning, as well as a cstyle cast to non-const.	2020-10-16 22:14:42 +02:00
Daniel Lemire	07a6e098c8	This would allow users to find out what builtin is. (#1227 ) * This would allow users to find out what builtin is. * Trying another approach. * Added instructions. * Cleaning up the printout. * Let us be less invasive. * Adding a comment.	2020-10-15 21:58:42 -04:00
Daniel Lemire	e4897d6b54	We have hardcoded 32 (#1236 )	2020-10-15 21:57:10 -04:00
Daniel Lemire	bb2bc98a22	Fix issue https://github.com/simdjson/simdjson/issues/1127 (#1224 )	2020-10-13 09:18:54 -04:00
Daniel Lemire	43da4f7ccc	Corrected number	2020-10-12 17:59:13 -04:00
Daniel Lemire	37e6d1e9c7	new number parsing (#1222 ) * Remove our dependency on strtod_l by bundling our own slow path. * Ok. Let us drop strtod entirely. * Trimming down the powers to -342. * Removing useless line. * Many more comments. * Adding some DLL exports. * Let the gods help those who rely on windows+gcc. * Marking the subnormals as unlikely. This is pretty much "performance neutral", but it might help just a bit with twitter.json.	2020-10-10 12:47:49 -04:00
John Keiser	676a3d068c	Add non-top-level array iteration test	2020-10-06 11:29:46 -07:00
John Keiser	5533f8d87b	Add object iteration error tests	2020-10-06 11:29:46 -07:00
John Keiser	364ad5529d	Add basic error tests	2020-10-06 11:29:46 -07:00
John Keiser	9088792b0e	Disable value["x"] (unsafe, convert to object first)	2020-10-06 11:29:45 -07:00
John Keiser	00f9bb8a07	Add null tests	2020-10-06 11:29:45 -07:00
John Keiser	a90e1637cb	Add boolean tests	2020-10-06 11:29:45 -07:00
John Keiser	ce09d82fc7	Test strings	2020-10-06 11:29:45 -07:00
John Keiser	5b926b8196	Support array iteration over document	2020-10-06 11:29:45 -07:00
John Keiser	c719ccdb48	Add tests for empty object/array	2020-10-06 11:29:45 -07:00
John Keiser	9f1786aeb1	Add .get(T) to value/document	2020-10-06 11:29:45 -07:00
John Keiser	0bb83e06bc	Fix root number parsing	2020-10-06 11:29:45 -07:00
John Keiser	2ba67c2bc2	[WIP] && and & versions of each operator	2020-10-06 11:29:45 -07:00
John Keiser	cae91983ec	Fix issue with early destruction	2020-10-06 11:29:45 -07:00
John Keiser	c7c1372833	Allow reuse of value to try multiple types	2020-10-06 11:29:45 -07:00
John Keiser	ba02cda55f	Allow direct document iteration	2020-10-06 11:29:45 -07:00
John Keiser	512a94afaa	Move error to json_iterator	2020-10-06 11:29:45 -07:00
John Keiser	6d978c383a	Kinder, gentler implementation selection - Allow user to specify SIMDJSON_BUILTIN_IMPLEMENTATION - Make cmake -DSIMDJSON_IMPLEMENTATION=haswell only specify haswell - Move negative implementation selection to -DSIMDJSON_EXCLUDE_IMPLEMENTATION - Automatically select SIMDJSON_BUILTIN_IMPLEMENTATION if SIMDJSON_IMPLEMENTATION is set - Move implementation enablement mostly to implementation files - Make implementation enablement and selection simpler and more robust - Fix bug where programs linked against simdjson were not passed SIMDJSON_XXX_IMPLEMENTATION or SIMDJSON_EXCEPTIONS	2020-10-06 11:29:45 -07:00
John Keiser	88f0dc4726	Add API docs, ensure parser and field methods called with &	2020-10-06 11:29:45 -07:00
John Keiser	8ae7910aba	Namespace documentation	2020-10-06 11:29:45 -07:00
John Keiser	4859cb8528	Add SIMDJSON_ONDEMAND_SAFETY_RAILS	2020-10-06 11:29:45 -07:00
John Keiser	c42b91980b	Fix VC++ forward friend declarations	2020-10-06 11:29:45 -07:00
John Keiser	29bc78a486	Add ondemand to arm64/fallback/westmere	2020-10-04 12:47:30 -07:00
John Keiser	85cefd5a00	Alias simdjson::builtin to simdjson::SIMDJSON_BUILTIN_IMPLEMENTATION	2020-10-04 12:47:30 -07:00
John Keiser	b5a328e0ca	Set SIMDJSON_BUILTIN_IMPLEMENTATION to minimum supported	2020-10-04 12:47:30 -07:00
John Keiser	209a2e8fc3	Fix ARM compile	2020-10-04 12:47:30 -07:00
John Keiser	8b978e6aea	Don't call functions max() (conflicts with Windows macro)	2020-10-04 12:47:30 -07:00
John Keiser	76aeda6b00	Fix [] on simdjson_result<value> to require temp object	2020-10-04 12:47:30 -07:00
John Keiser	1a0ad6d9c3	Fix bug document.get_xxx()	2020-10-04 12:47:30 -07:00
John Keiser	1e90691013	Add parse_unsigned/parse_integer/parse_double to SKIP_NUMBERPARSING	2020-10-04 12:47:30 -07:00
John Keiser	3577c87c88	Fix amalgamation with generic/ files in include/	2020-10-04 12:47:30 -07:00
John Keiser	a700848bae	Move ondemand implementation to include/	2020-10-04 12:47:30 -07:00
John Keiser	283ac3191f	Rename parse->iterate, add iterate_raw	2020-10-04 12:47:29 -07:00
John Keiser	ebcb3c6b3b	On-demand parse implementation	2020-10-04 12:47:29 -07:00
Daniel Lemire	9865bb6904	Make it possible to check that an implementation is supported at runtime (#1197 ) * Make it possible to check that an implementation is supported at runtime. * add CI fuzzing on arm 64 bit This adds fuzzing on drone.io arm64 For some reason, leak detection had to be disabled. If it is enabled, the fuzzer falsely reports a crash at the end of fuzzing. Closes: #1188 * Guarding the implementation accesses. * Better doc. * Updating cxxopts. * Make it possible to check that an implementation is supported at runtime. * Guarding the implementation accesses. * Better doc. * Updating cxxopts. * We need to accommodate cxxopts Co-authored-by: Paul Dreik <github@pauldreik.se>	2020-10-02 11:04:51 -04:00
Daniel Lemire	da093c1982	Fixing "undefined behavior" issue in new fast_itoa functions (#1186 ) * Fixing "undefined behavior" issue. * Simplifying our custom atoi * Fixing minor bug	2020-09-29 19:17:03 -04:00
Daniel Lemire	0e584fa4a5	Attempt to fix issue 1187. (#1192 )	2020-09-27 12:04:47 -04:00
Daniel Lemire	60c139a844	Faster and more correct serialization (#1168 ) * Adding new files. * Better. * Fixing minifier and adding tests. * Adding benchmarks. * Including the array header. * Replacing old stream-based code by the new code. * Doubling up the itoa. * Hidden away to_chars in internal namespace. * Removing the repetitions. * Documented the atoi functions. * Tuning the escape sequences. * Moving the operators off the main namespace. * Added more tests. * Tweaking the implementation so that it works with and without exp. * The string_builder template and mini_formatter class are not part of our public API and are subject to change at any time! * Adding a benchmark and some optimization. * Cleaning. * Strictly speaking, this header is needed.	2020-09-23 10:00:39 -04:00
Daniel Lemire	f410213003	Improve documentation on padding - Improves and clarifies the documentation on padding. - Use std:: prefix for memcpy, strlen etc. Related to issues #1175 and #1178	2020-09-23 09:07:14 +02:00
Daniel Lemire	72c83d9430	This avoids locale-dependent number parsing at the standard library level (#1157 ) * This avoids locale-dependent number parsing at the standard library level. * Adding missing cast. * Inserting the missing "endif" * Trial and error. * Another attempt. * Another tweak. * Another fix. * Restricting it even more. * Tweaking our symbol checks. * Somewhat smarter tests. * Nice comments. * Minor simplification. * Adding cerr.	2020-09-15 11:36:18 -04:00
Daniel Lemire	3e5497e2f9	Fixes issue 1170 and makes the usage of minify easier. (#1171 ) * Fixes issue 1170 and makes the usage of minify easier. * This should get the fallback implementation to detect unclosed strings.	2020-09-12 16:20:20 -04:00
Daniel Lemire	0552335ec1	Fixing the issue. (#1151 )	2020-09-02 18:41:59 -04:00
Daniel Lemire	7aea774b21	Adding tests and a fix for empty strings in at_pointer (#1148 ) * Adding a test. * More tests.	2020-09-02 17:04:56 -04:00
Daniel Lemire	5b10c38e43	Make parse_many safer. (#1137 )	2020-08-20 22:22:46 -04:00
Daniel Lemire	3316df9195	Adding test for issue 1133 and improving documentation (#1134 ) * Adding test. * Saving. * With exceptions. * Added extensive tests. * Better documentation. * Tweaking CI * Cleaning. * Do not assume make. * Let us make the build verbose * Reorg * I do not understand how circle ci works. * Breaking it up. * Better syntax.	2020-08-20 14:03:14 -04:00
Daniel Lemire	5d355f1a8b	release candidate (#1132 )	2020-08-19 18:12:23 -04:00
John Keiser	2ff91103ca	Remove SIMDJSON_DO_NOT_USE_THREADS_NO_MATTER_WHAT (#1131 )	2020-08-19 17:11:13 -04:00
John Keiser	708a56872d	Move inline/* to *-inl.h	2020-08-19 09:09:31 -07:00
Daniel Lemire	8a8eea53a2	Prefixing macros (issue 1035) (#1124 ) * Renaming partially done. * More prefixing. * I thought that this was fixed. * Missed one. * Missed a few. * Missed another one. * Minor fixes.	2020-08-18 18:25:36 -04:00
Daniel Lemire	09bd7e8ef8	Verification and fix for issue 1063 (JSON Pointers) (#1064 ) * Specification is not followed. * Fixes. * Do not pass string_view by reference. * Better documentation. * The example is written for exceptions. * Better documentation. * Updating with deprecation. * Updating example. * Updating example.	2020-08-18 17:23:18 -04:00
Daniel Lemire	fc15147cf5	This allows the users to disable threading. (#1122 ) * This allows the users to disable threading. * This would disable bash scripts under FreeBSD. (#1118) * This would disable bash scripts under FreeBSD. * Let us also disable GIT. * Let us try to just disable GIT * Nope. We must have both bash and git disabled. * This allows the users to disable threading.	2020-08-18 16:43:08 -04:00
John Keiser	1b69612246	Remove information about nonexistent computed gotos :)	2020-08-10 16:29:24 -07:00
Daniel Lemire	83615ff351	Fixes issue 1088 (#1096 )	2020-08-06 11:42:13 -04:00
Daniel Lemire	75c75ac00c	This is dead code. (#1095 )	2020-08-05 16:48:10 -04:00
John Keiser	c0010f60e6	Add view support to simdjson_result<array/object>	2020-08-04 13:17:23 -07:00
John Keiser	75301e4cf5	Fix C++20 compilation	2020-08-04 13:17:23 -07:00
John Keiser	875c8fdcbe	Merge pull request #1071 from pps83/quick-example-at-the-top-of-simdjson.h Add a quick example at the top of simdjson.h	2020-08-04 11:07:21 -07:00
Pavel P	164fcb49d9	+ reformat sample code	2020-08-03 19:48:41 +06:00
Daniel Lemire	f6a3205d10	Avoid allocations in global objects (#1082 )	2020-07-30 13:14:56 -04:00
Daniel Lemire	f873a140ce	Tweaking	2020-07-27 16:24:27 -04:00
PavelP	36dfc5bbd1	Add missing strings.h include for non-windows builds (required by strcasecmp) (#1067 ) addresses #1066	2020-07-27 16:23:06 -04:00
Daniel Lemire	f80668e87f	This removes the crazy alignment requirements. (#1073 ) * This removes the crazy alignment requirements.	2020-07-27 16:19:01 -04:00
Pavel P	1513cdf7bc	Add a quick example at the top of simdjson.h	2020-07-26 14:31:03 -04:00
Daniel Lemire	e33af1a3f8	Adding strings.h header. (#1074 )	2020-07-25 15:27:19 -04:00
John Keiser	af8b52e7e8	Target region for entire compilation of an implementation	2020-07-24 02:48:25 -07:00
Daniel Lemire	796588900c	Reenabling C++ 20 features. (#1059 )	2020-07-21 18:12:04 -04:00
Daniel Lemire	e2cfcc52b3	Disabling cxx20 (#1058 ) * Disabling C++ 20 features. * Updating single-header.	2020-07-21 17:15:31 -04:00
Daniel Lemire	fc8a46025e	Better documentation of default_batch_size (#1056 ) * Better documentation of default_batch_size * Retweaking.	2020-07-21 15:15:21 -04:00
Joe Jevnik	d2bea0c228	Add support for C++ 20 ranges. (#1050 ) C++ 20 adds a new feature called "ranges", which provides components for dealing with sequences of values: https://en.cppreference.com/w/cpp/ranges. A range is like a normal object containing `begin` and `end`, except there are also composable operations like maps, filters, joins, etc. The iterator objects returned by a range's `begin` and `end` require a more strict set of operations than is needed for a range-for loop. This PR adds the extra operations needed to support turning `dom::array` and `dom::object` into a range. This PR does not depend on any C++ 20 behavior, the added operators are all valid C++ 11, and are already part of the LegacyIterator concepts. This PR adds extra code behind: `#if defined(__cpp_lib_ranges)` guards, which is the new C++ 20 specified feature test macro for ranges support. When ranges support is detected, extra compile time checks are added to ensure that `dom::array` and `dom::object` satisfy the range concept. No runtime tests have been added yet because these compile time checks should be sufficient. If desired, the `static_assert` code could be moved out of the actual code headers and put into a test file.	2020-07-21 13:27:39 -04:00
Vitaly Baranov	6bd64c6873	Fix clang warning -Wused-but-marked-unused. (#1042 ) * Fix clang warning -Wused-but-marked-unused. * Fix build.	2020-07-15 13:28:51 -04:00
Daniel Lemire	74870a8189	Fixing issue 1013. (#1016 ) * Fixing issue 1013. * Bumping to 0.4.6 Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-07-01 14:14:51 -04:00
Daniel Lemire	0ef4d90ad0	Fix for issue 1014. (#1015 ) * Fix for issue 1014. * Explanation. Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-30 19:36:26 -04:00
Daniel Lemire	e38fe3d361	Version update	2020-06-30 09:41:20 -04:00
Daniel Lemire	3faae67663	New release	2020-06-29 21:11:58 -04:00
Daniel Lemire	0ba76ac066	This enables building the library under Visual Studio 2015 (#1002 ) Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-29 08:43:47 -04:00
Daniel Lemire	077907b7c3	Preparing a new patch release.	2020-06-28 12:40:13 -04:00
Daniel Lemire	6b85b9a416	Merge pull request #998 from simdjson/issue976 Final steps.	2020-06-26 21:05:06 -04:00
Daniel Lemire	4582a13360	Final steps.	2020-06-26 20:31:24 -04:00
Daniel Lemire	3a064535ae	Merge pull request #994 from simdjson/issue976 Fix for issue 976 (something like 32-bit support)	2020-06-26 20:02:15 -04:00
Daniel Lemire	444ec4ad27	Stupid me	2020-06-26 19:29:28 -04:00
Daniel Lemire	94e910586d	Removing a cast.	2020-06-26 19:06:39 -04:00
Daniel Lemire	6c33f518a8	This introduces a new option to forcefully disable threads.	2020-06-26 13:23:44 -04:00
Daniel Lemire	88da62ba09	Better documentation in the code.	2020-06-26 13:02:12 -04:00
Daniel Lemire	b6997a56df	Patching things up and adding tests.	2020-06-26 12:15:16 -04:00
Daniel Lemire	2956bce047	Minor fixes to avoid 32-bit warnings.	2020-06-25 21:12:26 -04:00
Brendan Knapp	41f33ecbb9	Permit 32-bit GCC compilation	2020-06-25 17:07:17 -07:00
Daniel Lemire	86241e2871	Merge pull request #987 from simdjson/issue985 Removing optional since it is not C++11, and it is not used	2020-06-25 11:04:36 -04:00
Daniel Lemire	1b63a9a9b5	Removing optional since it is not C++11	2020-06-25 10:25:57 -04:00
Daniel Lemire	32348c2b0b	Elaborating.	2020-06-25 10:14:29 -04:00
Daniel Lemire	5e690c5d04	Fixing the string_view issue.	2020-06-25 10:02:10 -04:00
Daniel Lemire	e01f1434fb	Bumping up the version number	2020-06-23 20:55:52 -04:00
John Keiser	187084ce46	Merge pull request #970 from simdjson/jkeiser/singleheader-tests Make singleheader tests be test-only	2020-06-23 17:07:03 -07:00
Daniel Lemire	544fa57641	Damn merge conflicts.	2020-06-23 19:15:47 -04:00
John Keiser	d9929edbc1	Run -Weffc++ in CI	2020-06-23 13:44:25 -07:00
Daniel Lemire	b84a3a0230	Merge branch 'master' into issue961	2020-06-23 14:33:06 -04:00
Daniel Lemire	49d70232f8	Merge pull request #969 from simdjson/dlemire/minor_pre0.4_cleaning Very minor cleaning.	2020-06-23 14:30:47 -04:00
John Keiser	257089884f	Merge pull request #958 from simdjson/jkeiser/is Make simdjson_result<element>.is() return bool	2020-06-23 09:51:37 -07:00
John Keiser	c650ea9765	Merge pull request #960 from simdjson/jkeiser/idiomatic-get Convert simdjson to use .get()	2020-06-23 09:49:41 -07:00
John Keiser	e369d45b9c	Fix non-compileable examples	2020-06-23 09:48:17 -07:00
John Keiser	2d84b6f6d9	Make simdjson_result<element>.is() return bool	2020-06-23 09:09:24 -07:00
John Keiser	eef1171944	Merge pull request #954 from simdjson/jkeiser/parse-many-result Return error from parse_many	2020-06-23 09:06:20 -07:00
Daniel Lemire	f1a03bfb04	Very minor cleaning.	2020-06-23 11:05:58 -04:00
Daniel Lemire	696b0e29e4	Fixing issue 961	2020-06-23 10:47:32 -04:00
Daniel Lemire	33e003616d	Fixing the name of the variable	2020-06-22 16:29:38 -04:00
Daniel Lemire	bf03d77ab9	Passing by value the string_view	2020-06-22 16:28:35 -04:00
Daniel Lemire	d6f056f266	Fixing documentation issues.	2020-06-22 16:17:11 -04:00
Daniel Lemire	a76c67c19f	Fixing...	2020-06-22 15:57:54 -04:00
John Keiser	1ff55c2729	Replace auto [x,error] with .get() everywhere	2020-06-21 16:26:59 -07:00
Daniel Lemire	5dbcdf1484	Ok	2020-06-21 17:52:30 -04:00
Daniel Lemire	f03a6ab5a4	Tweaking.	2020-06-21 17:39:24 -04:00
John Keiser	6fa5abcd7e	Replace x.get<T>() with x.get(v) or T(x)	2020-06-21 14:36:38 -07:00
Daniel Lemire	5dc07ed295	It builds.	2020-06-21 17:20:33 -04:00
John Keiser	1b1a122b1f	Fix copy constructor issue on older gcc	2020-06-21 12:06:14 -07:00
John Keiser	ae1bd891e7	Remove deprecated uses of parse_many	2020-06-21 11:19:06 -07:00
John Keiser	9899e5021d	Allow use of document_stream with tie()	2020-06-20 21:15:05 -07:00
John Keiser	94440e0170	Return simdjson_result from load_many/parse_many	2020-06-20 20:51:53 -07:00
John Keiser	a7fc7d4ffb	Switch from get(v,e) to e = get(v)	2020-06-20 17:57:09 -07:00
John Keiser	56e2b38048	Add bool result from tie()/get(), get<T>(T&,error_code&)	2020-06-20 17:55:46 -07:00
John Keiser	1d8c2d6c22	Make get_xxx the primary functions	2020-06-20 13:29:12 -07:00
John Keiser	0b8c357eff	Add get_X and is_X methods	2020-06-19 13:27:33 -07:00
John Keiser	05bc664c11	Don't extend from tape_ref in public classes	2020-06-19 13:25:52 -07:00
Daniel Lemire	c13c2650a2	Merge pull request #940 from simdjson/issue938 Verifying (and fixing) issue 938	2020-06-18 18:25:31 -04:00
Daniel Lemire	2f6091419f	Merge pull request #944 from simdjson/issue680 Document the complexity of array.at	2020-06-18 18:24:08 -04:00
Daniel Lemire	2022dd7d74	Merge pull request #945 from simdjson/issue678 Fixing issue 678	2020-06-18 18:23:56 -04:00
Daniel Lemire	ef688a74fe	Minor tweak to the documentation.	2020-06-18 18:18:12 -04:00
Daniel Lemire	04a19f9813	Fixes https://github.com/simdjson/simdjson/issues/937	2020-06-17 18:06:13 -04:00
Daniel Lemire	2cbc591c9d	Fixing issue 678	2020-06-17 16:17:17 -04:00
Daniel Lemire	3586fc4910	Fix for issue 680	2020-06-17 18:49:22 +00:00
Daniel Lemire	0b9df6d8c4	It turns out that we need fairly complicated logic.	2020-06-17 15:17:10 +00:00
Daniel Lemire	803b0c4bdb	Light touch.	2020-06-17 11:00:13 -04:00
Daniel Lemire	0d4e501239	Fixing the bug.	2020-06-17 10:06:16 -04:00
John Keiser	fd44c2a2ff	Merge pull request #927 from simdjson/dlemire/exposingthestringminifier Exposing the string minifier.	2020-06-13 07:47:20 -07:00
John Keiser	a86a82b39c	Rename minify class to minifier so the minify() method is cleared up	2020-06-12 17:05:25 -07:00
Daniel Lemire	4dfbf98e4e	Using a worker instead of a thread per batch (#920 ) In the parse_many function, we have one thread doing the stage 1, while the main thread does stage 2. So if stage 1 and stage 2 take half the time, the parse_many could run at twice the speed. It is unlikely to do so. Still, we see benefits of about 40% due to threading. To achieve this interleaving, we load the data in batches (blocks) of some size. In the current code (master), we create a new thread for each batch. Thread creation is expensive so our approach only works over sizeable batches. This PR improves things and makes parse_many faster when using small batches. This fixes our parse_stream benchmark which is just busted. This replaces the one-thread per batch routine by a worker object that reuses the same thread. In benchmarks, this allows us to get the same maximal speed, but with smaller processing blocks. It does not help much with larger blocks because the cost of thread creation gets amortized efficiently. This PR makes parse_many beneficial over small datasets. It also makes us less dependent on the thread creation time. Unfortunately, it is going to be difficult to say anything definitive in general. The cost of creating a thread varies widely depending on the OS. On some systems, it might be cheap, in others very expensive. It should be expected that the new code will depend less drastically on the performance of the underlying system, since we create just one thread. Co-authored-by: John Keiser <john@johnkeiser.com> Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-12 16:51:18 -04:00
John Keiser	bbd61eb13f	Let tape writing be put in a register	2020-06-12 09:18:20 -07:00
Daniel Lemire	a6e4933d93	Exposing the string minifier.	2020-06-11 13:07:18 -04:00
John Keiser	fe01da077e	Make threaded version work again	2020-06-07 16:21:00 -07:00
John Keiser	d43a4e9df9	Remove SUCCESS_AND_HAS_MORE (internal only value)	2020-06-07 16:20:55 -07:00
John Keiser	ef63a84a3e	Move document stream state to implementation	2020-06-07 16:20:44 -07:00
Daniel Lemire	7a69da16e4	Fixing issue 906 (#912 ) * Fixing issue 906 * Safe patching. * Now with explanations. * Bumping up memory allocation. * Putting the patch back. * fallback fixes. Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-05 15:37:09 -04:00
John Keiser	b75fa26dc1	Move containing_scope and ret_address to .cpp	2020-06-01 12:15:55 -07:00
John Keiser	3d22a2d845	One weird trick: set a bogus error value in the parser impl This makes us faster under both gcc and clang somehow.	2020-06-01 12:15:55 -07:00
John Keiser	1aab4752e2	Store all parser state in the implementation	2020-06-01 12:15:54 -07:00
John Keiser	6a71b24495	Reuse stored buf and len from parser	2020-06-01 12:14:09 -07:00
John Keiser	a3a9bde83e	Move DOM parsing into concrete interface implementation	2020-06-01 12:14:09 -07:00
Daniel Lemire	40d57da83c	fixes issue 891 (#893 )	2020-05-20 11:54:53 -04:00
John Keiser	e6c9dfbd91	Make include files more fine-grained	2020-05-19 14:42:04 -07:00
John Keiser	7ad4020829	Make main compilation chunks into .cpp files	2020-05-19 13:32:35 -07:00
John Keiser	a476531524	Share ref_address everywhere it's used	2020-05-19 13:30:34 -07:00
Daniel Lemire	e03c5e9f23	We should guard the include (#881 )	2020-05-13 20:02:46 -04:00
John Keiser	dbb3316511	Move current_string_buf_loc to stage 2	2020-05-11 06:11:32 -07:00
John Keiser	cd6f204c77	Move write_tape() to stage 2 code	2020-05-11 06:09:48 -07:00
John Keiser	269131ed21	Move on_number_* to stage 2 code	2020-05-11 06:04:54 -07:00
John Keiser	65d784e88e	Move on_start/end_string to stage 2 code	2020-05-11 05:49:40 -07:00
John Keiser	35afb6cae0	Move on_error, on_success to stage 2 code	2020-05-11 05:46:18 -07:00
John Keiser	4f25b6ac0c	Move on_end_* to stage 2 code	2020-05-11 05:34:49 -07:00
John Keiser	3d5ed1a7e3	Move on_start_* to stage 2 code	2020-05-11 05:30:35 -07:00
John Keiser	a03115a4a6	Move end_scope to stage 2 code	2020-05-11 05:24:12 -07:00
John Keiser	7219d28a31	Call end_scope directly from stage 2 code	2020-05-11 05:20:04 -07:00
John Keiser	0875bce68f	Don't pass depth to on_end_*	2020-05-11 05:15:39 -07:00
John Keiser	54fe302907	Don't pass depth to end_scope	2020-05-11 05:06:41 -07:00
John Keiser	edaa8f811f	Move on_start_* depth management to stage 2 code	2020-05-11 05:03:25 -07:00
John Keiser	2c8fd109de	Move increment_count to stage 2	2020-05-11 04:58:50 -07:00
John Keiser	16d88cc095	Don't pass depth to increment_count	2020-05-11 04:15:02 -07:00
Daniel Lemire	2a6e6b3dbd	Cleaning string_view (#872 ) * Cleaning string_view * Corrected typo * Alignment.	2020-05-10 16:05:52 -04:00
John Keiser	afb369950c	Disable Intellisense-only warnings in simdjson.h/cpp	2020-05-04 11:47:04 -07:00
John Keiser	1d06624d38	Unset /D_CRT_SECURE_NO_WARNINGS - Also localize DISABLE_DEPRECATED_WARNING so that we catch other deprecations	2020-05-04 11:35:05 -07:00
Pavel P	d40069a018	Disable deprecation warnings for VS builds fopen/getenv are standard c++ that are not deprecated.	2020-05-04 11:34:00 -07:00
Furkan Usta	e04cbd71d0	Only install singleheader/simdjson.h as part of the public API	2020-05-02 01:44:11 +03:00
Daniel Lemire	fc1ddcd2f8	Faster case-insensitive comparisons. (#837 ) * Faster case-insensitive comparisons.	2020-04-30 15:52:28 -04:00
Furkan Usta	73d7d704c1	CMake: Remove export_private_library Since we are exporting all the targets as part of the main simdjson target we do not need private exports anymore	2020-04-30 02:06:19 +03:00
Furkan Usta	eee07e6cfd	Use the same export name for all targets	2020-04-29 23:47:27 +03:00
Nong Li	0f9dbf84b7	Fix incorrect check for case insensitive key lookup (#824 )	2020-04-29 13:55:28 -04:00
Daniel Lemire	2a1f8fa8f1	Provides support for clang under Windows. (#817 )	2020-04-27 22:09:27 -04:00
John Keiser	49da7e74cd	usage.md -> basics.md (#823 )	2020-04-27 16:03:19 -04:00
PavelP	0514588175	Improves clang-cl build with Visual Studio (#809 )	2020-04-27 08:59:32 -04:00
Daniel Lemire	b99a7344c9	missing spaces.	2020-04-25 22:26:18 -04:00
Daniel Lemire	f3ac0be0e6	Merge branch 'master' of github.com:simdjson/simdjson	2020-04-23 18:39:56 -04:00
Daniel Lemire	18c9468af5	Fixed typo	2020-04-23 18:39:32 -04:00
ostri	d4239aaa8f	default initialisation (#779 ) * padded_string.* default initialisation parsedjson_iterator - copy constructor; depth_index not necessary	2020-04-23 18:32:11 -04:00
Daniel Lemire	4d0c7d706d	Warn 32-bit users about their doom. (#783 )	2020-04-23 16:01:19 -04:00
Daniel Lemire	382392e03b	This should enable -Weffc++ (#777 ) * Enabling -Weffc++	2020-04-23 13:03:04 -04:00
Daniel Lemire	0d1c574cb1	A few more changes... (#775 ) * More nitpicking.	2020-04-23 11:36:52 -04:00

685 Commits