simdjson

Commit Graph

Author	SHA1	Message	Date
John Keiser	f336103f63	Convert tools/docs/benchmarks to bool get() idiom	2020-06-20 17:55:46 -07:00
John Keiser	56e2b38048	Add bool result from tie()/get(), get<T>(T&,error_code&)	2020-06-20 17:55:46 -07:00
John Keiser	7339f67dd7	Merge pull request #462 from simdjson/jkeiser/if-backslash Wrap backslash processing in a branch	2020-06-17 07:07:58 -07:00
Daniel Lemire	7ea05d038e	New API traversal tests. (#931 ) Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-15 13:15:52 -04:00
Daniel Lemire	33930ff046	Adding link.	2020-06-15 13:07:53 -04:00
John Keiser	fd44c2a2ff	Merge pull request #927 from simdjson/dlemire/exposingthestringminifier Exposing the string minifier.	2020-06-13 07:47:20 -07:00
John Keiser	a86a82b39c	Rename minify class to minifier so the minify() method is cleared up	2020-06-12 17:05:25 -07:00
Daniel Lemire	d1a54249e7	New API traversal tests.	2020-06-12 17:42:57 -04:00
Daniel Lemire	4dfbf98e4e	Using a worker instead of a thread per batch (#920 ) In the parse_many function, we have one thread doing the stage 1, while the main thread does stage 2. So if stage 1 and stage 2 take half the time, the parse_many could run at twice the speed. It is unlikely to do so. Still, we see benefits of about 40% due to threading. To achieve this interleaving, we load the data in batches (blocks) of some size. In the current code (master), we create a new thread for each batch. Thread creation is expensive so our approach only works over sizeable batches. This PR improves things and makes parse_many faster when using small batches. This fixes our parse_stream benchmark which is just busted. This replaces the one-thread per batch routine by a worker object that reuses the same thread. In benchmarks, this allows us to get the same maximal speed, but with smaller processing blocks. It does not help much with larger blocks because the cost of the thread create gets amortized efficiently. This PR makes parse_many beneficial over small datasets. It also makes us less dependent on the thread creation time. Unfortunately, it is going to be difficult to say anything definitive in general. The cost of creating a thread varies widely depending on the OS. On some systems, it might be cheap, in others very expensive. It should be expected that the new code will depend less drastically on the performances of the underlying system, since we create juste one thread. Co-authored-by: John Keiser <john@johnkeiser.com> Co-authored-by: Daniel Lemire <lemire@gmai.com>	2020-06-12 16:51:18 -04:00
Daniel Lemire	1b6258ec8c	Added std::minify	2020-06-12 16:37:41 -04:00
John Keiser	7c6723d912	Print progress bar even if there is only one file	2020-06-12 10:01:19 -07:00
Daniel Lemire	a6e4933d93	Exposing the string minifier.	2020-06-11 13:07:18 -04:00
John Keiser	ae6dddfff4	Merge pull request #903 from simdjson/jkeiser/dom-parser-implementation Move parser state to implementation-specific class	2020-06-04 13:09:57 -07:00
John Keiser	1aab4752e2	Store all parser state in the implementation	2020-06-01 12:15:54 -07:00
John Keiser	6a71b24495	Reuse stored buf and len from parser	2020-06-01 12:14:09 -07:00
John Keiser	a3a9bde83e	Move DOM parsing into concrete interface implementation	2020-06-01 12:14:09 -07:00
Daniel Lemire	2fe2dd170b	The "competition tests" are being made portable (#907 ) * More portable competition * This will enable SIMDJSON_COMPETITION everywhere by default. * Minor fixes	2020-05-31 20:34:06 -04:00
John Keiser	e6c9dfbd91	Make include files more fine-grained	2020-05-19 14:42:04 -07:00
Daniel Lemire	8927a0561f	Obvious fix. (#885 )	2020-05-14 20:39:44 -04:00
John Keiser	8c600ca553	Make benchfeatures work again	2020-05-05 09:39:29 -07:00
John Keiser	5312fd30e5	Fix CRT_SECURE warnings in clang	2020-05-04 11:36:00 -07:00
Furkan Usta	af968c5b44	Merge branch 'master' of github.com:simdjson/simdjson into cmake-flags	2020-05-03 02:12:23 +03:00
Furkan Usta	293c104cc4	CMake: Separate public and private compilation flags simdjson-internal-flags for macros and warnings simdjson-flags for pthread, sanitizer, and libcpp	2020-05-02 04:08:47 +03:00
Furkan Usta	5a3035bb72	Propagate some CMake variables to checkperf Although it passes user-defined options, if the project is build in Debug mode or with Clang (since CXX defaults to gcc on Linux) results can flactuate	2020-05-02 02:40:13 +03:00
Daniel Lemire	fa4ce6a8bc	There is confusion between gigabytes and gigibytes. Let us standardize throughout. (#838 ) * There is confusion between gigabytes and gigibytes. * Trying to be consistent.	2020-05-01 12:16:18 -04:00
John Keiser	c3dec1a5ea	Default SIMDJSON_GOOGLE_BENCHMARKS to ON.	2020-04-29 15:21:43 -07:00
Daniel Lemire	f0d5337818	Adding independent benchmarks using Google Benchmark (#826 ) * Adding independent benchmarks using Google Benchmark	2020-04-29 13:53:54 -04:00
Daniel Lemire	4cd9de5c37	This will change the default of the parse benchmark so that it work over hot buffers (#827 ) * This will change the default of the parse benchmark so that it work over hot buffers by default, thus omitting memory allocation as part of the benchmark. * Everyone should be using '-H' from now on.	2020-04-29 13:43:27 -04:00
John Keiser	92d7af0881	Don't include benchmark overhead in documents/s	2020-04-28 13:15:01 -07:00
John Keiser	0e6ea76e88	Make checkperf work on Windows (#799 ) * Make command line arguments work for Windows * Run checkperf on Windows	2020-04-27 14:20:05 -04:00
Daniel Lemire	0d1c574cb1	A few more changes... (#775 ) * More nitpicking.	2020-04-23 11:36:52 -04:00
Daniel Lemire	e030f02776	Merge branch 'master' into jkeiser/wconversion	2020-04-22 22:03:34 -04:00
Daniel Lemire	80dbf9a32a	We should not enable checkperf on anything but Linux. (#762 )	2020-04-22 21:54:03 -04:00
John Keiser	d4a37f6ef5	Enable conversion warnings on Linux and Windows	2020-04-22 14:21:30 -07:00
John Keiser	3091e2dc0e	Add fallback, westmere and unthreaded checkperf	2020-04-20 11:18:40 -07:00
John Keiser	fc50a36cc5	Make checkperf build master with the same options	2020-04-20 11:14:46 -07:00
John Keiser	9bf9fba2ec	Add checkperf to cmake	2020-04-20 11:14:46 -07:00
John Keiser	289cc3e7a0	Treat warnings as errors during compilation	2020-04-15 19:59:38 -07:00
John Keiser	7480b87e07	Merge pull request #693 from simdjson/jkeiser/cmake-quickstartcpp Add C++11 tests to cmake	2020-04-15 19:53:14 -07:00
Daniel Lemire	befa6423be	This massively improves the performance of tight loops relying on a type() call. (#721 ) * This massively improves the performance of tight loops relying on a type() call. * Adding a few more benchmarks	2020-04-15 20:45:40 -04:00
John Keiser	09cf18a646	Add C++11 tests to cmake - Add simdjson-flags target so callers don't have flags forced on them	2020-04-15 17:26:25 -07:00
Daniel Lemire	326c175dcb	Massive performance boost for get<double>. (#719 ) * Massive performance boost for get<double>.	2020-04-15 20:09:45 -04:00
Daniel Lemire	6d7c77ddc1	Let us try to check with the exceptions disabled. (#707 ) * Tweaking code so that we can run all tests with exceptions off. * Removing SIMDJSON_DISABLE_EXCEPTIONS	2020-04-15 16:45:36 -04:00
Daniel Lemire	b523c43927	Can we provide a size() function to arrays and objects? (eager approach) [TO BE MERGED] (#690 ) * This is an implementation of "size()" for arrays and objects. * Adding benchmark * Adding a size() remark in the documentation. * Extending size() to result types.	2020-04-15 10:15:48 -04:00
Paul Dreik	75545ff70d	ref qualify parser methods to avoid use of dangling objects (#703 ) To avoid using data belonging to a temporary, the parse functions are ref qualified to get a compile error if used on an rvalue. See https://github.com/simdjson/simdjson/issues/696 Compilation tests are also added, to make sure bad usage fails to compile. Reviewed by jkeiser.	2020-04-15 09:57:52 +02:00
John Keiser	7317fe1440	Don't reinitialize submodules Add ability to turn competitive benchmarks off (no need for submodules)	2020-04-09 08:52:29 -07:00
John Keiser	7b58fea911	Add benchmark competitions to cmake	2020-04-09 08:52:29 -07:00
John Keiser	6dabfa176a	Add competition libraries	2020-04-09 08:52:29 -07:00
John Keiser	3dcc188d93	Add more tests to cmake	2020-04-08 14:52:56 -07:00
John Keiser	10b7556a37	Specify cmake tests, benchmarks and tools idiomatically	2020-04-08 14:52:56 -07:00

1 2 3 4 5

232 Commits