Commit Graph

170 Commits

Author SHA1 Message Date
Daniel Lemire 18633e02d2 Added more thorough testing. 2018-10-23 20:19:33 -04:00
Daniel Lemire f0af315315 Elaborating on the limitations. 2018-10-23 14:03:30 -04:00
Daniel Lemire 9738af68c8 Fixing up the code point parsing. I think that what is there is now correct.
I believe that there was a case of early optimization.
2018-10-19 22:07:22 -04:00
Daniel Lemire 8315f4c888 Cleaning up the code. 2018-10-17 21:31:22 -04:00
Daniel Lemire 35381279c3 Maybe we can do away with the fast ASCII trick. 2018-10-17 21:05:38 -04:00
Daniel Lemire e517414080 We include character-encoding validation. 2018-10-17 19:22:09 -04:00
Daniel Lemire 355e5d2ed3 Checking for unescaped chars. 2018-10-17 15:08:49 -04:00
Daniel Lemire 9fc8d8444b We want to allow more than just arrays and objects, as per the JSON spec. 2018-10-17 13:57:42 -04:00
Daniel Lemire 57d7544b52 Adding... 2018-10-17 13:23:45 -04:00
Daniel Lemire 6cc5131f7a Adding an allparserscheckfile program. 2018-10-17 12:00:44 -04:00
Daniel Lemire 0453d546f8 Adding remark. 2018-10-16 20:43:37 -04:00
Daniel Lemire 8f704fdb7c Making the test tougher. 2018-10-08 16:32:54 -04:00
Daniel Lemire 7eb7cd265a We can now parse crazy things like pi to 100 digits. 2018-10-08 15:24:16 -04:00
Daniel Lemire 70c122074f Tests. 2018-10-08 14:41:36 -04:00
Daniel Lemire e2a3f751cf Counting numbers. 2018-10-04 09:48:00 -04:00
Daniel Lemire aeacd26366 Adding mispredicted branch counts. 2018-10-04 09:47:34 -04:00
Daniel Lemire 930533b6da Normalizing the number of cycles. 2018-10-04 09:33:41 -04:00
Daniel Lemire 2ad9891b66 I think NO_PDEP_PLEASE should be defined by default. It seems
to be generally better/faster. More instructions, but also
more instructions per cycle, so it ends up being a net win.
2018-10-03 21:42:27 -04:00
Daniel Lemire 431aed6e88 Going to "-O3" as the results are better with GCC. 2018-09-30 19:55:22 -04:00
Daniel Lemire 37adea9387 Adding a comment. 2018-09-30 14:44:30 -04:00
Daniel Lemire 314356d561 We have faster number parsing...? 2018-09-28 18:26:27 -04:00
Daniel Lemire 4ee515fa4b The new number parsing code is faster. 2018-09-28 14:45:34 -04:00
Daniel Lemire 57b840327f Faster number parsing? 2018-09-28 14:38:40 -04:00
Geoff Langdale 1e5d8ece56 Update API a bit 2018-09-28 14:59:30 +10:00
Geoff Langdale 89fd074ec9 Draft API. No implementation yet. 2018-09-28 14:55:57 +10:00
Geoff Langdale ceb55cc8db Pick new number parser as winner; move string parsing to own header 2018-09-28 14:27:48 +10:00
Daniel Lemire 577d6792f4 Integrating sajson. 2018-09-28 00:00:52 -04:00
Daniel Lemire ecbe1158ed Added testing for number parsing. 2018-09-27 20:26:27 -04:00
Daniel Lemire e4094afe08 Moving toward having number-parsing testing. 2018-09-27 17:38:15 -04:00
Daniel Lemire 7606a43aa9 Merge branch 'master' of github.com:lemire/simdjson 2018-09-26 23:36:19 -04:00
Daniel Lemire 1c8339297d With new number parser (faster!). Removing the dependency on the doubleconv library (which proves to be useless). 2018-09-26 23:35:33 -04:00
Geoff Langdale ccb3670c7c DEBUG mode fixes. 2018-09-27 13:10:33 +10:00
Daniel Lemire 6239b9c13e Overallocation 2018-09-26 14:20:28 -04:00
Geoff Langdale 9f91650e72 Remove old 4-stage path. 2018-09-26 15:22:55 +10:00
Geoff Langdale b9706d462c Minor cleanups. 2018-09-26 15:09:54 +10:00
Geoff Langdale 36fadde3c7 Minor twiddles. 2018-09-26 13:52:05 +10:00
Geoff Langdale 0d5797a827 Wrap the tape dump in debug code. 2018-09-26 13:28:16 +10:00
Geoff Langdale e9586b6b4d Very first char is considered to follow "whitespace" for pseudo-structural character detection purposes 2018-09-26 13:27:39 +10:00
Geoff Langdale 35503f1d8f Oops noisy. 2018-09-26 13:21:05 +10:00
Geoff Langdale fa6c8990ff Added a terrifying hack to append a idx-to-0-char to stage 2 output. 2018-09-26 13:20:08 +10:00
Geoff Langdale c4c51627d3 Fix compile - jsonparser needs to include unified header 2018-09-26 11:33:35 +10:00
Geoff Langdale 682c224d1a Merge branch 'master' of https://github.com/lemire/simdjson 2018-09-26 11:29:23 +10:00
Geoff Langdale b0c05c03cc Fix linkage between call sites and headers, add dump code, cleanup 2018-09-26 11:28:22 +10:00
Daniel Lemire dee1bbe54e Integrating the new 3-stage approach. 2018-09-25 17:26:58 -04:00
Daniel Lemire cb26dc9c7b Creating a unified binary 2018-09-25 17:20:54 -04:00
Geoff Langdale 555926849d Bug cleanup (many vestiges of old 32-bit tape stil there) and more encapsulation of tapes. 2018-09-25 16:24:39 +10:00
Geoff Langdale 8b2d00a337 Bug fix for ,] issue and cleanup. 2018-09-25 15:35:17 +10:00
Geoff Langdale 64d07cd04c Fix bug where strings were not parsed on 2nd and subsequent key:value pairs. 2018-09-24 15:16:22 +10:00
Geoff Langdale 77bfe6c984 Fix some bad messages and the failure to parse key strings. 2018-09-24 10:54:29 +10:00
Geoff Langdale 2a46b40457 Adding new stage34, a more straightforward replacement for stage 3 and 4 using a DFA and explicit stack 2018-09-24 10:44:05 +10:00