John Keiser
b8426584fc
Make field lookup order-insensitive.
2020-12-19 13:57:29 -08:00
Daniel Lemire
6a2435ab32
Merge branch 'master' into dlemire/prune_validate_utf8
2020-12-19 10:46:14 -05:00
Daniel Lemire
85001c55fb
Fixing UTF-8 validation under PPC64 ( #1346 )
...
* Entering a new UTF-8 test
* Maybe *I* had a bug in the tests.
* Replacing nulls with 1s.
* Let us try to be more verbose.
* Return 0.
* Fixing issue.
* Adding puzzler scenario.
* Fixing PPC64
Co-authored-by: Daniel Lemire <dlemire@rcs-power9-talos>
2020-12-19 10:42:27 -05:00
Daniel Lemire
8c4cf7757f
I meant to remove these files a while ago.
2020-12-18 15:29:08 -05:00
John Keiser
f785f76d98
Merge pull request #1342 from simdjson/jkeiser/iter-safety
...
Safety: assert in debug mode when on demand values are used out of order
2020-12-16 15:58:29 -08:00
Daniel Lemire
c90ee57203
This might make the fuzzer error debuggable. ( #1345 )
2020-12-16 18:31:29 -05:00
John Keiser
d670906ff3
Don't keep a separate at_start boolean in object
2020-12-15 11:29:31 -08:00
Daniel Lemire
c578f63f34
Fixing issue 1337 ( #1338 )
...
* Fixing issue 1337
* This test should always run.
* Removing bad attribute access.
2020-12-14 16:20:36 -05:00
Daniel Lemire
9a404bdcdc
Reenabling the optimized kernels (main branch). ( #1344 )
...
* Reenabling the optimized kernels (main branch).
* Defining SIMDJSON_CAN_ALWAYS_RUN_PPC64 and SIMDJSON_CAN_ALWAYS_RUN_ARM64
* Adding the bad UTF8 string from the fuzzer.
* Taking into account John's comments.
* Bumping the lib version.
* Update CMakeLists.txt
2020-12-14 14:28:45 -05:00
friendlyanon
91b07ba075
Allow build without download ( #1334 )
...
* Add option for downloading dependencies
* Format the boost json import code
Co-authored-by: friendlyanon <friendlyanon@users.noreply.github.com>
2020-12-13 19:07:29 -05:00
John Keiser
f9c6dedca1
Add test for out-of-order parse asserts
2020-12-13 13:39:47 -08:00
John Keiser
73c7b7db3d
Safety: check index of scalar values before use
2020-12-11 11:57:09 -08:00
John Keiser
6b02b06581
Merge pull request #1330 from simdjson/jkeiser/depth-tracking
...
Permit partial iteration in On Demand
2020-12-09 12:40:58 -08:00
Paul Dreik
5f7b2bac12
fuzz on power ( #1326 )
...
* first try
* use ubuntu 20.04, do the fuzzing
* new try at power fuzz
* hard code clang version
* setting env variables does not seem to work
* use fuzzer-no-link
* switch to Debian Buster for power fuzz
* use non-sanitizer build for power
* me not like yaml
* fix bad syntax
2020-12-07 18:12:36 -05:00
John Keiser
806cb39103
Clean up some more benchmark/test code
2020-12-07 13:44:58 -08:00
John Keiser
2eaeac53e4
Revamp design documentation to match new design
2020-12-07 13:09:44 -08:00
John Keiser
3baba73cf5
Don't call functions in assume (VS2019 Clang doesn't like it)
2020-12-07 11:17:24 -08:00
John Keiser
db6bf15361
Allow all array/object/etc. to be copied
2020-12-06 16:29:53 -08:00
John Keiser
3aa3175378
Add tests for recovering from partial child iteration
2020-12-06 16:19:06 -08:00
John Keiser
4c63956624
Enable object["x"]["y"]
2020-12-06 15:23:52 -08:00
John Keiser
c89647af9e
Make all values use same underlying iterator
2020-12-06 15:23:52 -08:00
John Keiser
1303a88769
Unconditionally track depth
2020-12-06 15:23:52 -08:00
John Keiser
aa1eabbb56
Add benchmark that stops early
2020-12-06 15:23:51 -08:00
Daniel Lemire
cbacec0760
Releasing 0.7.0.
2020-12-04 18:56:15 -05:00
0xflotus
82fcf141f7
fix: small typo error ( #1328 )
2020-12-03 13:16:45 -05:00
Daniel Lemire
aaa12c8fb6
more thread testing ( #1324 )
...
* Adding a few tests.
* More tests.
* Trailing.
* More links/documentations.
2020-12-02 11:33:36 -05:00
Daniel Lemire
9b5150daef
Fixing issue 1247 ( #1325 )
2020-12-01 19:09:55 -05:00
Daniel Lemire
9304d88920
Prototype test for issue 1299: using parse_many, find the location of the end of the last document ( #1301 )
...
* Prototype test for issue 1299.
* This improves the documentation.
* Removing trailing white spaces.
* Removing trailing spaces
* Trailing.
2020-12-01 15:59:20 -05:00
Paul Dreik
725ca010e7
add ndjson fuzzer ( #1304 )
...
* add ndjson fuzzer
* reproduce #1310 in the newly added unit test
Had to replace the input, because:
1)
the fuzzer uses the first part of the input to determine
the batch_size to use, so that has to be cut off
2)
the master now protects against low values of batch_size
I also made the test not return early, so the error is triggered.
2020-12-01 15:58:41 -05:00
Daniel Lemire
59c857e969
Merge branch 'master' of github.com:simdjson/simdjson
2020-12-01 15:31:44 -05:00
Daniel Lemire
45c9136443
Fixed odd issue.
2020-12-01 15:31:24 -05:00
Daniel Lemire
bc4087ac96
Hunting for data races in document_stream ( #1323 )
...
* Adding matching tests.
* Actually adding thread testing actions.
2020-12-01 12:40:15 -05:00
Daniel Lemire
d9c4191e8a
This should fix an issue we have with unclosed strings with document_stream ( #1321 )
...
* This should fix an issue we have with unclosed strings.
* Patching the fallback kernel.
* Better guarding the code.
* Patching the fallback.
2020-11-30 13:43:57 -05:00
Daniel Lemire
3fa40b8dc2
Adding an example corresponding to issue 1316 (documentation enhancement) ( #1317 )
...
* Adding an example.
* Updated other doc file.
* Trying to take into account @jkeiser's comments.
* Some people prefer empty final lines.
2020-11-27 17:40:29 -05:00
Daniel Lemire
dc69bc28ae
Trying to verify recent document stream issues. ( #1318 )
...
* Trying to verify recent document stream issues.
* Adding another one.
* More thorough tests.
* Removing trailing spaces.
* Working toward exposing some issues.
* Tweaking.
2020-11-27 17:04:10 -05:00
Daniel Lemire
53577f11e1
Explicit worker disposal.
2020-11-27 15:39:43 -05:00
Daniel Lemire
1abb64f6f6
This fixes issue 1302 ( #1303 )
...
* This verifies issue 1302
* I believe that this fixes the issue.
* Let us do it differently.
* Better comments.
2020-11-27 14:19:36 -05:00
Mikhail Matrosov
b442b29ccf
Mark comparison operators as const ( #1320 )
...
Co-authored-by: Mikhail Matrosov <mikhail.matrosov@aimtech.com>
2020-11-27 14:18:42 -05:00
Daniel Lemire
5609851747
This will guard the batch_size so it cannot be so low as to accidentally burn through your CPU. ( #1319 )
...
* This will guard the batch_size so it cannot be so low as to accidentally burn through your CPU.
2020-11-27 09:37:54 -05:00
Paul Dreik
68a8004518
don't memcpy after failed alloc ( #1315 )
...
Should fix https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=27675
2020-11-21 07:52:24 +01:00
Paul Dreik
0b39e3a6cf
add fuzzer for padded_string ( #1312 )
...
This also fixes an overflow problem.
2020-11-19 16:51:56 +01:00
Paul Dreik
924ad2d592
when amalagamating, replace the SIMDJSON_IMPLEMENTATION ( #1306 )
2020-11-17 20:50:49 +01:00
Paul Dreik
5202d07a77
add script and instructions for minimizing and cleansing fuzz crashes ( #1305 )
2020-11-17 20:49:29 +01:00
friendlyanon
b632107a7a
Use the org's own PPA for CMake ( #1295 )
...
Co-authored-by: friendlyanon <friendlyanon@users.noreply.github.com>
2020-11-10 13:55:59 -05:00
Paul Dreik
f62ca21dd1
enable boost json ( #1292 )
...
* bump boost.json and see if it works in simdjson CI
* enable boost json
* clean up
* add boost json to deps
* use boost if std::string_view is available
* add build with c++20
* use docker image which has the proper libc++ installed
2020-11-10 13:55:04 -05:00
friendlyanon
553befa012
Disable Github CI in feature branches ( #1298 )
...
Forks that would like to contribute via PRs from feature branches
needlessly run CI on those branches on top of the PRs.
This is a waste of resources.
Co-authored-by: friendlyanon <friendlyanon@users.noreply.github.com>
2020-11-10 13:52:58 -05:00
Paul Dreik
def624a50c
update version tag ( #1297 )
2020-11-08 10:17:06 -05:00
Daniel Lemire
268f26a84f
Tweak of the documentation.
2020-11-07 16:33:12 -05:00
Paul Dreik
d9456b3030
fix numeric_limits usage problem on windows caused by the max macro ( #1293 )
...
See https://stackoverflow.com/questions/2789481/problem-calling-stdmax/2789509#2789509
2020-11-06 06:55:27 +01:00
Daniel Lemire
f195f849d0
Helping contributors. ( #1289 )
2020-11-05 10:27:35 -05:00