Daniel Lemire
ab6d4871d8
Adding haswell amal. tests ( #447 )
...
* Adding an extra test.
* Disabling the AVX-accelerated minifier.
* Updating amalgamation.
2020-01-15 19:49:11 -05:00
Daniel Lemire
f87e64f988
Add option to make buffers hot and remove recent benchmarking changes ( #443 )
...
* This reverts the code to how it was prior to the silly "run two stages" routine and instead
adds an option to benchmark the code over hot buffers. It turns out that allocating the pages
can be expensive when the files are large.
2020-01-15 19:48:00 -05:00
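The "hot buffers" idea in the commit above (#443) can be illustrated with a minimal sketch. It assumes the cost being avoided is the operating system mapping pages on first touch; the helper name `touch_pages` and the 4096-byte default page size are hypothetical and not taken from the simdjson benchmark code.

```cpp
#include <cstddef>

// Hypothetical helper, not part of simdjson.
// Writes one byte into every page of a freshly allocated buffer so that the
// operating system has mapped all pages before the timed region starts.
// The 4096-byte default is an assumption; a real benchmark would query the
// platform for its page size. Call this before filling the buffer, since the
// touched bytes are overwritten with zero.
// Returns the number of pages touched.
std::size_t touch_pages(char *buffer, std::size_t length,
                        std::size_t page_size = 4096) {
  if (buffer == nullptr || page_size == 0) {
    return 0;
  }
  // volatile keeps the compiler from discarding the stores.
  volatile char *p = buffer;
  std::size_t touched = 0;
  for (std::size_t i = 0; i < length; i += page_size) {
    p[i] = 0;
    ++touched;
  }
  return touched;
}
```

A benchmark would call `touch_pages` once on the output buffer, then start the timer, so that page allocation is not counted against the parser.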
Daniel Lemire
27861f6358
SIMDJSON_PADDING is now an absolute constant. This is temporary since
...
padding should go away once https://github.com/lemire/simdjson/issues/174
is resolved.
2020-01-15 15:49:50 -05:00
Daniel Lemire
f611b65bc0
This updates the minifier. ( #446 )
2020-01-15 13:45:32 -05:00
Daniel Lemire
2dc61fbdc4
Update README.md
2020-01-15 11:57:48 -05:00
Daniel Lemire
22be05400d
Update README.md
2020-01-15 10:18:03 -05:00
Daniel Lemire
a9f501fe7d
Update README.md
2020-01-14 11:50:14 -05:00
Daniel Lemire
e9077370ec
Update README.md
2020-01-14 10:42:04 -05:00
Daniel Lemire
a804351a76
I think that i and idx should be size_t (64-bit). ( #438 )
2020-01-13 17:42:52 -05:00
Daniel Lemire
f97b655f02
Instead of emulating the whole parsing as stage 1 + stage 2, let us benchmark the real thing. ( #441 )
...
* Instead of emulating the whole parsing as stage 1 + stage 2, let us
benchmark the real thing.
* Adding explicit constructor.
* Adding warning to the benchmark user.
* Making re-running optional.
2020-01-11 10:14:22 -05:00
Daniel Lemire
1498b78342
Minor simplifications.
2020-01-10 14:07:57 -05:00
dbj
85e84fc1fa
improved string padded ( #440 )
...
* dirent portable latest version
* improved
std::string argument passed by const reference
ctor added with std::string_view argument
`allocate_padded_buffer()` moved here with **optional** check on `length < 1`
* allocate_padded_buffer moved to padded_string.h
2020-01-10 10:15:48 -05:00
Daniel Lemire
833e5d8bf1
Update README.md
2020-01-09 17:01:41 -05:00
UKABUER
773883c486
Fix #420 ( #421 )
2020-01-09 09:56:43 -05:00
Daniel Lemire
6e5e0278c2
Exposing bug #420
2020-01-09 09:55:54 -05:00
Daniel Lemire
951c4bedf8
Simpler jsonstream ( #436 )
...
* One simplification.
* Removing untested functions.
2020-01-07 19:10:02 -05:00
dbj
9842e1f9d0
dirent portable latest version ( #435 )
2020-01-07 18:41:57 -05:00
Daniel Lemire
4c0c1c9830
Updating a comment.
2020-01-06 22:01:23 -05:00
Daniel Lemire
6706d6053e
Upgrading gcc to gcc 8
2020-01-06 18:28:29 -05:00
Daniel Lemire
0a874a5063
Some tuning
2020-01-06 11:41:07 -05:00
dbj
2caa6e3370
C++ language version detection ( #418 )
...
* added visual_studio folder, where the local artefacts generated by cmake for Visual Studio are kept
* C++ version detection
2020-01-06 11:38:09 -05:00
Daniel Lemire
a9e990251d
removing left over debug
2020-01-04 12:50:04 -05:00
Daniel Lemire
7bde23590a
Debugging jsonstream ( #432 )
...
Fixes #424 (and provides tests for it), as well as #401
2020-01-03 22:22:47 -05:00
Daniel Lemire
5042dd52ce
This is implementing @jkeiser optimization idea. ( #431 )
2020-01-03 09:21:36 -05:00
John Keiser
3b9e6bff3c
Print stage 2 information in feature benchmarker
2020-01-02 17:23:21 -07:00
Daniel Lemire
a2d05b21ff
Merge branch 'master' of github.com:lemire/simdjson
2020-01-02 15:27:00 -05:00
Daniel Lemire
f4f5f670a2
Better documentation of the padding.
2020-01-02 15:25:03 -05:00
John Keiser
165e23773f
Refactor stage 2 into structural_parser class
2020-01-02 13:12:22 -07:00
Daniel Lemire
8dbb598057
Adding another test
2020-01-02 14:22:43 -05:00
Daniel Lemire
ba9dc12164
Adding tests motivated by https://github.com/lemire/simdjson/pull/430
2020-01-02 14:20:51 -05:00
Paul Dreik
399d08c86c
use unique_ptr in class parsedjson ( #417 )
...
* refactor parsedjson to use unique_ptr instead of owning raw pointer
* fix a potential undefined behavior
* output only first cpu in /proc/cpuinfo
2019-12-31 14:31:45 -05:00
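The refactoring described in the commit above (#417) replaces an owning raw pointer with `std::unique_ptr`. The following is a minimal sketch of that pattern only; the class name `tape_buffer` and its members are hypothetical and do not reproduce the actual parsedjson class.

```cpp
#include <cstddef>
#include <cstdint>
#include <memory>
#include <new>
#include <utility>

// Hypothetical stand-in for a class that owns a heap-allocated array.
// With an owning raw pointer, the class would need a hand-written destructor
// and would have to guard against double deletion on copy. With
// std::unique_ptr, the array is released automatically and the class becomes
// movable but not copyable by default.
class tape_buffer {
public:
  // Allocates `capacity` 64-bit words; returns false if allocation fails.
  // Any previously held array is released by reset().
  bool allocate(std::size_t capacity) {
    tape_.reset(new (std::nothrow) std::uint64_t[capacity]);
    return tape_ != nullptr;
  }

  // Non-owning access to the storage; nullptr if nothing is allocated.
  std::uint64_t *data() { return tape_.get(); }

private:
  std::unique_ptr<std::uint64_t[]> tape_;
};
```

Moving a `tape_buffer` transfers ownership of the array and leaves the source holding `nullptr`, so no manual cleanup code is required.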
Daniel Lemire
6f799435b6
Removing commented out stuff.
2019-12-30 22:21:04 -05:00
Daniel Lemire
3d14154a29
Merge branch 'master' of github.com:lemire/simdjson
2019-12-30 15:08:55 -05:00
Daniel Lemire
7e331957c4
Adding ruby script.
2019-12-30 15:07:42 -05:00
Paul Dreik
4da06830f1
add new fuzzers for print_json and dump_raw_tape ( #416 )
...
after looking at the coverage report available at
https://storage.googleapis.com/oss-fuzz-coverage/simdjson/reports/20191222/linux/src/simdjson/report.html
2019-12-27 13:42:44 -05:00
Paul Dreik
27293cc1c1
don't add integers to string literals ( #410 )
...
* string literal + integer means unintended and incorrect pointer arithmetic.
This fixes a clang warning. The bug could not be triggered in practice, because
that requires the option string given to getopt to contain an option that is
not covered by the cases in the switch.
* handle review comment
2019-12-24 20:19:22 +01:00
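The pitfall named in the commit above (#410) is that adding an integer to a string literal offsets a pointer instead of appending a character. A minimal sketch of the safe form follows; the function name `unknown_option_message` and the message text are hypothetical, not taken from the simdjson tools.

```cpp
#include <string>

// Hypothetical helper, not part of simdjson.
// Builds a diagnostic for an option character returned by getopt.
std::string unknown_option_message(int opt) {
  // Problematic form (left commented out on purpose):
  //   return "unrecognized option: " + opt;
  // Here the literal decays to const char*, and adding `opt` moves that
  // pointer `opt` characters forward. For a typical option character this
  // points far past the end of the literal, which is undefined behavior.

  // Safe form: make the left operand a std::string so that operator+
  // performs concatenation, and convert the int to the character it holds.
  return std::string("unrecognized option: ") + static_cast<char>(opt);
}
```

For example, `unknown_option_message('x')` yields the text `unrecognized option: x`.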
Daniel Lemire
2caac2b218
Adding two test files from https://github.com/lemire/simdjson/pull/404/files ( #415 )
2019-12-24 14:04:11 -05:00
Nexus Web Development
0dc80ccf21
Update README.md : Simple typo ( #412 )
...
Exemple to example.
2019-12-23 11:35:42 -05:00
Nexus Web Development
f2b48ede4c
Update JsonStream.md : simple typo ( #413 )
...
Exemple to example.
2019-12-23 11:35:09 -05:00
Stenal P Jolly
6cefdc2f5c
Fix README typo ( #407 )
2019-12-23 11:08:55 -05:00
Paul Dreik
29e78413fe
output failed fuzz cases ( #409 )
...
this is to get more debug output from cases like this:
c9cd8e6211/checks
?check_suite_id=351526761
https://pipelines.actions.githubusercontent.com/gFSIASDqcDhMdHkcuMJEdsUcdOsOrVBr8d56BjAkszcoMkibGp/_apis/pipelines/1/runs/274/signedlogcontent/3?urlExpires=2019-12-21T18%3A50%3A08.3387613Z&urlSigningMethod=HMACV1&urlSignature=DVF3u4jEs81xJP6Hmi8LLQlLOztJke8MsP62J0MXruQ%3D
which ends with:
2019-12-20T23:08:42.6830393Z ../src/generic/numberparsing.h:243:31: runtime error: 1e+311 is outside the range of representable values of type 'double'
2019-12-20T23:08:42.6836790Z SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior ../src/generic/numberparsing.h:243:31 in
2019-12-20T23:08:42.6837715Z MS: 2 InsertRepeatedBytes-CrossOver-; base unit: 47cf79b80bc84f2ec8b39c1c73daa6c1222ef624
2019-12-20T23:08:42.6866416Z artifact_prefix='./'; Test unit written to ./crash-3fa2f47d980d4bf5812af18fb0d0a0ce9b5cc65a
2019-12-20T23:08:42.6999448Z ##[error]Process completed with exit code 1.
2019-12-20T23:08:42.7030863Z Cleaning up orphan processes
The problem is that github actions does not store artifacts on failure.
2019-12-21 20:32:32 +01:00
Paul Dreik
8192e63a4b
update apt repos before installing ( #408 )
...
This should hopefully resolve the recent errors in the github action
for running the fuzzers.
2019-12-21 19:23:51 +01:00
Daniel Lemire
b2ebdb0d07
I think we can align the numbers better (so it is prettier). ( #399 )
...
* I think we can align the numbers better (so it is prettier).
* Remove space before %, align third line better
Co-authored-by: John Keiser <john@johnkeiser.com>
2019-12-20 19:58:49 -05:00
dbj
9c3828fefe
STRINGIFY implemented ( #402 )
...
* STRINGIFY implemented
* SIMDJSON_THREADS_ENABLED def/undef
2019-12-20 07:57:00 -05:00
John Keiser
60916318f7
Show miss rate, make it more accurate
2019-12-18 14:38:25 -08:00
John Keiser
d7c83397e4
lookup+cont-check algorithm
2019-12-18 14:37:21 -08:00
Daniel Lemire
1d621bba37
Being more explicit about EMPTY errors.
2019-12-18 14:39:48 +00:00
John Keiser
e2f349e7bd
Measure impact of utf-8 blocks and structurals per block directly
2019-12-17 11:41:13 -08:00
Daniel Lemire
102262c7ab
Fixing issue386 ( #396 )
...
* Creating arch-specific bitmanipulation.h files.
* Improving system and compiler portability.
* We want to allow trailing_zeroes on zero inputs.
2019-12-16 19:09:18 -05:00
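The last bullet of the commit above (#396) asks for `trailing_zeroes` to accept zero inputs. Compiler intrinsics such as GCC's `__builtin_ctzll` leave the result undefined for zero, so a portable fallback has to choose a value. The sketch below is a hypothetical fallback, not the simdjson implementation, and the choice of returning 64 for zero is an assumption.

```cpp
#include <cstdint>

// Hypothetical portable fallback, not the simdjson implementation.
// Counts the zero bits below the lowest set bit of `x`.
// Assumption: a zero input returns 64 (the word width), which keeps the
// function well defined on every input.
int trailing_zeroes_portable(std::uint64_t x) {
  if (x == 0) {
    return 64;
  }
  int count = 0;
  // Shift right until the lowest set bit reaches position 0.
  while ((x & 1u) == 0) {
    x >>= 1;
    ++count;
  }
  return count;
}
```

An architecture-specific version would typically call an intrinsic on the nonzero path and keep the explicit zero check only where the intrinsic does not define that case.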
Daniel Lemire
f02babe427
Adding analysis by @sebpop from https://github.com/lemire/simdjson/pull/391#issuecomment-565551462
2019-12-13 13:39:15 -05:00