Update iterate_many.md

This commit is contained in:
Daniel Lemire 2021-08-20 14:09:30 -04:00 committed by GitHub
parent d3f0e2afb3
commit ca3f3cc49d
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
1 changed files with 9 additions and 3 deletions

View File

@ -1,8 +1,14 @@
iterate_many iterate_many
========== ==========
An interface providing features to work with files or streams containing multiple small JSON documents. An interface providing features to work with files or streams containing multiple small JSON documents. Given an input such as
As fast and convenient as possible. ```JSON
{"text":"a"}
{"text":"b"}
{"text":"c"}
...
```
... you want to read the entries (individual JSON documents) as quickly and as conveniently as possible. Importantly, the input might span several gigabytes, but you want to use a small (fixed) amount of memory. Ideally, you'd also like the parallelize the processing (using more than one core) to speed up the process.
Contents Contents
-------- --------