Our support for readable streams is now almost everywhere it can be in our library, which is awesome. But the way we use them is not the most efficient.
How we do it, using bucket.getFiles for example:
1. Make an API request to the server to get all of the files in a bucket
2. Receive a large JSON response containing all of the files' metadata
3. Map each item of the array to a transformed "File" object
4. Push all of the objects onto a readable stream that the user is reading from
5. If there's a "nextPageToken", repeat from step 1
How we could do it:
1. Make a streaming API request to the server to get all of the files in a bucket
2. Parse the JSON response in chunks as it arrives (see the sketch after this list)
3. Transform each returned file into a "File" object as it is parsed
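
To make the chunked-parsing step concrete, here is a minimal standalone sketch of JSONStream (same placeholder URL as the examples below; nothing here is specific to our library):

```js
var request = require('request');
var JSONStream = require('JSONStream');

// As the HTTP response body arrives, JSONStream plucks each element out of
// the top-level "items" array and emits it on its own, so the full JSON
// payload never has to sit in memory at once.
request.get('https://.../bucket/files')
  .pipe(JSONStream.parse('items.*'))
  .on('data', function(fileMetadata) {
    console.log(fileMetadata.name);
  });
```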
How we do it currently (simplified):
```js
// Assumed requires for the simplified example:
var request = require('request');
var through = require('through2');

bucket.getFiles = function() {
  var stream = through.obj();

  request.get('https://.../bucket/files', function(err, resp) {
    // A potentially huge JSON response exists in memory now
    resp.items.forEach(function(fileMetadata) {
      var file = new File(fileMetadata.name);
      file.metadata = fileMetadata;
      stream.push(file);
    });

    stream.push(null);
  });

  return stream;
};
```

How it would look (simplified):
```js
// Assumed requires for the simplified example:
var request = require('request');
var through = require('through2');
var JSONStream = require('JSONStream');

bucket.getFiles = function() {
  return request.get('https://...bucket/files')
    .pipe(JSONStream.parse('items.*')) // link to JSONStream below
    .pipe(through.obj(toFileObject));

  function toFileObject(obj, enc, next) {
    var file = new File(obj.name);
    file.metadata = obj;
    this.push(file);
    next();
  }
};
```

With the second example, each object is only in memory until it is flushed to the next handler in the user's own pipeline. At that point, the user is free to either let the objects buffer up until they have all arrived, or write them to a destination in chunks, so no extra memory piles up.
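
As a rough illustration of that consumer side (a hypothetical usage sketch of the streaming getFiles above):

```js
// Hypothetical consumer: each File is handled as soon as it is parsed,
// so memory stays flat no matter how many files the bucket holds.
bucket.getFiles()
  .on('data', function(file) {
    console.log(file.metadata.name);
  })
  .on('end', function() {
    console.log('All files received.');
  });
```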
* JSONStream: https://github.com/dominictarr/JSONStream