The upload to S3 completes, but the Celery task that moves the data from S3 to Arango fails. This is likely fixable by chunking the upload to Arango. Here is some relevant info from Jake:
So btw, chunking was implemented, but was reverted with this commit: 2d4685c
So here is what chunking looked like before:
    def put_rows(self, rows: List[Dict]) -> RowInsertionResponse:
        """Insert/update rows in the underlying arangodb collection."""
        errors = []

        # Limit the amount of rows inserted per request, to prevent timeouts
        for chunk in chunked(rows, DOCUMENT_CHUNK_SIZE):
            res = self.get_arango_collection().insert_many(chunk, overwrite=True)
            errors.extend(
                RowModifyError(index=i, message=doc.error_message)
                for i, doc in enumerate(res)
                if isinstance(doc, DocumentInsertError)
            )

        inserted = len(rows) - len(errors)
        return RowInsertionResponse(inserted=inserted, errors=errors)
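For reference, the `chunked` helper used above presumably behaves like `more_itertools.chunked` (yield fixed-size slices of an iterable). A minimal stand-in sketch, in case the original helper is not available (the name and signature here are assumptions):

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")

def chunked(iterable: Iterable[T], size: int) -> Iterator[List[T]]:
    """Yield successive lists of at most `size` items from `iterable`."""
    it = iter(iterable)
    # islice consumes up to `size` items per pass; an empty list ends the loop
    while chunk := list(islice(it, size)):
        yield chunk
```

One thing to watch if this gets reintroduced: in the reverted snippet, `i` from `enumerate(res)` is the index within the chunk, not within `rows`, so `RowModifyError.index` would need a per-chunk offset if callers expect indices into the full `rows` list.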