Skip to content

populate_metadata.py: add batches to write_to_omero (rebased onto metadata53)#5220

Closed
atarkowska wants to merge 10 commits intoome:metadata53from
atarkowska:rebased/metadata53/populate-batches
Closed

populate_metadata.py: add batches to write_to_omero (rebased onto metadata53)#5220
atarkowska wants to merge 10 commits intoome:metadata53from
atarkowska:rebased/metadata53/populate-batches

Conversation

@atarkowska
Copy link
Copy Markdown
Member

This is the same as gh-4754 but rebased onto metadata53.


What this PR does

The new QueryContext implementations in populate_metadata.py were not batching in their write_to_omero methods. For very large screens (e.g. idr0016):

  • BulkToMapAnnotationContext tends to hit MESSAGESIZEMAX limits and
  • DeleteMapAnnotationContext hits Caused by: java.io.IOException: Tried to send an out-of-range integer as a 2-byte value: 109734

Now both are done in batches of 1000.

Note: projections are also causing issues so batching is being added to invocations of projection as well.

Testing this PR

  1. A bulk annotation YML configuration is required. This is minimal possible via test/integration/metadata/test_populate.py.
  2. There should be no change in functionality and no overt slowdown in creation or deletion.

Related reading

cc: @eleanorwilliams

@atarkowska
Copy link
Copy Markdown
Member Author

--rebased-from #4754

@atarkowska atarkowska closed this Apr 4, 2017
@atarkowska atarkowska deleted the rebased/metadata53/populate-batches branch April 4, 2017 15:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants