Metadata column column types and datasets (rebased onto metadata53) by atarkowska · Pull Request #5218 · ome/openmicroscopy

atarkowska · 2017-04-04T10:15:02Z

This is the same as gh-4637 but rebased onto metadata53.

This PR carries on from work by @emilroz (PR 120) which makes it possible to specify the column types for OMERO.tables when using populate_metadata.py to parse the csv files and adds support for datasets and projects as the target object for a metadata table.

Most methods for dataset loading and parsing were left unimplement. Now a `Dataset:`-style object can be passed to populate_metadata.py and images will be looked up by name. Note: there's a small bug with name lookup that will be corrected separately.

The assumptions for well/imaging naming in a plate or screen differ from those from image naming in a dataset since there's no unique way to reference an image in a dataset like there is well "A1" for example. This commit loosens some of those rules to allow image columns and image name columns to work together in the case of datasets. The assumption is that for population the ID of the image in a dataset won't be known. Instead names of images will be used as a unique identifier. Currently only a warning is issued if the name is not unique.

In general, populate_metadata.py looks to be in line for a refactoring. The number of if-clauses as well as the unhandled cases (like no catch-all for unknown targets in delete) is making this ever harder to work with. All tests passing.

In order to allow Projects to smartly handle multiple images with the same name (though not in the same dataset), the internals of ValueResolver have been hidden within a ValueWrapper class. ValueResolver chooses once which ValueWrapper to use internally after which the various if/then blocks based on target object are no longer necessary (needs further refactoring). There *are* still if/then blocks basked on column-type. These could use some cleaning but will likely remain to be necessary for multiple-dispatch style handling.

atarkowska · 2017-04-04T10:15:05Z

--rebased-from #4637

emilroz and others added 21 commits April 4, 2017 11:08

Add columns flag to the parser

d7c7925

Add parse columns and expand supported types

51f415d

Flake8

6ce3b0d

Add column type support to HeaderResolver

1a5a86d

Add column type support to ValueResolver

2a83bbc

Add column type support to ParsingContext

1cbd25f

Return long not int

f6384b5

Set the StringColumn size

3af0369

Check that number of columns and column types equal

68fc034

Check if rows[0] is column types, HeaderResolver

06f606e

Check if rows[0] is column types, ParsingContext

7e0a43d

Raise when number of columns != number of types

9fe879c

populate_metadata: Fix map and delete contexts

b2f75f4

In general, populate_metadata.py looks to be in line for a refactoring. The number of if-clauses as well as the unhandled cases (like no catch-all for unknown targets in delete) is making this ever harder to work with. All tests passing.

populate_metadata: adding passing screen test

96b0a07

populate_metadata: refactor test in preparation for projects

6a73b6c

Tables: Add DatasetColumn (new API)

2579800

populate_metadata: add support for ProjectColumn

16d0f27

populate_metadata: disallow image name conflicts

e1643ef

atarkowska mentioned this pull request Apr 4, 2017

Metadata column column types and datasets #4637

Merged

fix flake8

36cddd0

sbesson added the metadata53 label Apr 4, 2017

atarkowska closed this Apr 4, 2017

atarkowska deleted the rebased/metadata53/populate_metadata_column_types branch April 4, 2017 16:04

This was referenced Apr 4, 2017

Merge populate_metadata PRs #5232

Merged

Merge populate_metadata PRs (rebased onto develop) #5241

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metadata column column types and datasets (rebased onto metadata53)#5218

Metadata column column types and datasets (rebased onto metadata53)#5218
atarkowska wants to merge 22 commits intoome:metadata53from
atarkowska:rebased/metadata53/populate_metadata_column_types

atarkowska commented Apr 4, 2017

Uh oh!

atarkowska commented Apr 4, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

atarkowska commented Apr 4, 2017

Uh oh!

atarkowska commented Apr 4, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants