External Vocabulary Support by qqmyers · Pull Request #7946 · IQSS/dataverse

qqmyers · 2021-06-15T23:10:58Z

What this PR does / why we need it: This is an extension/modification of #7712 that implements the consensus changes from the Metadata Working Group to support vocabularies from multiple services, minimize the coupling between Dataverse and the services, support single and compound fields, capture values for search, metadata export, and archival purposes in an internal table.

More details to follow. Creating this as a draft PR to simplify seeing the comparison with the dev branch.

Which issue(s) this PR closes:

Closes #7711

Special notes for your reviewer:

Suggestions on how to test this:

Does this PR introduce a user interface change? If mockups are available, please link/include them here:

Is there a release notes update needed for this change?:

Additional documentation:

Add controlled vocabulary

…xternal-cvoc

Merge branch 'develop' into external-cvoc

Fix problem where pressing tab skips some fields.

* - Add Vocabulary URL field - Add parent term url - Min search letter 1 - Add scrollbar for long list * - Add Vocabulary URL field - Add parent term url - Min search letter 1 - Add scrollbar for long list * Remove unused key setting. Co-authored-by: Paul Boon <paul.boon@dans.knaw.nl>

DD-375 Disable editing of the cvoc URL fields

* Added minChars option to the cvoc configuration, default 0, ignoring negative numbers * Added log info for minChars option * Added option to hide cvoc URL fields when readonly and some javascript refactoring

…put field (IQSS#53)

* Uses the vocab-uri parameter from the cvoc config to store in the metadata * Removed commented-out code fragment

Merge branch 'develop' into external-cvoc

Added the cvoc-interface.js file to the application resources

…T_CVOC Merge back develop in ext cvoc

merge from QDR branch

temporary measure to trigger Dataverse to not escape HTML (because format contains '<a' )

Conflicts: src/main/java/edu/harvard/iq/dataverse/DatasetPage.java

json for config to simplify making changes - could switch back

pdurbin

I didn't run the code this time but I'm feeling like most of my original issues have been addressed (thanks!). I did leave a couple minor new comments. As to whether to use id as a primary key, I'll defer to @scolapasta (and will assign him to this pull request).

As discussed in standup, there's a desire to spin this up so that design folks and others can take a look. For that part, I'll assign @djbrooke

Apart from those two things, I think this is ready for QA. I will note that I had trouble getting the fields working from the demo metadata block when there were four of them.

The docs here look fine but most of the complex config stuff has been put in a separate repo.

Overall, I'm excited that this external vocabularies feature is coming!

pdurbin · 2021-09-08T20:36:34Z

doc/sphinx-guides/source/admin/metadatacustomization.rst

-|                                   | compound field is displayed in    |
-|                                   | the UI as IMPs: A                 |
-|                                   | collection of NMR data.           |
-+-----------------------------------+-----------------------------------+


Are the changes to the tables above meaningful? Is the content or look changing? Or can these changes simply be ignored?

No changes intended. It looks like Eclipse tried to do some cleanup. That might be good but I see in one case (the fieldType param for fields) it added a new column to the table, so I'll just revert all of it since the existing stuff works.

doc/sphinx-guides/source/installation/config.rst

src/main/java/edu/harvard/iq/dataverse/DatasetFieldServiceBean.java

pdurbin · 2021-09-08T20:47:55Z

src/main/java/edu/harvard/iq/dataverse/DatasetFieldServiceBean.java

+                    JsonArray childFields = jo.getJsonArray("child-fields");
+                    for (JsonString elm : childFields.getValuesAs(JsonString.class)) {
+                        dft = findByNameOpt(elm.getString());
+                        logger.info("Found: " + dft.getName());


I still think logger.fine would be better here (or delete).

src/main/java/edu/harvard/iq/dataverse/ExternalVocabularyValue.java

pdurbin · 2021-09-08T20:49:04Z

src/main/java/edu/harvard/iq/dataverse/ExternalVocabularyValue.java

+ */
+@Entity
+@Table(indexes = { @Index(columnList = "datasetfieldtype_id"), @Index(columnList = "displayorder") })
+public class ExternalVocabularyValue implements Serializable {


Ok, hopefully it'll work for @kcondon .

pdurbin · 2021-09-08T20:50:21Z

src/main/java/edu/harvard/iq/dataverse/ExternalVocabularyValue.java

+    @Id
+    @Column(columnDefinition = "TEXT", nullable = false)
+    private String uri;


I defer to @scolapasta on the consistency of using id.

djbrooke · 2021-09-08T21:02:07Z

@TaniaSchlatter - let me know when you'd like me to spin this up. I can do as early as tomorrow (Thursday) morning.

qqmyers · 2021-09-09T19:52:28Z

W.r.t. where to make comments: This PR itself shouldn't have any UI impacts (!). If you configure to use external vocab support for specific fields, Dataverse adds invisible data-cvoc-* attributes on those fields and includes the specified Javascript(s) in pages that would use it/them. All of the UI changes should then be the result of the Javascript(s). So - if the mechanism itself doesn't work or we see log errors, etc., we should track them with the PR. The hidden attributes are added in several places - the metadata input panel, metadata display panel, facets, advanced search panel, etc. If there are more places that's needed, it would also be this PR.

For any UI issues, it is probably better to add issues at the external vocab script repo. At present, both the ORCID and SKOSMOS scripts leverage some common code and jquery 'select2' (https://select2.org/), so most comments would probably apply to both. However, some details about the displayed values are script specific (i.e. only ORCID can show the a person's name) and the details of the layout depend on whether more than one vocab is allowed (skosmos only), whether free-text entries are allowed, etc. so detailed comments should make the exact circumstances clear.

There's also an ARDC script available - slightly out of date and subject to further change, so I'd suggest only looking at the ORCID/SKOSMOS ones for now. We're also hoping to have a map-based one to handle the bounding box fields at some point - not sure that will exist before people take a look.

Also FWIW: We also have not had any discussion of connecting existing fields (e.g. keywords, topics, contacts in the citation block) to this mechanism by default, so the impact is only on custom fields at this point. (In fact, to handle a field like contacts where non-ORCID entries would be allowed and some child fields wouldn't come from ORCID will require more work on the ORCID script itself. And keywords would need to have a fourth child field added for this to work, which is not in this PR. So it isn't just a matter of adding some config info for some of the existing compound fields.)

Co-authored-by: Philip Durbin <philipdurbin@gmail.com>

…mmunityConsortium/dataverse.git into external-cvoc2

Conflicts: doc/sphinx-guides/source/installation/config.rst

janvanmansum · 2021-09-11T08:55:02Z

@qqmyers there is a typo in the link to the external vocab script repo in your comment above. Also, I noticed that there are apparently two github organizations for GDCC ...?

qqmyers · 2021-09-11T13:47:33Z

Thanks - I fixed the original link as well. As for GDCC - we're slowly transitioning from github.com/GlobalDataverseCommunityConsortium/* to github.com/gdcc/*. Most repos can just move whenever someone has time. For the previewers one, it's a bit more complex since we serve content from the related github.io link so we need to coordinate with people using those previewers and/or run parallel repos for a while instead of just moving, etc. tbd. In any case, the gdcc repo is preferred for new work.

pdurbin

I think we're good. Thanks to all who participated in the UI review on Friday. I see the fix to make the id column a long made it in. Tests failed on the most recent run but I think it was due to problems connecting to the EC2 instance? Here's the log: https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/job/PR-7946/30/consoleFull

I'm going to send this to QA.

donsizemore · 2021-09-13T18:59:19Z

I think we're good. Thanks to all who participated in the UI review on Friday. I see the fix to make the id column a long made it in. Tests failed on the most recent run but I think it was due to problems connecting to the EC2 instance? Here's the log: https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/job/PR-7946/30/consoleFull

I'm going to send this to QA.

@pdurbin The job triggered by the most recent commit passed all tests: https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/job/PR-7946/31/console

ekoi and others added 30 commits January 20, 2021 13:27

Add controlled vocabulary

5076dd1

Merge pull request #41 from ekoi/external-cvoc

6960a4c

Add controlled vocabulary

Merge branch 'develop' into external-cvoc

f1d5fb5

Merge branch 'external-cvoc' of github.com:DANS-KNAW/dataverse into e…

6ee5a09

…xternal-cvoc

Merge pull request #44 from PaulBoon/external-cvoc

56721ad

Merge branch 'develop' into external-cvoc

Fix problem where pressing tab skips some fields.

68d5aa3

Merge pull request #49 from JingMa87/external-cvoc

6224e3b

Fix problem where pressing tab skips some fields.

Have cvoc URL fields use the readonly setting

4cb9ca9

Merge pull request #50 from PaulBoon/DD-375

847dc65

DD-375 Disable editing of the cvoc URL fields

DD-377 Improving the cvoc metadata term selection input (#52)

aa43772

* Added minChars option to the cvoc configuration, default 0, ignoring negative numbers * Added log info for minChars option * Added option to hide cvoc URL fields when readonly and some javascript refactoring

Added ajax loading indicator css for the extenal cvoc autocomplete in…

8ef8408

…put field (IQSS#53)

DD-386: Uses the vocab-uri parameter from the cvoc config (IQSS#55)

c0d9c30

* Uses the vocab-uri parameter from the cvoc config to store in the metadata * Removed commented-out code fragment

Merge branch 'develop' into external-cvoc

43d6631

Merge pull request IQSS#56 from PaulBoon/external-cvoc

e369ffe

Merge branch 'develop' into external-cvoc

Added the cvoc-interface.js file to the application resources

feefe49

Changed the default js-url to this compiled in resource

78f3ec2

Merge pull request IQSS#60 from PaulBoon/FixInterfaceJs

5d78401

Added the cvoc-interface.js file to the application resources

Merge branch 'develop' into external-cvoc

a480b4f

Merged back develop in external cvoc

30de328

Merge pull request IQSS#71 from janvanmansum/MERGE_BACK_DEVELOP_IN_EX…

ec5d019

…T_CVOC Merge back develop in ext cvoc

Merge branch 'develop' into external-cvoc

00e5df4

Proof-of-concept ORCID integration for a new 'creator' field

1792fc2

merge from QDR branch

add scripts - temporary measure

2359567

fix when query doesn't exist yet

74c9952

Allow plain text entries

cb60c23

avoid html escaping - temporary

dc09b20

temporary measure to trigger Dataverse to not escape HTML (because format contains '<a' )

Merge remote-tracking branch 'IQSS/develop' into external-cvoc

8228ba3

Conflicts: src/main/java/edu/harvard/iq/dataverse/DatasetPage.java

Merge branch 'CVV-ORCID' into external-cvoc

5273ba0

add table, move config to servicebean, use json

3cb810d

json for config to simplify making changes - could switch back

djbrooke assigned pdurbin and unassigned qqmyers Sep 8, 2021

pdurbin reviewed Sep 8, 2021

View reviewed changes

pdurbin assigned scolapasta and djbrooke and unassigned pdurbin Sep 8, 2021

djbrooke assigned TaniaSchlatter Sep 8, 2021

qqmyers and others added 5 commits September 9, 2021 17:51

revert table changes

dbbf15b

Update doc/sphinx-guides/source/installation/config.rst

35d5839

Co-authored-by: Philip Durbin <philipdurbin@gmail.com>

add generated id

792f0e9

Merge branch 'external-cvoc2' of https://github.com/GlobalDataverseCo…

b9cb3a7

…mmunityConsortium/dataverse.git into external-cvoc2

Merge remote-tracking branch 'IQSS/develop' into external-cvoc2

11b1cef

Conflicts: doc/sphinx-guides/source/installation/config.rst

pdurbin approved these changes Sep 13, 2021

View reviewed changes

pdurbin unassigned scolapasta, djbrooke and TaniaSchlatter Sep 13, 2021

blank line needed

c63a2ec

kcondon self-assigned this Sep 13, 2021

kcondon merged commit ba13493 into IQSS:develop Sep 14, 2021

philippconzett mentioned this pull request Sep 15, 2021

Support for Controlled Vocabularies (CVV) DataverseNO/dataverse#28

Open

djbrooke added this to the 5.7 milestone Sep 15, 2021

pdurbin mentioned this pull request Feb 1, 2022

Feature Request: Authority Control on Names #7937

Closed

landreev added the External CV label May 3, 2022

landreev mentioned this pull request May 3, 2022

Spike: What work has already been done towards support for controlled vocabularies for metadata fields #8571

Closed

4 tasks

mreekie added the Feature: Controlled Vocabulary Includes both Internal and external controlled vocabularies label May 9, 2022

Comments

Conversation

qqmyers commented Jun 15, 2021

Uh oh!

pdurbin left a comment

Choose a reason for hiding this comment

Uh oh!

pdurbin Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

qqmyers Sep 9, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

pdurbin Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

pdurbin Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

pdurbin Sep 8, 2021

Choose a reason for hiding this comment

Uh oh!

djbrooke commented Sep 8, 2021

Uh oh!

qqmyers commented Sep 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

janvanmansum commented Sep 11, 2021

Uh oh!

qqmyers commented Sep 11, 2021

Uh oh!

pdurbin left a comment

Choose a reason for hiding this comment

Uh oh!

donsizemore commented Sep 13, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

14 participants

qqmyers commented Sep 9, 2021 •

edited

Loading