Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I don't know which is harvesting server link #4203

Closed
Thalia-Uranga opened this issue Oct 13, 2017 · 20 comments
Closed

I don't know which is harvesting server link #4203

Thalia-Uranga opened this issue Oct 13, 2017 · 20 comments
Assignees
Milestone

Comments

@Thalia-Uranga
Copy link

When I enable harvesting server it does not give me a link to perform interoperability tests. How do I see this part?

@pdurbin
Copy link
Member

pdurbin commented Oct 13, 2017

@thalia25 and I have been discussing this at http://irclog.iq.harvard.edu/dataverse/2017-10-13#i_58686 and it sounds like there are two problems:

  • There is a usability issue wherein people who have just set up the Dataverse installation as a harvesting server by following http://guides.dataverse.org/en/4.8/admin/harvestserver.html gets no indication in the user interface of what URL they should share with the people who run the harvesting client. The goal is for the harvesting client (DSpace, for example) to start harvesting data from the Dataverse installation.
  • The docs above should be improved to explain this.

@pdurbin pdurbin added Feature: Harvesting UX & UI: Design This issue needs input on the design of the UI and from the product owner Type: Suggestion an idea User Role: Superuser Has access to the superuser dashboard and cares about how the system is configured labels Oct 13, 2017
@pdurbin
Copy link
Member

pdurbin commented Oct 13, 2017

Yeah, the UI doesn't tell you which URL to share. Maybe the URL could be included where it says "Successfully created harvesting set". Here's a screenshot:

screen shot 2017-10-13 at 2 02 22 pm

@pdurbin
Copy link
Member

pdurbin commented Oct 13, 2017

I'm looking at http://guides.dataverse.org/en/4.8/admin/harvestserver.html but I'm not sure what URL to send for the set I just created. The only OAI-PMH verb I know is "Identify".

https://dev1.dataverse.org/oai?verb=Identify shows this:

screen shot 2017-10-13 at 2 05 50 pm

@jggautier
Copy link
Contributor

jggautier commented Oct 13, 2017

Hope this helps!: I think the server URL for @pdurbin's example would be https://dev1.dataverse.org/oai. For Harvard Dataverse, it would be https://dataverse.harvard.edu/oai. That's what I would include as the Server URL when I'm adding harvesting client info in the Dataverse dashboard.

So I think if you followed that pattern with your installation (repositoryURL.xxx/oai?verb=ListSets) and got that page of xml of your list of oai sets, you'd know that the server URL is repositoryURL.xxx/oai.

We've noted that the guides, and I agree with @pdurbin that the application, should show what the server URL is.

@Thalia-Uranga
Copy link
Author

@jggautier Hi, So the link that I must share to interoperate with another repository is with the link that you mention adding oai?

@jggautier
Copy link
Contributor

jggautier commented Oct 16, 2017

@thalia25 Yes, that's what I would think. I'm only familiar with how to set up harvesting between Dataverse repositories, but I looked through DSpace's latest docs (version 6.x) about importing metadata, and it has some info about using an "OAIPMHDataLoader".

The OAIPMHDataLoader has the following properties:
a) fieldMap: Same as above, the mapping between the input keys holding the metadata and the ones that we want to have internal in BTE.
b) serverAddress: The base address of the OAI provider (server). Base address can be specified also in the "-s" option of the command prompt. If is specified in both places, the one specified from the command line is preferred.
c) prefix: The metadata prefix to be used in OAI requests.

As you said, I think that the serverAddress for your Dataverse installation would be the address of your installation plus /oai at the end.

Let me know if that makes sense and is helpful (or doesn't and isn't!).

(Eventually I think we'll need to get touch with someone who can say authoritatively what oai-pmh info Dataverse admins need, and how to get it, in order to share the metadata in their installation.with other systems.)

@Thalia-Uranga
Copy link
Author

@jggautier and @pdurbin What is the difference between OAI server and OAI sets?

@jggautier
Copy link
Contributor

The server URL points to the OAI server. When you enable harvesting, your Dataverse installation is a server. The Dataverse guides have a section on what OAI sets are that I think should help:

Once the service is enabled, you define collections of local datasets that will be available to remote harvesters as OAI Sets.

Have you seen this page on DSpace's guides about harvesting non-DSpace content into a DSpace installation?: https://wiki.duraspace.org/display/DSDOC6x/XMLUI+Configuration+and+Customization#XMLUIConfigurationandCustomization-HarvestingItemsfromXMLUIviaOAI-OREorOAI-PMH

@jggautier
Copy link
Contributor

jggautier commented May 20, 2018

Would it be helpful if in the "Managing Harvesting Server and Sets" section of the Dataverse guides we add that

the default OAI-PMH service base address for Dataverses that enable harvesting is https://example.dataverse.org/oai

?

Also, I came across DataCite's OAI-PMH guide recently. (Wish I'd seen it a year ago!) I think it very clearly answers a lot of the types of questions Thalia was asking. Might be nice to borrow from it as well :)

@pdurbin
Copy link
Member

pdurbin commented May 20, 2018

@jggautier yes, I think an example like that would be very helpful and easy to add.

@Thalia-Uranga
Copy link
Author

@pdurbin the one who did not know the OAI-PMH address was in the installation of dataverse version 4.7, it was thanks to you that I learned about the URL. I think it would be good to mention it in the versions that don't show the OAI-PMH address on the dataverse page.

@jggautier
Copy link
Contributor

Thanks @thalia25! I just want to clarify that the OAI-PMH address is not in the UI of any Dataverse installation version on and after 4.0. (I'm not sure about Dataverse versions before 4.0.)

@pdurbin pdurbin added Type: Suggestion an idea User Role: Superuser Has access to the superuser dashboard and cares about how the system is configured labels Dec 18, 2018
@pdurbin
Copy link
Member

pdurbin commented Dec 18, 2018

Today at http://irclog.iq.harvard.edu/dataverse/2018-12-18#i_83107 @poikilotherm said, "The docs could be more specific about the endpoint URL" and he said he'd roll a small doc change for this issue into pull request #5371 since he's already editing that part of the guides.

@pdurbin pdurbin removed Hackathon: Low Hanging Fruit Help Wanted: Documentation Mentor: pdurbin UX & UI: Design This issue needs input on the design of the UI and from the product owner Type: Suggestion an idea User Role: Superuser Has access to the superuser dashboard and cares about how the system is configured labels Dec 18, 2018
poikilotherm added a commit to poikilotherm/dataverse that referenced this issue Dec 18, 2018
@poikilotherm
Copy link
Contributor

Hey @thalia25,
do you think my recent docs addition in 0258d19 is a sufficient solution to this issue?
Cheers,
Oliver

@pdurbin
Copy link
Member

pdurbin commented Jan 4, 2019

@poikilotherm it looks fine to me. I haven't seen @thalia25 in IRC lately. (We miss you! 😄 ) Can you please go ahead and make a pull request? Thanks!

@poikilotherm
Copy link
Contributor

This is already part of #5371 as discussed on IRC (sry, cannot find it right now in the logs)

@pdurbin
Copy link
Member

pdurbin commented Jan 5, 2019

@poikilotherm oh! I forgot. Please remind us to move this issue to QA once that pull request is moved to QA.

@pdurbin
Copy link
Member

pdurbin commented Mar 28, 2019

@poikilotherm pull request #5371 currently has merge conflicts. Do you feel like cherry-picking 0258d19 into a new branch and pull request?

@poikilotherm
Copy link
Contributor

@pdurbin sorry, I missed this/forgot. Done, see #5834

@kcondon kcondon self-assigned this May 10, 2019
@djbrooke djbrooke added this to the 4.15 milestone Jun 11, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants