Uploaded image for project: 'DSpace'
  1. DSpace
  2. DS-396

Provide metatags used by Google Scholar for enhanced indexing

    Details

    • Type: New Feature
    • Status: Closed (View Workflow)
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: 1.7.0
    • Component/s: None
    • Labels:
      None
    • Attachments:
      2
    • Comments:
      12
    • Documentation Status:
      Complete or Committed

      Description

      In August 2009 I posted a query to the DSpace general list about whether or not anyone else had seen strange indexing of items in DSpace (see http://mailman.mit.edu/pipermail/dspace-general/2009-August/002996.html) and MacKenzie Smith posted a follow up that stated she had seen the same things (http://mailman.mit.edu/pipermail/dspace-general/2009-August/002998.html).

      MacKenzie followed up with Anurag Acharya at Google Scholar to inquire about this and he indicated that better support for the Google metatags schema would help mitigate this problem. It looks like the metatags are (though we'd want to confirm this with the Google Scholar team).

      <meta name="citation_journal_title" content="Journal Name">
      <meta name="citation_authors" content="Last Name1, First Name1; Last Name2, First Name2">
      <meta name="citation_title" content="Article Title">
      <meta name="citation_date" content="01/01/2007">
      <meta name="citation_volume" content="10">
      <meta name="citation_issue" content="1">
      <meta name="citation_firstpage" content="1">
      <meta name="citation_lastpage" content="15">
      <meta name="citation_doi" content="10.1074/jbc.M309524200">
      <meta name="citation_pdf_url" content="http://www.publishername.org/10/1/1.pdf">
      <meta name="citation_abstract_html_url" content="http://www.publishername.org/cgi/content/abstract/10/1/1">
      <meta name="citation_fulltext_html_url" content="http://www.publishername.org/cgi/content/full/10/1/1">
      <meta name="dc.Contributor" content="Last Name1, First Name1">
      <meta name="dc.Contributor" content="Last Name2, First Name2">
      <meta name="dc.Title" content="Article Title">
      <meta name="dc.Date" content="01/01/2007">
      <meta name="citation_publisher" content="Publisher Name">

      I realize that supporting this level of granularity puts pressure on how we structure metadata but I believe as a repository manager that it's critical that we get this kind of support for Google Scholar into DSpace out of the box - and hopefully will help minimize the problems that IDEALS has seen.

        Attachments

          Activity

          Hide
          peterdietz Peter Dietz added a comment -

          Added a simple patch which provides for basic implementation of the required changes. It will display a citation_pdf_url in the case where there is only one public content bitstream and its mimetype is PDF. We should have something committed to improve the vast majority of instances in Scholar for 1.7

          (This is not the work that MIT has been working on, but a quick and dirty implementation to move forward on this).

          Show
          peterdietz Peter Dietz added a comment - Added a simple patch which provides for basic implementation of the required changes. It will display a citation_pdf_url in the case where there is only one public content bitstream and its mimetype is PDF. We should have something committed to improve the vast majority of instances in Scholar for 1.7 (This is not the work that MIT has been working on, but a quick and dirty implementation to move forward on this).
          Hide
          sands Sands Fish added a comment -

          Attaching a patch here to integrate the Google Scholar metadata into dri2xhtml-alt & Mirage themes. Please apply if desired. I didn't want to commit them and disrupt any work being done by the authors of those themes.

          Show
          sands Sands Fish added a comment - Attaching a patch here to integrate the Google Scholar metadata into dri2xhtml-alt & Mirage themes. Please apply if desired. I didn't want to commit them and disrupt any work being done by the authors of those themes.
          Hide
          sands Sands Fish added a comment -

          The Mirage & dri2xhtml-alt integration patch has been applied, for the record.

          Show
          sands Sands Fish added a comment - The Mirage & dri2xhtml-alt integration patch has been applied, for the record.
          Hide
          tdonohue Tim Donohue added a comment -

          Hi Sands,

          Two brief comments on the code in 1.7.0 RC1:

          (1) First, I wonder if the 'google-metadata.properties' file you created should be placed in 'config/crosswalks/' so that it's alongside the existing 'xhml-head-item.properties' (which also maps metadata to the HTML metatags). That way it's easier to do all the HTML <meta> tag mappings in one place, as needed.

          (2) I'd also like to continue to discuss enabling this feature by default in 1.7.0. I'll bring that up in tomorrow's DSpace Developers meeting so we can see if we can gain a consensus.

          Show
          tdonohue Tim Donohue added a comment - Hi Sands, Two brief comments on the code in 1.7.0 RC1: (1) First, I wonder if the 'google-metadata.properties' file you created should be placed in 'config/crosswalks/' so that it's alongside the existing 'xhml-head-item.properties' (which also maps metadata to the HTML metatags). That way it's easier to do all the HTML <meta> tag mappings in one place, as needed. (2) I'd also like to continue to discuss enabling this feature by default in 1.7.0. I'll bring that up in tomorrow's DSpace Developers meeting so we can see if we can gain a consensus.
          Hide
          sands Sands Fish added a comment -

          Mapping and default configuration completed. Any additional feedback on default mappings (in config/crosswalks/google-metadata.properties) from community would be valuable.

          Show
          sands Sands Fish added a comment - Mapping and default configuration completed. Any additional feedback on default mappings (in config/crosswalks/google-metadata.properties) from community would be valuable.

            People

            • Assignee:
              sands Sands Fish
              Reporter:
              sshreeves Sarah Shreeves
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: