RubyGems - bibsync - Versions diffs - 0.0.1 → 0.0.2 - Mend

bibsync 0.0.1 → 0.0.2

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (40) hide show

checksums.yaml +7 -0
data/.travis.yml +10 -0
data/Gemfile +3 -0
data/LICENSE +21 -0
data/README.md +88 -0
data/Rakefile +16 -0
data/bibsync.gemspec +4 -2
data/lib/bibsync/actions/{check_versions.rb → check_arxiv_versions.rb} +6 -6
data/lib/bibsync/actions/determine_arxiv_doi.rb +70 -0
data/lib/bibsync/actions/fetch_from_arxiv.rb +11 -9
data/lib/bibsync/actions/find_my_citations.rb +4 -4
data/lib/bibsync/actions/jabref_format.rb +2 -2
data/lib/bibsync/actions/synchronize_files.rb +5 -6
data/lib/bibsync/actions/synchronize_metadata.rb +14 -57
data/lib/bibsync/actions/validate.rb +16 -6
data/lib/bibsync/actions.rb +1 -7
data/lib/bibsync/bibliography.rb +60 -23
data/lib/bibsync/command.rb +13 -8
data/lib/bibsync/log.rb +22 -20
data/lib/bibsync/transformer.rb +1 -1
data/lib/bibsync/utils.rb +7 -9
data/lib/bibsync/version.rb +1 -1
data/test/actions/test_check_arxiv_versions.rb +4 -0
data/test/actions/test_determine_arxiv_doi.rb +61 -0
data/test/actions/test_fetch_from_arxiv.rb +4 -0
data/test/actions/test_find_my_citations.rb +4 -0
data/test/actions/test_jabref_format.rb +4 -0
data/test/actions/test_synchronize_files.rb +4 -0
data/test/actions/test_synchronize_metadata.rb +34 -0
data/test/actions/test_validate.rb +4 -0
data/test/fixture/FileWithEmbeddedArXiv.pdf +0 -0
data/test/fixture/FileWithEmbeddedArXiv.tex +7 -0
data/test/fixture/FileWithEmbeddedDOI.pdf +0 -0
data/test/fixture/FileWithEmbeddedDOI.tex +7 -0
data/test/fixture/entry.bib +8 -0
data/test/fixture/test.bib +34 -0
data/test/helper.rb +21 -0
data/test/test_bibliography.rb +222 -0
data/test/test_utils.rb +54 -0
metadata +63 -16

checksums.yaml ADDED Viewed

@@ -0,0 +1,7 @@
+---
+SHA1:
+  metadata.gz: 21c0564e45a66b0339bf5b7bafaef25633adcd8e
+  data.tar.gz: 34a9ae41d395ba912e95accbb0c2cb87c8de69c3
+SHA512:
+  metadata.gz: 46e87958376899e94b3bec951241bb02e3085f2e5141146477d9f8bb0de533e27c0c8d07b4c4e5a314f2d734cd386ee092362d38307e18a4d49303a746ad42c0
+  data.tar.gz: 46b15a8cf96461ce3fa3e45aef1585e6806177be668a20410ca940687c5d1f5667d30b1e76701b37503984c0a5ca0aa23025728b0154ee306c868ba8bcdaf1db

data/.travis.yml ADDED Viewed

@@ -0,0 +1,10 @@
+language: ruby
+rvm:
+  - 1.9.3
+  - 2.0.0
+  - ruby-head
+  - jruby-19mode
+  - rbx-19mode
+before_install:
+  - sudo apt-get install -qq poppler-utils

data/Gemfile ADDED Viewed

@@ -0,0 +1,3 @@
+source 'https://rubygems.org/'
+gemspec

data/LICENSE ADDED Viewed

@@ -0,0 +1,21 @@
+The MIT License
+Copyright (c) 2013 Daniel Mendler
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in
+all copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
+THE SOFTWARE.

data/README.md ADDED Viewed

@@ -0,0 +1,88 @@
+BibSync
+=======
+BibSync is a tool to synchronize your paper database with a [BibTeX](http://en.wikipedia.org/wiki/BibTeX) file which might be most
+useful for Physicists and Mathematicians since it supports synchronization with [DOI](http://dx.doi.org/) and [arXiv](http://arxiv.org/).
+I created this tool during the work on my diploma thesis in physics since I was unhappy
+with existing tools like [Mendeley](http://www.mendeley.com/). I use this tool together with Git for version control
+and [JabRef](http://jabref.sourceforge.net/) for browsing. This tool adheres more to the Unix philosophy that a small tool
+for each task is better than one thing which tries to solve everything. If you use [JabRef](http://jabref.sourceforge.net/)
+for browsing and tagging it is unnecessary to sort the papers into different sub directories by hand.
+Just throw them all in one directory!
+__Note__: This tool is derived from a script which I used during my thesis. It worked
+quite well and reliable during that time. But be aware that I used Git for version control
+of the [BibTeX](http://en.wikipedia.org/wiki/BibTeX) file. So any mistakes which might be made by this tool could be reverted.
+Features
+--------
+BibSync supports the following features:
+* Synchronization between a [BibTeX](http://en.wikipedia.org/wiki/BibTeX) file and a directory containing the papers in pdf, ps or djvu format
+* [JabRef](http://jabref.sourceforge.net/) file fields are generated, so you can open the existing papers directly out of [JabRef](http://jabref.sourceforge.net/)
+* Downloading of [arXiv](http://arxiv.org/) or [DOI](http://dx.doi.org/) metadata
+* Extraction of [arXiv](http://arxiv.org/) or [DOI](http://dx.doi.org/) id out of the file using [pdftotext](http://en.wikipedia.org/wiki/Pdftotext)
+* Downloading of new versions of [arXiv](http://arxiv.org/) papers
+* Simple validation of [BibTeX](http://en.wikipedia.org/wiki/BibTeX) files (Checks for missing fields etc)
+* Simple transformation of [BibTeX](http://en.wikipedia.org/wiki/BibTeX) fields (Normalization of author, year and journal field...)
+* Works under every platform supporting Ruby (Linux, Windows, ...)
+Quick start
+-----------
+BibSync requires Ruby >= 1.9.2 to run. It is distributed as a RubyGems package. You can install it via
+the command line
+~~~
+$ gem install bibsync
+~~~
+After that you can use the 'bibsync' tool on the command line. At first let's validate
+a [BibTeX](http://en.wikipedia.org/wiki/BibTeX) file called 'thesis.bib'.
+~~~
+$ bibsync -b ~/thesis/thesis.bib
+~~~
+Then we want to synchronize all the papers in our paper directory with 'bibsync' and automatically download
+the missing metadata.
+~~~
+$ bibsync -d ~/thesis/papers -b ~/thesis/thesis.bib
+~~~
+BibSync tries to download the metadata from [arxiv.org](http://arxiv.org) and [dx.doi.org](http://dx.doi.org). If you want to know more about the functions of 'bibsync' take a look at the command line help.
+~~~
+$ bibsync --help
+~~~
+My setup
+--------
+* BibSync for synchronizing
+* [JabRef](http://jabref.sourceforge.net/) for browsing the bibliography, tagging and categorizing papers
+* [Biblatex](http://www.ctan.org/pkg/biblatex) to include a bibliography in LaTeX with full Unicode support
+Alternatives
+------------
+* [Mendeley](http://www.mendeley.com/) (Commercial, synchronizes with their server, limited disk space, bloated gui application)
+* [Zotero](http://www.zotero.org/) (Firefox plugin, Open source)
+A better name?
+--------------
+If you have a suggestion for a better name, just let me know...
+Author
+------
+Daniel Mendler
+License
+-------
+See LICENSE

data/Rakefile ADDED Viewed

@@ -0,0 +1,16 @@
+begin
+  require 'bundler'
+  Bundler::GemHelper.install_tasks
+rescue Exception
+end
+require 'rake/testtask'
+Rake::TestTask.new :test do |t|
+  t.libs << 'lib' << 'test'
+  t.test_files = FileList['test/**/test_*.rb']
+  t.verbose = true
+  t.ruby_opts << '-w' << '-v'
+end
+task :default => :test

data/bibsync.gemspec CHANGED Viewed

@@ -8,8 +8,8 @@ Gem::Specification.new do |s|
   s.date              = Date.today.to_s
   s.authors           = ['Daniel Mendler']
   s.email             = ['mail@daniel-mendler.de']
-  s.summary           = 'BibSync is a tool to synchronize scientific papers and bibtex bibliography files'
-  s.description       = 'BibSync is a tool to synchronize scientific papers and bibtex bibliography files'
+  s.summary           = 'BibSync is a tool to synchronize scientific papers and BibTeX bibliography files'
+  s.description       = 'BibSync is a tool to synchronize scientific papers and BibTeX bibliography files'
   s.homepage          = 'https://github.com/minad/bibsync'
   s.rubyforge_project = s.name
@@ -18,4 +18,6 @@ Gem::Specification.new do |s|
   s.require_paths = %w(lib)
   s.add_runtime_dependency('nokogiri')
+  s.add_development_dependency('rake')
+  s.add_development_dependency('minitest')
 end

data/lib/bibsync/actions/{check_versions.rb → check_arxiv_versions.rb} RENAMED Viewed

@@ -1,14 +1,14 @@
 module BibSync
   module Actions
-    class CheckVersions
+    class CheckArXivVersions
       include Log
       include Utils
       SliceSize = 20
       def initialize(options)
-        raise 'Bibliography must be set' unless @bib = options[:bib]
-        raise 'Directory must be set' unless @dir = options[:dir]
+        raise 'Option :bib is required' unless @bib = options[:bib]
+        raise 'Option :dir is required' unless @dir = options[:dir]
         @update = options[:update]
       end
@@ -16,16 +16,16 @@ module BibSync
         notice 'Check for newer version on arXiv'
         @bib.select {|e| e[:arxiv] }.each_slice(SliceSize) do |entry|
           begin
-            xml = fetch_xml("http://export.arxiv.org/api/query?id_list=#{entry.map{|e| arxiv_id(e, :version => false, :prefix => true) }.join(',')}&max_results=#{SliceSize}")
+            xml = fetch_xml("http://export.arxiv.org/api/query?id_list=#{entry.map{|e| arxiv_id(e, version: false, prefix: true) }.join(',')}&max_results=#{SliceSize}")
             xml.xpath('//entry/id').map(&:content).each_with_index do |id, i|
               id.gsub!('http://arxiv.org/abs/', '')
               if id != entry[i][:arxiv]
-                info("#{entry[i][:arxiv]} replaced by http://arxiv.org/pdf/#{id}", :key => entry[i])
+                info("#{entry[i][:arxiv]} replaced by http://arxiv.org/pdf/#{id}", key: entry[i])
                 arxiv_download(@dir, id) if @update
               end
             end
           rescue => ex
-            error('arXiv query failed', :ex => ex)
+            error('arXiv query failed', ex: ex)
           end
         end

data/lib/bibsync/actions/determine_arxiv_doi.rb ADDED Viewed

@@ -0,0 +1,70 @@
+module BibSync
+  module Actions
+    class DetermineArXivDOI
+      include Utils
+      include Log
+      def initialize(options)
+        raise 'Option :bib is required' unless @bib = options[:bib]
+        @force = options[:resync]
+      end
+      def run
+        notice 'Determine arXiv and DOI identifiers'
+        @bib.each do |entry|
+          next if entry.comment? ||
+                  (entry[:doi] && entry[:arxiv]) ||
+                  (!@force && entry[:title] && entry[:author] && entry[:year])
+          determine_arxiv_and_doi(entry)
+          @bib.save
+        end
+      end
+      private
+      def determine_arxiv_and_doi(entry)
+        if file = entry.file
+          if file[:type] == :PDF && !entry[:arxiv] && !entry[:doi]
+            debug('Searching for arXiv or doi identifier in pdf file', key: entry)
+            text = `pdftotext -f 1 -l 2 #{Shellwords.escape file[:path]} - 2>/dev/null`
+            entry[:arxiv] = $1 if text =~ /arXiv:\s*([\w\.\/\-]+)/
+            entry[:doi] = $1 if text =~ /doi:\s*([\w\.\/\-]+)/i
+          end
+          if !entry[:arxiv] && file[:name] =~ /^(\d+.\d+v\d+)\.\w+$/
+            debug('Interpreting file name as arXiv identifier', key: entry)
+            entry[:arxiv] = $1
+          end
+          if !entry[:doi] && file[:name] =~ /^(PhysRev.*?|RevModPhys.*?)\.\w+$/
+            debug('Interpreting file name as doi identifier', key: entry)
+            entry[:doi] = "10.1103/#{$1}"
+          end
+        end
+        if !entry[:arxiv] && entry[:doi]
+          begin
+            info('Fetch missing arXiv identifier', key: entry)
+            xml = fetch_xml("http://export.arxiv.org/api/query?search_query=doi:#{entry[:doi]}&max_results=1")
+            if xml.xpath('//entry/doi').map(&:content).first == entry[:doi]
+              id = xml.xpath('//entry/id').map(&:content).first
+              if id =~ %r{\Ahttp://arxiv.org/abs/(.+)\Z}
+                entry[:arxiv] = $1
+              end
+            end
+          rescue => ex
+            error('arXiv query by DOI failed', ex: ex, key: entry)
+          end
+        end
+        unless entry[:arxiv] || entry[:doi]
+          warning('No arXiv or DOI identifier found', key: entry)
+        end
+      end
+    end
+  end
+end

data/lib/bibsync/actions/fetch_from_arxiv.rb CHANGED Viewed

@@ -7,17 +7,19 @@ module BibSync
       include Utils
       def initialize(options)
-        raise 'Fetch must be set' unless @fetch = options[:fetch]
-        raise 'Directory must be set' unless @dir = options[:dir]
+        raise 'Option :fetch is required' unless @fetch = options[:fetch]
+        raise 'Option :dir is required' unless @dir = options[:dir]
       end
       def run
-        ids = []
+        arxivs = []
         urls = []
         @fetch.each do |url|
-          if url =~ %r{^http://arxiv.org/abs/(\d+\.\d+)$}
-            ids << $1
+          if url =~ /\A(\d+\.\d+)(v\d+)?\Z/
+            arxivs << $1
+          elsif url =~ %r{\Ahttp://arxiv.org/abs/(\d+\.\d+)\Z}
+            arxivs << $1
           else
             urls << url
           end
@@ -31,18 +33,18 @@ module BibSync
           end
         end
-        unless ids.empty?
+        unless arxivs.empty?
           notice 'Downloading from arXiv'
-          ids.each_slice(SliceSize) do |ids|
+          arxivs.each_slice(SliceSize) do |ids|
             begin
               xml = fetch_xml("http://export.arxiv.org/api/query?id_list=#{ids.join(',')}&max_results=#{SliceSize}")
               xml.xpath('//entry/id').map(&:content).each_with_index do |id, i|
                 id.gsub!('http://arxiv.org/abs/', '')
-                info 'arXiv download', :key => id
+                info 'arXiv download', key: id
                 arxiv_download(@dir, id)
               end
             rescue => ex
-              error('arXiv query failed', :ex => ex)
+              error('arXiv query failed', ex: ex)
             end
           end
         end

data/lib/bibsync/actions/find_my_citations.rb CHANGED Viewed

@@ -5,8 +5,8 @@ module BibSync
       include Utils
       def initialize(options)
-        raise 'Bibliography must be set' unless @bib = options[:bib]
-        raise 'Tex directory must be set' unless @dir = options[:citedbyme]
+        raise 'Option :bib is required' unless @bib = options[:bib]
+        raise 'Option :citedbyme is required' unless @dir = options[:citedbyme]
         raise "#{@dir} is not a directory" unless File.directory?(@dir)
       end
@@ -19,7 +19,7 @@ module BibSync
             $1.split(/\s*,\s*/).each do |key|
               key.strip!
               file = @bib.relative_path(file)
-              debug("Cited in #{file}", :key => key)
+              debug("Cited in #{file}", key: key)
               (cites[key] ||= []) << file
             end
           end
@@ -35,7 +35,7 @@ module BibSync
           if @bib[key]
             @bib[key][:citedbyme] = files
           else
-            warning("Cited in #{files} but not found in #{@bib.file}", :key => key)
+            warning("Cited in #{files} but not found in #{@bib.file}", key: key)
           end
         end

data/lib/bibsync/actions/jabref_format.rb CHANGED Viewed

@@ -1,11 +1,11 @@
 module BibSync
   module Actions
-    class JabrefFormat
+    class JabRefFormat
       include Utils
       include Log
       def initialize(options)
-        raise 'Bibliography must be set' unless @bib = options[:bib]
+        raise 'Option :bib is required' unless @bib = options[:bib]
       end
       def run

data/lib/bibsync/actions/synchronize_files.rb CHANGED Viewed

@@ -7,8 +7,8 @@ module BibSync
       FileTypes = %w(djvu pdf ps)
       def initialize(options)
-        raise 'Bibliography must be set' unless @bib = options[:bib]
-        raise 'Directory must be set' unless @dir = options[:dir]
+        raise 'Option :bib is required' unless @bib = options[:bib]
+        raise 'Option :dir is required' unless @dir = options[:dir]
       end
       def run
@@ -17,16 +17,15 @@ module BibSync
         files = {}
         Dir[File.join(@dir, "**/*.{#{FileTypes.join(',')}}")].sort.each do |file|
           name = File.basename(file)
-          key, type = split_filename(name)
+          key = name_without_ext(name)
           raise "Duplicate file #{name}" if files[key]
           files[key] = file
         end
         files.each do |key, file|
           unless entry = @bib[key]
-            info('New file', :key => key)
-            entry = Bibliography::Entry.new
-            entry.key = key
+            info('New file', key: key)
+            entry = Bibliography::Entry.new(key: key)
             @bib << entry
           end

data/lib/bibsync/actions/synchronize_metadata.rb CHANGED Viewed

@@ -5,7 +5,7 @@ module BibSync
       include Log
       def initialize(options)
-        raise 'Bibliography must be set' unless @bib = options[:bib]
+        raise 'Option :bib is required' unless @bib = options[:bib]
         @force = options[:resync]
       end
@@ -16,10 +16,8 @@ module BibSync
           next if entry.comment?
           if @force || !(entry[:title] && entry[:author] && entry[:year])
-            determine_arxiv_and_doi(entry)
             if entry[:arxiv]
-              if entry.key == arxiv_id(entry, :prefix => false, :version => true)
+              if entry.key == arxiv_id(entry, prefix: false, version: true)
                 entry = rename_arxiv_file(entry)
                 next unless entry
               end
@@ -40,28 +38,28 @@ module BibSync
       private
       def update_aps_abstract(entry)
-        info("Downloading APS abstract", :key => entry)
+        info("Downloading APS abstract", key: entry)
         html = fetch_html("http://link.aps.org/doi/#{entry[:doi]}")
         entry[:abstract] = html.css('.aps-abstractbox').map(&:content).first
       rescue => ex
-        error('Abstract download failed', :key => entry, :ex => ex)
+        error('Abstract download failed', key: entry, ex: ex)
       end
       def update_doi(entry)
-        info('Downloading doi.org metadata', :key => entry)
+        info('Downloading DOI metadata', key: entry)
         text = fetch("http://dx.doi.org/#{entry[:doi]}", 'Accept' => 'text/bibliography; style=bibtex')
         raise text if text == 'Unknown DOI'
         Bibliography::Entry.parse(text).each {|k, v| entry[k] = v }
       rescue => ex
         entry.delete(:doi)
-        error('doi download failed', :key => entry, :ex => ex)
+        error('DOI download failed', key: entry, ex: ex)
       end
       # Rename arxiv file if key contains version
       def rename_arxiv_file(entry)
         file = entry.file
-        key = arxiv_id(entry, :prefix => false, :version => false)
+        key = arxiv_id(entry, prefix: false, version: false)
         if old_entry = @bib[key]
           # Existing entry found
@@ -71,7 +69,7 @@ module BibSync
           entry[:arxiv] =~ /v(\d+)$/
           new_version = $1
           if old_version && new_version && old_version >= new_version
-            info('Not updating existing entry with older version', :key => old_entry)
+            info('Not updating existing entry with older version', key: old_entry)
             File.delete(file[:path]) if file
             return nil
           end
@@ -79,14 +77,14 @@ module BibSync
           old_entry[:arxiv] = entry[:arxiv]
           old_entry[:doi] = entry[:doi]
           entry = old_entry
-          info('Updating existing entry', :key => entry)
+          info('Updating existing entry', key: entry)
         else
           # This is a new entry
           entry.key = key
         end
         if file
-          new_path = file[:path].sub(arxiv_id(entry, :prefix => false, :version => true), key)
+          new_path = file[:path].sub(arxiv_id(entry, prefix: false, version: true), key)
           File.rename(file[:path], new_path)
           entry.file = new_path
         end
@@ -97,8 +95,8 @@ module BibSync
       end
       def update_arxiv(entry)
-        info('Downloading arXiv metadata', :key => entry)
-        xml = fetch_xml("http://export.arxiv.org/oai2?verb=GetRecord&identifier=oai:arXiv.org:#{arxiv_id(entry, :prefix => true, :version => false)}&metadataPrefix=arXiv")
+        info('Downloading arXiv metadata', key: entry)
+        xml = fetch_xml("http://export.arxiv.org/oai2?verb=GetRecord&identifier=oai:arXiv.org:#{arxiv_id(entry, prefix: true, version: false)}&metadataPrefix=arXiv")
         error = xml.xpath('//error').map(&:content).first
         raise error if error
@@ -108,7 +106,7 @@ module BibSync
         entry[:author] = xml.xpath('//arXiv/authors/author').map do |author|
           "{#{author.xpath('keyname').map(&:content).first}}, {#{author.xpath('forenames').map(&:content).first}}"
         end.join(' and ')
-        entry[:journal] = ArXivJournal
+        entry[:journal] = 'ArXiv e-prints'
         entry[:eprint] = entry[:arxiv]
         entry[:archiveprefix] = 'arXiv'
         date = xml.xpath('//arXiv/updated').map(&:content).first || xml.xpath('//arXiv/created').map(&:content).first
@@ -124,49 +122,8 @@ module BibSync
         entry[:url] = "http://arxiv.org/abs/#{entry[:arxiv]}"
       rescue => ex
         entry.delete(:arxiv)
-        error('arXiv download failed', :key => entry, :ex => ex)
+        error('arXiv download failed', key: entry, ex: ex)
       end
-      def determine_arxiv_and_doi(entry)
-        if file = entry.file
-          if file[:type] == :PDF && !entry[:arxiv] && !entry[:doi]
-            debug('Searching for arXiv or doi identifier in pdf file', :key => entry)
-            text = `pdftotext -f 1 -l 2 #{Shellwords.escape file[:path]} - 2>/dev/null`
-            entry[:arxiv] = $1 if text =~ /arXiv:\s*([\w\.\/\-]+)/
-            entry[:doi] = $1 if text =~ /doi:\s*([\w\.\/\-]+)/i
-          end
-          if !entry[:arxiv] && file[:name] =~ /^(\d+.\d+v\d+)\.\w+$/
-            debug('Interpreting file name as arXiv identifier', :key => entry)
-            entry[:arxiv] = $1
-          end
-          if !entry[:doi] && file[:name] =~ /^(PhysRev.*?|RevModPhys.*?)\.\w+$/
-            debug('Interpreting file name as doi identifier', :key => entry)
-            entry[:doi] = "10.1103/#{$1}"
-          end
-        end
-        if !entry[:arxiv] && entry[:doi]
-          begin
-            info('Fetch missing arXiv identifier', :key => entry)
-            xml = fetch_xml("http://export.arxiv.org/api/query?search_query=doi:#{entry[:doi]}&max_results=1")
-            if xml.xpath('//entry/doi').map(&:content).first == entry[:doi]
-              id = xml.xpath('//entry/id').map(&:content).first
-              if id =~ %r{\Ahttp://arxiv.org/abs/(.+)\Z}
-                entry[:arxiv] = $1
-              end
-            end
-          rescue => ex
-            error('arXiv doi query failed', :ex => ex, :key => entry)
-          end
-        end
-        unless entry[:arxiv] || entry[:doi]
-          warning('No arXiv or doi identifier found', :key => entry)
-        end
-      end
     end
   end
 end

data/lib/bibsync/actions/validate.rb CHANGED Viewed

@@ -10,7 +10,7 @@ module BibSync
       def run
         notice 'Check validity'
-        titles, arxivs = {}, {}
+        titles, arxivs, dois = {}, {}, {}
         @bib.each do |entry|
           next if entry.comment?
@@ -18,14 +18,16 @@ module BibSync
           w = []
           file = entry.file
-          w << 'Missing file' unless file && File.file?(file[:path])
-          w += [:title, :author, :year, :abstract].reject {|k| entry[k] }.map {|k| "Missing #{k}" }
+          missing = []
+          missing << :file unless file && File.file?(file[:path])
+          missing += [:title, :author, :year, :abstract].reject {|k| entry[k] }
+          w << "Missing #{missing.map(&:to_s).sort.join(', ')}" unless missing.empty?
-          w << 'Invalid file' if split_filename(file[:name]).first != entry.key if file
+          w << 'File name does not match entry key' if name_without_ext(file[:name]) != entry.key if file
           if entry[:arxiv]
-            id = arxiv_id(entry, :version => false, :prefix => true)
+            id = arxiv_id(entry, version: false, prefix: true)
             if arxivs.include?(id)
               w << "ArXiv duplicate of '#{arxivs[id]}'"
             else
@@ -33,6 +35,14 @@ module BibSync
             end
           end
+          if id = entry[:doi]
+            if dois.include?(id)
+              w << "DOI duplicate of '#{dois[id]}'"
+            else
+              dois[id] = entry.key
+            end
+          end
           if entry[:title]
             if titles.include?(entry[:title])
               w << "Title duplicate of '#{titles[entry[:title]]}'"
@@ -41,7 +51,7 @@ module BibSync
             end
           end
-          warning(w.join(', '), :key => entry) unless w.empty?
+          warning(w.join('; '), key: entry) unless w.empty?
         end
       end
     end

data/lib/bibsync/actions.rb CHANGED Viewed

@@ -1,7 +1 @@
-require 'bibsync/actions/check_versions'
-require 'bibsync/actions/synchronize_files'
-require 'bibsync/actions/synchronize_metadata'
-require 'bibsync/actions/validate'
-require 'bibsync/actions/jabref_format'
-require 'bibsync/actions/fetch_from_arxiv'
-require 'bibsync/actions/find_my_citations'
+Dir[File.join(File.dirname(__FILE__), 'actions', '*.rb')].each {|f| require f }