RubyGems - chunker - Versions diffs - 0.1.53 - Mend

chunker 0.1.53

This diff shows the contents of publicly available package versions released to one of the supported registries. It is provided for informational purposes only and reflects the changes between package versions as they appear in their respective public registries.

Files changed (6)

data/LICENSE ADDED

@@ -0,0 +1,29 @@
+Copyright (c) 2008, Mahlon E. Smith
+All rights reserved.
+Redistribution and use in source and binary forms, with or without modification, are
+permitted provided that the following conditions are met:
+    * Redistributions of source code must retain the above copyright notice, this
+      list of conditions and the following disclaimer.
+    * Redistributions in binary form must reproduce the above copyright notice, this
+      list of conditions and the following disclaimer in the documentation and/or
+      other materials provided with the distribution.
+    * Neither the name of the author, nor the names of contributors may be used to
+      endorse or promote products derived from this software without specific prior
+      written permission.
+THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
+CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
+EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
+PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
+PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF
+LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING
+NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
+SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

data/README ADDED

@@ -0,0 +1,59 @@
+Preface:
+	Ruby provides an automatic constant called DATA, which is an IO object
+	that references all text in the current file under an __END__ token.
+	I find it convenient to use the __END__ area to store all sorts of
+	stuff, rather than have to worry about distributing separate files.
+The problem:
+	The DATA constant is determined from whatever ruby believes $0 to be.
+	It doesn't work inside of other required libraries, so you'll see stuff
+	like this all the time:
+	END = File.open( __FILE__ ).read.split( /^__END__/, 2 ).last
+	It works, but it's more work than I want to do.
+A workaround:
+	Chunker solves this by parsing __END__ tokens for you, and making it
+	available in the form of a 'DATA_END' constant.  It installs this
+	constant into the class that includes Chunker, so you can use it again
+	and again, assuming you use a different file for each class.
+	It also automatically parses out other things that look like tokens, so
+	you can easily have multiple, distinct documents all embedded into the
+	__END__ block.
+Usage:
+	There is no direct interface to Chunker.  Just include it from a
+	class to have that file's __END__ data blocks magically become DATA_*
+	IO constants within that class.
+Example:
+	This produces the string "Yep.\n".
+		require 'chunker'
+		class Foom
+			include Chunker
+		end
+		puts Foom.new.class.const_get( :DATA_WICKED ).read
+		__END__
+		Stuff in the END block!
+		__WOW__
+		Ultimate success!
+		__WICKED__
+		Yep.
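
The manual idiom the README describes can be sketched as follows. This is only an illustration, not part of the gem: the helper name `end_section` and the sample text are hypothetical, and the split pattern also consumes the newline after the token (as `END_TOKEN` in lib/chunker.rb does), which the README one-liner does not.

```ruby
require 'tempfile'

# Hypothetical helper: return everything after the first line that
# consists of __END__. If the file has no such line, split returns a
# single element, so the whole file comes back unchanged.
def end_section( path )
  File.read( path ).split( /^__END__\r?\n/, 2 ).last
end

# Sample "library" source, invented for this sketch.
sample = "puts 'library code'\n__END__\nStuff in the END block!\n"

Tempfile.create( ['sketch', '.rb'] ) do |file|
  file.write( sample )
  file.flush
  puts end_section( file.path )
end
```

Reading the file by path works from any required library, which is the case where the built-in DATA constant is unavailable.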

data/Rakefile ADDED

@@ -0,0 +1,156 @@
+#!/usr/bin/env rake
+#
+# Chunker Rakefile
+#
+require 'rubygems'
+require 'pathname'
+require 'rake'
+require 'rake/packagetask'
+require 'rake/gempackagetask'
+require 'spec/rake/spectask'
+require 'rubygems/installer'
+require 'rubygems/uninstaller'
+######################################################################
+### P A T H S  A N D  F I L E S
+######################################################################
+BASEDIR = Pathname.new( __FILE__ ).expand_path.dirname.relative_path_from( Pathname.getwd )
+TEXT_FILES = %w{ Rakefile README LICENSE }.collect {|f| BASEDIR + f }
+SPECDIR    = BASEDIR + 'spec'
+SPEC_FILES = Pathname.glob( SPECDIR + '**/*_spec.rb' ).reject {|f| f =~ /^\.svn/ }
+LIBDIR    = BASEDIR + 'lib'
+LIB_FILES = Pathname.glob( LIBDIR + '**/*.rb').reject {|i| i =~ /\.svn/ }
+RELEASE_FILES = TEXT_FILES + LIB_FILES + SPEC_FILES
+######################################################################
+### H E L P E R S
+######################################################################
+### Given a +file+ path, find the first captured match of +pattern+,
+### or the string 'UNKNOWN' if not found. (easy to notice something is wrong.)
+###
+def find_pattern( file, pattern )
+	ver = nil
+	File.open( file ) do |f|
+		ver = f.each do |line|
+			break $1 if line =~ pattern
+		end
+	end
+	return ver.is_a?( String ) ? ver : 'UNKNOWN'
+end
+######################################################################
+### P A C K A G E   C O N S T A N T S
+######################################################################
+PKG_NAME      = 'chunker'
+PKG_VERSION   = find_pattern( LIBDIR + 'chunker.rb', /VERSION = ['"](\d\.\d(?:\/\d)?)['"]/ )
+PKG_REVISION  = find_pattern( LIBDIR + 'chunker.rb', /SVNRev = .+Rev: (\d+)/ )
+PKG_FILE_NAME = "#{PKG_NAME}-#{PKG_VERSION}.#{PKG_REVISION}"
+######################################################################
+### T A S K S
+######################################################################
+task :default => [ :test, :package ]
+### Task: run rspec tests
+###
+desc "Run tests"
+Spec::Rake::SpecTask.new('test') do |task|
+	task.spec_files = SPEC_FILES
+	task.spec_opts  = %w{ -c -fs }
+end
+### Task: generate ctags
+### This assumes exuberant ctags, since ctags 'native' doesn't support ruby anyway.
+###
+desc "Generate a ctags 'tags' file from Chunker source"
+task :ctags do
+	sh "ctags -R #{LIBDIR}"
+end
+### Task: Create gem from source
+###
+gem = Gem::Specification.new do |gem|
+	pkg_build = PKG_REVISION || 0
+	gem.summary           = "A convenience library for parsing __END__ tokens consistently."
+	gem.name              = PKG_NAME
+	gem.version           = "%s.%s" % [ PKG_VERSION, pkg_build ]
+	gem.author            = 'Mahlon E. Smith'
+	gem.email             = 'mahlon@martini.nu'
+	gem.homepage          = 'http://projects.martini.nu/ruby-modules/wiki/Chunker'
+	gem.rubyforge_project = 'mahlon'
+	gem.has_rdoc          = true
+	gem.files = RELEASE_FILES.
+		collect {|f| f.relative_path_from(BASEDIR).to_s }
+	gem.test_files	= SPEC_FILES.
+		collect {|f| f.relative_path_from(BASEDIR).to_s }
+	gem.description = <<-EOF
+	Ruby provides an automatic constant called DATA, which is an IO object
+	that references all text in the current file under an __END__ token.
+	I find it convenient to use the __END__ area to store all sorts of
+	stuff, rather than have to worry about distributing separate files.
+	The DATA constant is determined from whatever ruby believes $0 to be.
+	It doesn't work inside of other required libraries, so you'll see stuff
+	like this all the time:
+	END = File.open( __FILE__ ).read.split( /^__END__/, 2 ).last
+	It works, but it's more work than I want to do.
+	Chunker solves this by parsing __END__ tokens for you, and making it
+	available in the form of a 'DATA_END' constant.  It installs this
+	constant into the class that includes Chunker, so you can use it again
+	and again, assuming you use a different file for each class.
+	It also automatically parses out other things that look like tokens, so
+	you can easily have multiple, distinct documents all embedded into the
+	__END__ block.
+	EOF
+end
+Rake::GemPackageTask.new( gem ) do |pkg|
+	pkg.need_zip     = true
+	pkg.need_tar     = true
+	pkg.need_tar_bz2 = true
+end
+### Task: install
+###
+task :install_gem => [ :package ] do
+	$stderr.puts
+	installer = Gem::Installer.new( "pkg/#{PKG_FILE_NAME}.gem" )
+	installer.install
+end
+task :install => [ :install_gem ]
+### Task: uninstall
+###
+task :uninstall_gem do
+	uninstaller = Gem::Uninstaller.new( PKG_NAME )
+	uninstaller.uninstall
+end
+task :uninstall => [ :uninstall_gem ]
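
The way the Rakefile appears to derive the gem version (a VERSION string plus the SVN revision, both scraped from lib/chunker.rb) can be sketched as below. The helper `first_capture` is a hypothetical stand-in for `find_pattern` that reads from any IO, and the VERSION pattern is simplified; the two sample lines are copied from lib/chunker.rb.

```ruby
require 'stringio'

# Hypothetical stand-in for find_pattern: return the first capture of
# +pattern+ found in +io+, or 'UNKNOWN' when no line matches.
def first_capture( io, pattern )
  io.each_line do |line|
    match = pattern.match( line )
    return match[1] if match
  end
  'UNKNOWN'
end

# Two lines as they appear in lib/chunker.rb.
source = StringIO.new( <<~'RUBY' )
  SVNRev = %q$Rev: 53 $
  VERSION = '0.1'
RUBY

version = first_capture( source, /VERSION = ['"](\d\.\d)['"]/ )
source.rewind
revision = first_capture( source, /SVNRev = .+Rev: (\d+)/ )

puts "#{version}.#{revision}"   # prints "0.1.53"
```

The result matches the version recorded in the gem metadata, 0.1.53.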

data/lib/chunker.rb ADDED

@@ -0,0 +1,135 @@
+#!/usr/bin/ruby
+#
+# Chunker: A convenience library for parsing __END__ tokens consistently.
+#
+# == Version
+#
+#	$Id: chunker.rb 53 2008-11-09 00:27:36Z mahlon $
+#
+# == Author
+#
+# * Mahlon E. Smith <mahlon@martini.nu>
+#
+# :include: LICENSE
+#
+### Namespace for the datablock parser.
+###
+module Chunker
+	require 'strscan'
+	require 'stringio'
+	# SVN Revision
+	#
+	SVNRev = %q$Rev: 53 $
+	# SVN Id
+	#
+	SVNId = %q$Id: chunker.rb 53 2008-11-09 00:27:36Z mahlon $
+	# Package version
+	#
+	VERSION = '0.1'
+	### Parser class for __END__ data blocks.
+	### Find each __TOKEN__ within the __END__, and put each into a
+	### DATA_TOKEN constant within the namespace that included us.
+	###
+	class DataParser
+		# The mark for a DATA block.
+		#
+		END_TOKEN = /^__END__\r?\n/
+		# The mark for a 'sub' block.
+		#
+		CHUNK_TOKEN = /^__([A-Z\_0-9]+)__\r?\n/
+		### Constructor: Given a +klass+ and an +io+ to the class file,
+		### extract the data blocks and install constants.
+		###
+		def initialize( klass, io )
+			io.open if io.closed?
+			end_string = io.read.split( END_TOKEN, 2 ).last
+			@klass   = klass
+			@scanner = StringScanner.new( end_string )
+			io.close
+			if @scanner.check_until( CHUNK_TOKEN )
+				# put each chunk into its own constant
+				self.extract_blocks
+			else
+				# no sub blocks, put the whole mess into DATA_END
+				@klass.const_set( :DATA_END, StringIO.new( end_string ) )
+			end
+		end
+		#########
+		protected
+		#########
+		### Parse the current +io+ for data blocks, set contents to
+		### IO constants in the including class.
+		###
+		def extract_blocks
+			label = nil
+			while @scanner.scan_until( CHUNK_TOKEN ) and ! @scanner.eos?
+				data = ''
+				# First pass, __END__ contents (until next token, instead
+				# of entire data block.)
+				#
+				if label.nil?
+					label = 'END'
+					data  = @scanner.pre_match
+					@scanner.pos = self.next_position
+				else
+					label = @scanner[1]
+					if data = @scanner.scan_until( CHUNK_TOKEN )
+						# Pull the next token text out of the data, set up the next pass
+						#
+						data         = data[ 0, data.length - @scanner[0].length ]
+						@scanner.pos = self.next_position
+					else
+						# No additional blocks
+						#
+						data = @scanner.rest
+					end
+				end
+				# Add the IO constant to the class that included me.
+				#
+				@klass.const_set( "DATA_#{label}".to_sym, StringIO.new( data ) )
+			end
+		end
+		### Return the next scanner position for searching.
+		###
+		def next_position
+			return @scanner.pos - @scanner[0].length
+		end
+	end
+	### Hook included: Find the file path for how we arrived here, and open
+	### it as an IO object.  Parse the IO for data block tokens.
+	###
+    def self.included( klass )
+		# klass.instance_eval{ __FILE__ }   awww, nope.
+		# __FILE__ won't work here, so we find the filename via caller().
+		#
+		io = File.open( caller(1).last.sub(/:.*?$/, ''), 'r' )
+		DataParser.new( klass, io )
+    end
+end
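
The block-splitting idea behind DataParser can be sketched in a simplified form. This is not the gem's code: `split_chunks` is a hypothetical helper that returns a Hash of label to text instead of installing DATA_* constants, and it assumes the text passed in is whatever follows the __END__ line.

```ruby
require 'strscan'

# Hypothetical helper: split the text that follows __END__ into a Hash
# of label => contents. Text before the first __TOKEN__ line is stored
# under 'END'.
def split_chunks( end_text )
  chunk_token = /^__([A-Z_0-9]+)__\r?\n/   # same shape as CHUNK_TOKEN
  scanner = StringScanner.new( end_text )
  chunks  = {}
  label   = 'END'
  start   = 0

  while scanner.scan_until( chunk_token )
    # StringScanner positions are byte offsets, so slice by bytes.
    token_start     = scanner.pos - scanner.matched.bytesize
    chunks[ label ] = end_text.byteslice( start, token_start - start )
    label = scanner[1]
    start = scanner.pos
  end

  # A failed scan_until leaves the position alone, so the remainder
  # belongs to the last label seen.
  chunks[ label ] = scanner.rest
  chunks
end

text = "Stuff in the END block!\n__WOW__\nUltimate success!\n__WICKED__\nYep.\n"
split_chunks( text ).each do |name, body|
  puts "DATA_#{name}: #{body.inspect}"
end
```

With the README's example text this yields three entries, END, WOW and WICKED, the last holding "Yep.\n".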

data/spec/chunker_spec.rb ADDED

@@ -0,0 +1,117 @@
+#!/usr/bin/env ruby
+BEGIN {
+	require 'pathname'
+	basedir = Pathname.new( __FILE__ ).dirname.parent
+	libdir = basedir + "lib"
+	$LOAD_PATH.unshift( libdir ) unless $LOAD_PATH.include?( libdir )
+}
+require 'chunker'
+require 'rubygems'
+require 'spec'
+ENDSTUFF = <<ENDSTUFF
+Stuff within the end block.
+Content of the END block
+Content of the END block
+Content of the END block
+Content of the END block
+ENDSTUFF
+HURGADURGA = <<HURGADURGA
+Content of the HURGADURGA block
+Content of the HURGADURGA block
+Content of the HURGADURGA block
+Content of the HURGADURGA block
+HURGADURGA
+HURRRRG = <<HURRRRG
+	123123123 123123123 123123123
+	123123123 123123123 123123123
+	123123123 123123123 123123123
+HURRRRG
+POOP = <<POOP
+Content of the POOP block
+POOP
+FILE_TEXT = <<EO_FILE_TEXT
+This is stuff we shouldn't see or care about.
+You know, stuff like code, presumably.
+__END__
+#{ENDSTUFF}
+EO_FILE_TEXT
+FILE_TEXT_MULTIPLE = <<EO_FILE_TEXT
+This is stuff we shouldn't see or care about.
+You know, stuff like code, presumably.
+__END__
+#{ENDSTUFF}
+__POOP__
+#{POOP}
+__HURRRRG__
+#{HURRRRG}
+__HURGADURGA__
+#{HURGADURGA}
+EO_FILE_TEXT
+describe Chunker::DataParser do
+	it "doesn't include content above the __END__ token" do
+		klass = Class.new
+		dp = Chunker::DataParser.new( klass, StringIO.new( FILE_TEXT_MULTIPLE ))
+		dp.instance_variable_get( :@scanner ).string.
+			should_not =~ /This is stuff we shouldn't see/
+	end
+	it "doesn't contain the __END__ token itself" do
+		klass = Class.new
+		dp = Chunker::DataParser.new( klass, StringIO.new( FILE_TEXT ))
+		dp.instance_variable_get( :@scanner ).string.should_not =~ /^__END__/
+	end
+end
+describe 'A class that includes Chunker' do
+	it "has all content in DATA_END if there are no sub blocks" do
+		File.stub!( :open ).and_return( StringIO.new( FILE_TEXT ))
+		klass = Class.new { include Chunker }
+		klass.constants.should_not include( 'DATA_POOP' )
+		klass.constants.should_not include( 'DATA_HURRRRG' )
+		klass.constants.should_not include( 'DATA_HURGADURGA' )
+		klass.constants.should include( 'DATA_END' )
+	end
+	it "separates data sub blocks into individual constants" do
+		File.stub!( :open ).and_return( StringIO.new( FILE_TEXT_MULTIPLE ))
+		klass = Class.new { include Chunker }
+		klass.constants.should include( 'DATA_END' )
+		klass.constants.should include( 'DATA_POOP' )
+		klass.constants.should include( 'DATA_HURRRRG' )
+		klass.constants.should include( 'DATA_HURGADURGA' )
+	end
+	it "has IO constants that contain the data block contents" do
+		File.stub!( :open ).and_return( StringIO.new( FILE_TEXT_MULTIPLE ))
+		klass = Class.new { include Chunker }
+		klass.const_get( :DATA_END ).read.chomp.should        == ENDSTUFF
+		klass.const_get( :DATA_POOP ).read.chomp.should       == POOP
+		klass.const_get( :DATA_HURRRRG ).read.chomp.should    == HURRRRG
+		klass.const_get( :DATA_HURGADURGA ).read.chomp.should == HURGADURGA
+	end
+end
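
The spec above is written for the RSpec 1 API (`require 'spec'`, `stub!`) and compares `klass.constants` against strings. On Ruby 1.9 and later, `Module#constants` returns symbols, so a check against a modern interpreter would look more like the sketch below. The helper `install_block` is hypothetical and only mirrors the `const_set` call in DataParser.

```ruby
require 'stringio'

# Hypothetical helper mirroring DataParser's const_set call: expose
# +text+ as a DATA_<label> StringIO constant on +klass+.
def install_block( klass, label, text )
  klass.const_set( "DATA_#{label}".to_sym, StringIO.new( text ) )
end

klass = Class.new
install_block( klass, 'POOP', "Content of the POOP block\n" )

# Constant names are symbols on Ruby 1.9 and later.
puts klass.constants.include?( :DATA_POOP )
puts klass.const_get( :DATA_POOP ).read
```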

metadata ADDED

@@ -0,0 +1,57 @@
+--- !ruby/object:Gem::Specification
+name: chunker
+version: !ruby/object:Gem::Version
+  version: 0.1.53
+platform: ruby
+authors:
+- Mahlon E. Smith
+autorequire:
+bindir: bin
+cert_chain: []
+date: 2008-11-10 00:00:00 -08:00
+default_executable:
+dependencies: []
+description: "Ruby provides an automatic constant called DATA, which is an IO object that references all text in the current file under an __END__ token.  I find it convenient to use the __END__ area to store all sorts of stuff, rather than have to worry about distributing separate files.  The DATA constant is determined from whatever ruby believes $0 to be. It doesn't work inside of other required libraries, so you'll see stuff like this all the time:  END = File.open( __FILE__ ).read.split( /^__END__/, 2 ).last  It works, but it's more work than I want to do.  Chunker solves this by parsing __END__ tokens for you, and making it available in the form of a 'DATA_END' constant.  It installs this constant into the class that includes Chunker, so you can use it again and again, assuming you use a different file for each class.  It also automatically parses out other things that look like tokens, so you can easily have multiple, distinct documents all embedded into the __END__ block."
+email: mahlon@martini.nu
+executables: []
+extensions: []
+extra_rdoc_files: []
+files:
+- Rakefile
+- README
+- LICENSE
+- lib/chunker.rb
+- spec/chunker_spec.rb
+has_rdoc: true
+homepage: http://projects.martini.nu/ruby-modules/wiki/Chunker
+post_install_message:
+rdoc_options: []
+require_paths:
+- lib
+required_ruby_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: "0"
+  version:
+required_rubygems_version: !ruby/object:Gem::Requirement
+  requirements:
+  - - ">="
+    - !ruby/object:Gem::Version
+      version: "0"
+  version:
+requirements: []
+rubyforge_project: mahlon
+rubygems_version: 1.3.1
+signing_key:
+specification_version: 2
+summary: A convenience library for parsing __END__ tokens consistently.
+test_files:
+- spec/chunker_spec.rb