RubyGems - edi4r - Versions diffs - 0.9.4.1 - Mend

edi4r 0.9.4.1

Files changed (62) hide show

data/AuthorCopyright +10 -0
data/COPYING +56 -0
data/ChangeLog +106 -0
data/README +66 -0
data/TO-DO +35 -0
data/Tutorial +609 -0
data/VERSION +1 -0
data/bin/edi2xml.rb +103 -0
data/bin/editool.rb +151 -0
data/bin/xml2edi.rb +50 -0
data/data/edifact/iso9735/SDCD.10000.csv +10 -0
data/data/edifact/iso9735/SDCD.20000.csv +10 -0
data/data/edifact/iso9735/SDCD.30000.csv +11 -0
data/data/edifact/iso9735/SDCD.40000.csv +31 -0
data/data/edifact/iso9735/SDCD.40100.csv +31 -0
data/data/edifact/iso9735/SDED.10000.csv +37 -0
data/data/edifact/iso9735/SDED.20000.csv +37 -0
data/data/edifact/iso9735/SDED.30000.csv +43 -0
data/data/edifact/iso9735/SDED.40000.csv +129 -0
data/data/edifact/iso9735/SDED.40100.csv +130 -0
data/data/edifact/iso9735/SDMD.10000.csv +0 -0
data/data/edifact/iso9735/SDMD.20000.csv +0 -0
data/data/edifact/iso9735/SDMD.30000.csv +6 -0
data/data/edifact/iso9735/SDMD.40000.csv +17 -0
data/data/edifact/iso9735/SDMD.40100.csv +17 -0
data/data/edifact/iso9735/SDSD.10000.csv +8 -0
data/data/edifact/iso9735/SDSD.20000.csv +8 -0
data/data/edifact/iso9735/SDSD.30000.csv +12 -0
data/data/edifact/iso9735/SDSD.40000.csv +34 -0
data/data/edifact/iso9735/SDSD.40100.csv +34 -0
data/data/edifact/untdid/EDCD.d01b.csv +200 -0
data/data/edifact/untdid/EDCD.d96a.csv +161 -0
data/data/edifact/untdid/EDED.d01b.csv +641 -0
data/data/edifact/untdid/EDED.d96a.csv +462 -0
data/data/edifact/untdid/EDMD.d01b.csv +3419 -0
data/data/edifact/untdid/EDMD.d96a.csv +2144 -0
data/data/edifact/untdid/EDSD.d01b.csv +158 -0
data/data/edifact/untdid/EDSD.d96a.csv +127 -0
data/data/edifact/untdid/IDCD.d01b.csv +95 -0
data/data/edifact/untdid/IDMD.d01b.csv +238 -0
data/data/edifact/untdid/IDSD.d01b.csv +75 -0
data/lib/edi4r.rb +928 -0
data/lib/edi4r/diagrams.rb +567 -0
data/lib/edi4r/edi4r-1.2.dtd +20 -0
data/lib/edi4r/edifact-rexml.rb +221 -0
data/lib/edi4r/edifact.rb +1627 -0
data/lib/edi4r/rexml.rb +256 -0
data/lib/edi4r/standards.rb +495 -0
data/test/eancom2webedi.rb +380 -0
data/test/groups.edi +1 -0
data/test/in1.edi +1 -0
data/test/in1.inh +3 -0
data/test/in2.edi +1 -0
data/test/in2.xml +350 -0
data/test/test_basics.rb +209 -0
data/test/test_edi_split.rb +53 -0
data/test/test_loopback.rb +21 -0
data/test/test_minidemo.rb +84 -0
data/test/test_rexml.rb +98 -0
data/test/test_tut_examples.rb +131 -0
data/test/webedi2eancom.rb +408 -0
metadata +110 -0

@@ -0,0 +1,10 @@
+== Author
+Heinz W. Werntges, FH Wiesbaden
+(edi@informatik.fh-wiesbaden.de)
+== Copyright
+Copyright (c) 2006  Heinz W. Werntges.
+Licensed under the same terms as Ruby.

data/COPYING ADDED

@@ -0,0 +1,56 @@
+  Ruby is copyrighted free software by Yukihiro Matsumoto <matz@netlab.jp>.
+  You can redistribute it and/or modify it under either the terms of the GPL
+  (see the file GPL), or the conditions below:
+  1. You may make and give away verbatim copies of the source form of the
+     software without restriction, provided that you duplicate all of the
+     original copyright notices and associated disclaimers.
+  2. You may modify your copy of the software in any way, provided that
+     you do at least ONE of the following:
+       a) place your modifications in the Public Domain or otherwise
+          make them Freely Available, such as by posting said
+	  modifications to Usenet or an equivalent medium, or by allowing
+	  the author to include your modifications in the software.
+       b) use the modified software only within your corporation or
+          organization.
+       c) give non-standard binaries non-standard names, with
+          instructions on where to get the original software distribution.
+       d) make other distribution arrangements with the author.
+  3. You may distribute the software in object code or binary form,
+     provided that you do at least ONE of the following:
+       a) distribute the binaries and library files of the software,
+	  together with instructions (in the manual page or equivalent)
+	  on where to get the original distribution.
+       b) accompany the distribution with the machine-readable source of
+	  the software.
+       c) give non-standard binaries non-standard names, with
+          instructions on where to get the original software distribution.
+       d) make other distribution arrangements with the author.
+  4. You may modify and include the part of the software into any other
+     software (possibly commercial).  But some files in the distribution
+     are not written by the author, so that they are not under these terms.
+     For the list of those files and their copying conditions, see the
+     file LEGAL.
+  5. The scripts and library files supplied as input to or produced as
+     output from the software do not automatically fall under the
+     copyright of the software, but belong to whomever generated them,
+     and may be sold commercially, and may be aggregated with this
+     software.
+  6. THIS SOFTWARE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR
+     IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED
+     WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
+     PURPOSE.

data/ChangeLog ADDED

@@ -0,0 +1,106 @@
+= Change log
+== 0.9.4.1
+=== News
+* XML support added (again; no separate module anymore)!
+* DIN 16557-4 now supported (one-way, EDI-to-XML)
+* New standalone tools: edi2xml.rb, xml2edi.rb
+* New testcase: test_rexml.rb
+* EDI::Interchange: New convenience methods: parse(), peek(), detect()
+* Support for compressed data (gzip via zlib, bzip2 if "bzcat" available)
+* peek(): High-speed access to (just the) Interchange header data
+=== Changes
+* Some optional calling parameters added
+* editool.rb: Now supporting more standards, compression, and peek mode/reports
+=== Misc improvements & bug fixes
+* Bug in EDIFACT scanner fixed - scanner re-implemented
+* Windows supported now (workaround for issue with Pathname#realpath).
+  Note: Put "ruby" in front of edi4r scripts & tools when invoked
+	in pipe mode in a DOS shell, or the pipe fill fail.
+== 0.9.4.0
+=== New structures
+* Time: Class method "edifact", method "format" added
+* MsgGroup: Now fully supported
+=== Changes
+* New test case "test_minidemo" aimed at fixes in DE#to_s, see below
+* "test_basics" now covering MsgGroup tests as well
+* Special setters de0020=, de0048=, de0062=, de0340= added
+  to make header/trailer changes consistent.
+* Setters de0001=, de0002= added to prevent changes to charset and version
+* E::Message#validate: Improved
+* Tests updated to reflect new features
+=== Misc improvements & bug fixes
+* E::Interchange.parse: hnd.close removed; some code cleanup
+* EDI::MsgGroup: Completed
+* EDI::E::MsgGroup: Completed, now fully supported
+* E::DE#to_s:  Fix - now calling super()
+* DE#to_s:     Fix - adding leading zeroes for too short numeric values
+* DE#validate: No more warnings if format e.g. n6 and fixable
+* E::Illegal_Charset_Patterns: Bugfix ('+' missing, '*' included twice)
+== 0.9.3.1
+Bugfix release, now integrated in 0.9.4
+=== New structures
+* (none so far)
+=== Changes
+* New test case "test_minidemo" aimed at fixes in DE#to_s, see below
+=== Misc improvements & bug fixes
+* E::Interchange.parse: hnd.close removed; some code cleanup
+* E::DE#to_s:  Fix - now calling super()
+* DE#to_s:     Fix - adding leading zeroes for too short numeric values
+* DE#validate: No more warnings if format e.g. n6 and fixable
+== 0.9.3
+=== New structures
+* Improved class hierarchy
+* Methods: More consistent, better inheritance
+* Removed: write() (use to_s !)
+* inspect(): more attributes for segments
+* EDI::E::UNA now bundles all UNA related methods
+=== Changes
+* New access to DE/CDE arrays (like ) through prefix 'a'
+  prevents users from accidentally overwriting a DE.
+  Previous access through cCxxx[] or dxxxx[] will fail now.
+* Some parameters of EDI::E::Interchange.new have changed,
+  several more added.
+* UNA handling: Now through attributes of UNA class.
+=== Misc improvements
+* Improved handling of both decimal signs
+* Bug removed in escaping of special chars
+* Refactoring of some internals
+* RDoc now usable - completely overhauled.
+== 0.9.2
+=== Misc improvements
+* New parsing basics: "String::separate" replaced by EDI::E::edi_split
+* Parsing bug removed
+* More validation features
+* "inspect", "find_all" added
+* "descendants_and_self" etc. added to EDI::Segment
+* bin/editool.rb added (validate, list, inspect EDI data)
+== 0.9.1
+=== First release as a gem
+* Split from the edi4r core package, turned into a separate one.
+* Unit tests added
+* RDoc documentation added
+* Modular loading scheme for normdata added (through EDI_NDB_PATH)
+* Parameter usage overhauled
+* Rolled into a gem
+== 0.8.x
+* Used internally for projects and teaching

data/README ADDED

@@ -0,0 +1,66 @@
+= EDI TOOLKIT for RUBY (edi4r)
+This is Ruby gem <b>edi4r</b> version
+:include:VERSION
+Edi4r was created to greatly simplify the creation and processing of data for
+Electronic Data Interchange (EDI). In particular, it supports the UN/EDIFACT
+syntax (ISO 9573) and optionally SAP IDocs.
+== Installation
+Install it as any other Ruby gem, e.g.:
+  sudo gem install edi4r-<version>.gem
+== Usage
+  require 'rubygems'
+  require_gem 'edi4r'
+  require 'edi4r/edifact' # optional
+  # Build a UN/EDIFACT interchange from its character representation in a file:
+  ic = nil
+  File.open("received.edi") {|hnd| ic = EDI::E::Interchange.parse( hnd ) }
+  ic.each do |msg|
+    # Process message, here: Just list document numbers from (only) segment BGM
+    puts msg['BGM'].first.d1004
+  end
+  # Create a minimalistic interchange
+  ic  = EDI::E::Interchange.new  # Default: syntax version=3, charset = UNOB
+  msg = ic.new_message           # Default: ORDERS D.96A
+  bgm = msg.new_segment('BGM')   # Obtain an empty segment
+  bgm.cC002.d1001 = '220'	 # Add some content to mandatory elements
+  bgm.d1004 = 'PO-54321'
+  dtm = msg.new_segment('DTM')
+  dtm.cC507.d2005 = '137'
+  uns = msg.new_segment('UNS')
+  uns.d0081 = 'S'
+  [bgm, dtm, uns].each {|seg| msg.add seg} # Add segments to message
+  ic.add msg                     # Add message to interchange - ready to go!
+  ic.header.cS002.d0004 = 'sender'; ic.header.cS003.d0010 = 'recipient' # UNB
+  ic.validate # Conforming to standard?
+  print ic    # Could be sent that way!
+== See also
+* Background[link:files/lib/edi4r_rb.html] info about data structure and classes.
+* A Tutorial[link:files/Tutorial.html] for examples of use.
+* A ChangeLog[link:files/ChangeLog.html] will be maintained; it is just starting now.
+* Finally, see TO-DO[link:files/TO-DO.html] for the current wish list.
+* This code is put under the Ruby license, see COPYING[link:files/COPYING.html] for details.
+:include:AuthorCopyright
+Enjoy,
+Heinz

data/TO-DO ADDED

@@ -0,0 +1,35 @@
+= To-do list
+== Documentation
+* Further improve RDoc support
+* DocBook based tutorial
+* FAQ section
+* Intro to EDI in general
+* Outlook: More components (of a real EDI/EAI server), e.g. for messaging
+== XML support
+* DTD generation from Diagrams
+== Core features
+* E: more SV4 support, I-EDI support
+* E: Support for code lists, subsets, restricted codelists
+* Recovery from parsing errors, ability to skip faulty messages
+* Consider a "C" extension for fast segmentation of plain EDIFACT data
+* Generic support for CSV/variable-length and fixed record formats?
+* Add a generic number generator
+* Add EAN/GTIN/EPC helpers
+== XPath support
+* method "each" accepting XPath parameter
+* Index op [] accepting XPath
+* More XPath expressions + corresponding methods
+== Applications
+* Better list mode in editool.rb
+== Test suite
+* Extend
+== Details
+* Avoid module name 'Dir' - might collide with std. classname 'Dir'

data/Tutorial ADDED

@@ -0,0 +1,609 @@
+= Tutorial
+== Getting started
+=== Installation
+ sudo gem install edi4r
+ sudo gem install edi4r-tdid
+=== Require statements
+ require "rubygems"
+ require "edi4r"         # Try require_gem if this fails on your site
+ require "edi4r/edifact"
+ require "edi4r-tdid"    # Try require_gem if this fails on your site
+== Creating an (outbound) UN/EDIFACT interchange
+=== An empty interchange
+ ic = EDI::E::Interchange.new
+creates an empty interchange object with syntax version 3
+and charset UNOB. You can make this a bit more explicit
+by passing parameters as hash components:
+  ic = EDI::E::Interchange.new( :version => 3, :charset => 'UNOB' )
+See the source for more parameters.
+=== An empty message
+  msg = ic.new_message
+creates an empty message in the context of the given interchange,
+i.e. the syntax version, charset, UNA settings, interactive or batch EDI.
+By default, the message type is <tt>ORDERS D.96A</tt>. Select any
+message from any UN/TDID by passing the corresponding parameters
+as hash components:
+  msg = ic.new_message( :msg_type=>'ORDERS', :version=>'D', :release=>'96A',
+			:resp_agency=>'UN' )
+Hash components which you do not specify are taken from a set of defaults.
+=== Filling an interchange
+You may add messages to the interchange any time by calling method add():
+  ic.add( msg )
+When adding new messages to an interchange, they get appended to the
+current interchange content. There is no method to insert a message
+at any other location. If you need to do that, hold your messages
+in an array, sort them any way you like, and finally add them
+to the interchange in the desired sequence.
+Note that each messag gets validated by default when you add it to
+the interchange. If your message needs to be completed only later,
+you may disable validation by calling:
+  ic.add( msg, false )
+=== Filling a message
+A freshly created message is empty, aside from its header and trailer
+which we shall discuss later. Simply create the segments you want to add,
+fill them, and add them to the message:
+  seg = msg.new_segment( 'BGM' )
+Here, we derived a BGM segment from the current context,
+i.e. an UN/TDID like D.96A which we specified when creating the message given.
+Note that <tt>new_segment()</tt> accepts all segment tags available in the
+whole TDID's  segment directory - not just those usable within
+this message type.
+Add content to the segment (see below) and add it to the message:
+  msg.add( seg )
+Like with messages added to an interchange, it is your responsibility
+to assure the proper sequence of segments. You will need the UN/EDIFACT
+message structure, a subset description, or a message implementation
+guideline (MIG) handy in order to comply.
+It is possible to add empty or partially filled segments to a message.
+Just keep a reference to them and fill in their required data elements later.
+== Accessing Composites and Data Elements
+=== Background
+While interchanges and messages are basically empty when created,
+segments are not: They come equipped with the composites (CDE, composite
+data elements ) and data elements (DE) they comprise in their current
+context. Likewise, CDEs come equipped with the sequence of DEs
+which they contain according to the underlying TDID.
+Segments and CDEs are basically sequences (arrays) of their component
+elements. This sequence depends on the TDID of their context.
+E.g., a BGM segment from D.96A looks different from a BGM in D.01B.
+These sequences are fixed and cannot/should not be altered after
+creation of a segment or CDE.
+=== Getters for CDE
+There is a getter method for each CDE of a segment.
+Its name is simply prefix 'c' and the CDE name.
+In order to access, say, C002 in segment BGM, the getter
+is named 'c' + 'C002':
+  cde = seg.cC002
+(See below how to handle arrays)
+In most cases you'll need the CDE object only to access
+its component data elements. Let's see how that works:
+=== Getters for DE values
+During mapping tasks, we normally do not deal with the
+internal organization of DE objects. All we need is access
+to their <em>values</em>.
+Similarly to CDE getters, we build a DE getter by prepending
+a 'd' to the DE name. The result is a method that returns
+the current value of this DE (nil if unassigned), not the
+DE object itself:
+  order_number = seg.d1004  if cde.d1001 == 220
+The example shows that this concept is very convenient and works
+both with component DEs in a CDE and plain DEs in a segment.
+=== Setters for DE values
+We use the same approach for setters - a DE setter actually
+changes its value, not the DE object itself:
+  bgm = msg.new_segment( 'BGM' )
+  bgm.d1004 = '123456ABC'
+  bgm.cC002.d1001 = 220
+Well, that's both easy and readable! But what about the
+integer assigned to DE 1004? Don't worry - its string value
+is still what we later want to see in the interchange file.
+=== Setters for CDE and segments?
+There is no such thing! A CDE should always be derived
+from its proper context and must not be changed thereafter.
+Likewise, a segment's content is nothing a user should
+interfere with.
+Does that sound like dictatorship? Well, there *are* ways
+to manipulate CDEs and segments, but that's an advanced
+and rarely used topic well out of scope of this tutorial ...
+=== DE and CDE arrays
+Sometimes, a DE occurs multiple times within a segment or CDE,
+and a CDE may occur multiple times within a segment.
+Before syntax version 4, EDIFACT does not really employ
+the concept of an array. Instead, there are multiple
+occurrences of a particular DE or CDE listed in a row.
+In such a case, the corresponding getters and setters
+won't work. Actually, they raise a TypeError to make sure
+that you don't accidentally overlook that there is more
+than one instance of the given (C)DE.
+We obtain a whole array of *all* matching (C)DE instead
+and indicate this with a prefix 'a':
+  seg = msg.new_segment('PIA')
+  cde_list = seg.aC212
+  cde_list[0].d7140 = '54321'
+  cde_list[0].d7143 = 'SA'
+  cde_list[0].d3055 = 91
+  cde_list[1].d7140 = ... # etc
+  seg = msg.new_segment('NAD')
+  seg.cC080.a3036[0].value = 'E. X. Ample'
+  seg.cC080.a3036[1].value = 'Sales dept.'
+Note that a3036 returns an array of DE objects, not their values!
+We thus use DE setter <tt>value</tt> to actually assign
+new values to those DE objects.
+Sometimes, a CDE or segment contains the same DE more than once
+even if both instances are separated by a different element,
+like DE 3055 and 1131 in C088 of segment FII, which you may
+find in invoices. In that case the same concept holds: cde.a1131
+would fetch all instances, no matter if some other elements
+occur in between or not.
+== Building it all together
+OK, so we keep generating segments, filling their data elements
+with content, and adding them to the message that we are
+about to build - fine.
+Likewise, we add messages to the interchange. Fine - but how do we
+eventually get the output, how do we make sure that we have
+not forgotten anything, and how do we deal with the
+service segments (e.g. to define sender and recipient IDs) ?
+=== Headers and trailers
+Interchanges and messages are objects which may come with
+a header and trailer segment. In UN/EDIFACT, these are
+UNB/UNZ and UNH/UNT, respectively.
+In order to let us focus on the content, edi4r keeps these
+service segments away from us and tries its best to
+treat them automatically. For example, we do not have to
+count segments or messages - edi4r takes care of that
+and updates the corresponding DE in UNT and UNZ, respectively.
+If we really need to access data there,
+that's possible anytime through getters <tt>header</tt> and
+<tt>trailer</tt>. From then on, we just use the usual DE and
+CDE getters and setters. E.g., setting the UNB sender ID
+works like this:
+  ic.header.cS002.d0004 = '1234567'
+Setting the test indicator may look like this:
+  ic.header.d0035 = 1
+=== UNA handling
+The pseudo segment UNA commonly introduces an UN/EDIFACT
+interchange. It is shown there by default, which is a good
+idea in most cases. If you really have to switch it off, use:
+  ic.show_una = false
+It can be easily viewed e.g. by:
+  puts ic.una
+The six special characters it containes are both readable
+and modifiable. Please note that these characters are represented
+as their ASCII integer codes, not as one-character strings.
+Here is the list of corresponding getter methods:
+ ce_sep()	# Component data element separator, default ?:
+ de_sep()       # Data element separator, default: ?+
+ decimal_sign() # Both ?. and ?, are eligible
+ esc_char()     # Escape character, default: ??
+ rep_sep()      # Repetition occurrence indicator, default:
+                #  ?  (SV1-3), ?* (syntax version 4)
+ seg_term()     # Segment terminator, default: ?'
+Corresponding setters allow you to change all of them.
+Remember to pass ASCII values, not strings. Example:
+  pri.cC509.d5118 = 30.1
+  pri.to_s      --> "PRI+AAA:30.1::LIU"
+  ic.una.decimal_sign = ?,
+  pri.to_s      --> "PRI+AAA:30,1::LIU"
+  ic.una.ce_sep = ?/
+  pri.to_s      --> "PRI+AAA/30,1//LIU"
+=== Validation
+Edi4r comes with a set of built-in validation rules. In order to
+validate your interchange, just call
+  ic.validate
+This method will return the number of (recoverable) issues found.
+Error messages and warnings are written to $stderr. There are
+a few non-recoverable errors which raise exceptions.
+Actually, you may apply method validate() to almost all of the
+EDI objects mentioned so far. However, doing this once for the
+whole interchance will validate everything.
+=== Printing and saving
+Once our data is validated, we want to send it to our business partner.
+Actually, that's very simple: Output it e.g. to stdout or to a file
+by just printing it! This works nicely, because our EDI objects are
+all equipped with reasonable methods "to_s()".
+== Processing (inbound) interchanges
+=== Building the interchange object
+We build a whole interchange with a single call by passing
+its character representation as a String or IO object to class method
+parse():
+  ic = nil
+  File.open("inbound_01.edi") {|hnd| ic = EDI::E::Interchange.parse( hnd )}
+That's it - from now on we may access all its contents in much the
+same way as we did during generation.
+=== Iterating over messages
+Typically we'd like to loop through the messages contained:
+  ic.each { |msg|  map_message( msg ) }
+with some suitably created mapping procedure map_message().
+In this context, the interchange is treated as a container
+of messages. We therefore use the standard iterator each().
+Actually, index access is also available, as are following
+array-like methods:
+ [](int), index(obj), each(&b), find_all(&b), size(),
+ length(), first(), last().
+Examples:
+  second_msg = ic[1]
+  last_msg = ic.last
+=== Awaiting segments of a message
+Similarly, we iterate through the segments of a message. The following
+construction lets you select segments in their proper context
+(which is the segment group):
+  def map_message( msg )
+    # do your initialization here, then
+    msg.each do |seg|
+      seg_name = seg.name
+      seg_name += ' ' + seg.sg_name if seg.sg_name
+      case seg_name
+      when "BGM"
+        # do this ...
+      when "DTM"
+        # do that ...
+      when 'NAD SG2'
+        # react only if NAD occurs in segment group 2
+      # ... etc., finally:
+      default
+        raise "Segment #{seg_name}: Not accounted for!"
+    end
+  end
+If you need to obtain all segments with a given tag, pass the
+tag as a string to the index operator:
+  d = msg['DTM']  # Array of all DTM segments, any segment group
+Actually, a message behaves like a container just as an interchange does,
+so the array-like methods listed in the previous section also apply here:
+  d = msg.find_all {|seg| seg.name == 'DTM' && seg.sg_name == 'SG20'}
+=== Selecting segment group instances
+Iterating over all segments of a message sequentially tends to produce
+cluttered code when messages grow complex. Wouldn't it be nice
+if we could delegate e.g. the mapping of a whole NAD group to a
+specialized procedure? Still better - could we delegate mapping
+of a whole item group to a specialized routine?
+Actually that's quite easy when we recall that segment groups
+are side branches of the main trunk (or a higher-level branch)
+of a message diagram, with their trigger segments being the
+T-shaped "joints". Thus, segments of a segment group are mere
+descendants of their trigger segment in the branching diagram.
+Here are some helpful methods to select segments of a group:
+  # ...
+  when 'NAD SG2'
+    map_nad_sg2( seg.children_and_self ) # skip segment COM...
+  # ...
+  when 'LIN SG28'
+    map_item( seg.descendants_and_self )
+  # ...
+Methods
+  descendants(), descendants_and_self(), children(), and
+  children_and_self()
+are inspired by XPath axes. They return an array of segments
+which depend on the given (trigger) segment as their common
+ancestor.
+Using these selectors, writing modular mapping code
+now becomes an easy task.
+== Peeking into interchanges
+=== Background
+Sometimes we only need to extract some header information
+out of EDI files, e.g. in order to find out whether the content
+is UN/EDIFACT, who sent it to whom, or just to see if the
+UNB test indicator is set.
+We could of course apply EDI::E::Interchange.parse() and access
+the header of the resulting interchange object when we know that
+a given file contains UN/EDIFACT data. However, that would be
+a big waste of resources, especially for large interchanges.
+=== Method "peek()" for UN/EDIFACT data
+Edi4r instead offers method "peek()". It reads just enought bytes
+from the file to determine its contents and to decode the header.
+Like "parse()" it returns an Interchange object, but that one
+is empty except for the header (and a dummy trailer) segment.
+You can then extract any header element you need through the
+usual getters. Example: Find out if the test indicator is set.
+  def is_testdata?( hnd )
+    ic = EDI::E::Interchange.peek( hnd )
+    ic.d0035 == '1' || ic.d0035 == 1
+  end
+=== Auto-detection and implicit decompression
+Regular EDI users need to archive their business data.
+In simple cases, moving interchange files into proper folders
+after successfully processing them already does the job.
+You can save a lot of space though by compressing them.
+Applying "gzip" to EDIFACT data easily shrinks them to
+10 % of their original volume.
+So far, so good. Later though, you may need to extract
+a specific file from the archive, e.g. the interchange
+with control reference "ref3456" from a customer
+with sender id "xyz". Well, you do not need to maintain
+a separate index or decompress all files in the archive
+in order to find it. There is a generic class method
+"Interchange.peek()" that does all this for you.
+Consider the following code fragment that assumes
+that ARGV contains the list of files to search:
+  require 'zlib'
+  found = ARGV.find do |fname|
+            ic = EDI::Interchange.peek(File.open(fname))
+            h = ic.header
+            ic.syntax=='E' && h.cS002.d0004=='xyz' && h.d0020=='ref1234'
+          end
+  ic = EDI::Interchange.parse(File.open(found))
+  # ...
+Note that this code will work with both zipped and
+unzipped data, and with UN/EDIFACT as well as other content.
+== XML representation
+=== Background
+EDI interchanges may be regarded as abstract objects which need
+some representation when stored or exchanged. E.g., UN/EDIFACT
+interchanges may be expressed (represented) by syntax version 1-4
+and a choice of separator and termination characters
+without changing their identity. Likewise, interchanges
+may be represented by some suitable XML document type.
+There was a time when classical EDI representations were
+considered outdated and to be replaced by XML documents.
+We know by now that a mere change in representation does not help
+to resolve the real issues of e-business. Nonetheless,
+XML-based technology has become much more wide-spread than EDI,
+so chances are high that one has to integrate classical EDI data
+into a XML-driven architecture.
+There are (too) many ways to do this, and attempts have been made
+to standardize them. In particular, DIN 16557-4
+(http://www.beuth.de/cmd?workflowname=CSVList&websource=&artid=43768898)
+describes a way how to represent UN/EDIFACT interchanges as XML documents
+and how to describe them with DTDs.
+The DIN approach however focuses only on UN/EDIFACT and does not represent
+the logical structure of documents. It merely encodes segments as
+XML elements and data elements and composites as their attributes.
+The value of DTDs is limited, as they need to be generated for
+any particular interchange and lack information about mandatory
+data elements and CDEs, let alone code lists.
+This library therefore supports a generic approach that allows us
+to represent any interchange object as an XML document, be it
+a UN/EDIFACT interchange, a file of SAP IDocs, or an ANSI
+X.12 or other interchange when such a module becomes available.
+Native and XML representation are fully interchangeable,
+and the XML representation reflects information from the
+branching diagram, thus supporting considerably XPath-based
+processing (e.g. you could easily select an instance of
+a line item group). Formal validation through a single generic DTD
+is available, while in-depth validation remains available
+through this library at the abstract level.
+=== Generating an XML representation of an interchange
+The current implementation of XML features is built on
+Ruby's REXML module (alternative implementations are conceivable).
+Simply load the additional methods <em>after</em> loading all other
+optional EDI4R modules, then use method "to_xml()" like this:
+  # Other require statements, finally:
+  require "edi4r/rexml"
+  # Generate your interchange "ic", then:
+  xdoc = REXML::Document.new   # Empty REXML document
+  ic.to_xml( xdoc )            # Fill it
+  # The rest is standard REXML handling. Here, we write the xdoc to a file.
+  # (See REXML::Document.write() for details on indenting)
+  xdoc.write( File.open( 'mydata.xml','w'), 0 )
+=== Building an interchange from its XML representation
+No matter what EDI standard the interchange represents,
+its corresponding EDI4R object can be re-generated easily.
+Just make sure that you have loaded the corresponding module(s):
+  # Other require statements, finally:
+  require "edi4r/rexml"
+  ic = EDI::Interchange.parse( File.open('mydata.xml') )
+Yes, that's right: It's the same statement that would
+also load UN/EDIFACT data!
+If you know already what to expect, you might bypass EDI4R's
+auto-detection and directly call one of the parse_xml() methods:
+  xdoc = REXML::Document.new( File.open('mydata.xml') )
+  ic = EDI::E::Interchange.parse_xml( xdoc )
+=== Utilities: edi2xml.rb, xml2edi.rb
+These two scripts are included to further simplify your transition
+between traditional EDI representation and XML representation.
+They are command-line tools that simply wrap the library calls
+mentioned above. Example:
+  $ edi2xml.rb foo.edi > foo.xml
+  $ xml2edi.rb foo.xml > bar.edi
+  $ diff foo.edi bar.edi   # There should be no differences
+== Tools
+=== editool.rb
+Use this command-line tool e.g. when you want to
+* <b>list</b> UN/EDIFACT data in a readable way (one segment per line,
+  optionally indented according to segment level),
+* <b>validate</b> your EDI data
+* <b>analyze</b> the data more thoroughly through method "inspect()"
+* <b>report</b> header data quickly, one line per file
+Called without option, it just builds an internal memory model
+of the passed file(s) and raises an exception upon parsing errors.
+Thorough validation can be requested optionally. Example:
+  $ editool.rb -l foo.edi bar.edi
+  $ editool.rb -p *.edi *.xml
+== Further reading
+A pair of full-blown mappings (inhouse-to-EANCOM, EANCOM-to-inhouse)
+shows in much more detail how to do outbound and inbound mapping.
+See the source codes in the "test" folder!
+== Misc topics
+To be supplied later; currently just a room to collect keywords to cover.
+=== Debugging and viewing
+  :linebreak, :indented
+  inspect()
+  Exception classes
+  Segments: T-nodes, ordinal number, index, occurrence, max. occurrence
+  empty?, required?
+=== More advances features
+  Validation: warnings and exceptions (logging?)
+  Add-ons to class Time
+  Consistency of references common to headers and trailers
+  Inheriting settings from parent: UNH from UNG, UNG from UNB
+  Low-level access to Collection contents
+Enjoy,
+  -- Heinz