RubyGems - edi4r - Versions diffs - 0.9.4.1 - Mend

edi4r 0.9.4.1

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (62) hide show

data/AuthorCopyright +10 -0
data/COPYING +56 -0
data/ChangeLog +106 -0
data/README +66 -0
data/TO-DO +35 -0
data/Tutorial +609 -0
data/VERSION +1 -0
data/bin/edi2xml.rb +103 -0
data/bin/editool.rb +151 -0
data/bin/xml2edi.rb +50 -0
data/data/edifact/iso9735/SDCD.10000.csv +10 -0
data/data/edifact/iso9735/SDCD.20000.csv +10 -0
data/data/edifact/iso9735/SDCD.30000.csv +11 -0
data/data/edifact/iso9735/SDCD.40000.csv +31 -0
data/data/edifact/iso9735/SDCD.40100.csv +31 -0
data/data/edifact/iso9735/SDED.10000.csv +37 -0
data/data/edifact/iso9735/SDED.20000.csv +37 -0
data/data/edifact/iso9735/SDED.30000.csv +43 -0
data/data/edifact/iso9735/SDED.40000.csv +129 -0
data/data/edifact/iso9735/SDED.40100.csv +130 -0
data/data/edifact/iso9735/SDMD.10000.csv +0 -0
data/data/edifact/iso9735/SDMD.20000.csv +0 -0
data/data/edifact/iso9735/SDMD.30000.csv +6 -0
data/data/edifact/iso9735/SDMD.40000.csv +17 -0
data/data/edifact/iso9735/SDMD.40100.csv +17 -0
data/data/edifact/iso9735/SDSD.10000.csv +8 -0
data/data/edifact/iso9735/SDSD.20000.csv +8 -0
data/data/edifact/iso9735/SDSD.30000.csv +12 -0
data/data/edifact/iso9735/SDSD.40000.csv +34 -0
data/data/edifact/iso9735/SDSD.40100.csv +34 -0
data/data/edifact/untdid/EDCD.d01b.csv +200 -0
data/data/edifact/untdid/EDCD.d96a.csv +161 -0
data/data/edifact/untdid/EDED.d01b.csv +641 -0
data/data/edifact/untdid/EDED.d96a.csv +462 -0
data/data/edifact/untdid/EDMD.d01b.csv +3419 -0
data/data/edifact/untdid/EDMD.d96a.csv +2144 -0
data/data/edifact/untdid/EDSD.d01b.csv +158 -0
data/data/edifact/untdid/EDSD.d96a.csv +127 -0
data/data/edifact/untdid/IDCD.d01b.csv +95 -0
data/data/edifact/untdid/IDMD.d01b.csv +238 -0
data/data/edifact/untdid/IDSD.d01b.csv +75 -0
data/lib/edi4r.rb +928 -0
data/lib/edi4r/diagrams.rb +567 -0
data/lib/edi4r/edi4r-1.2.dtd +20 -0
data/lib/edi4r/edifact-rexml.rb +221 -0
data/lib/edi4r/edifact.rb +1627 -0
data/lib/edi4r/rexml.rb +256 -0
data/lib/edi4r/standards.rb +495 -0
data/test/eancom2webedi.rb +380 -0
data/test/groups.edi +1 -0
data/test/in1.edi +1 -0
data/test/in1.inh +3 -0
data/test/in2.edi +1 -0
data/test/in2.xml +350 -0
data/test/test_basics.rb +209 -0
data/test/test_edi_split.rb +53 -0
data/test/test_loopback.rb +21 -0
data/test/test_minidemo.rb +84 -0
data/test/test_rexml.rb +98 -0
data/test/test_tut_examples.rb +131 -0
data/test/webedi2eancom.rb +408 -0
metadata +110 -0

data/AuthorCopyright ADDED

@@ -0,0 +1,10 @@
+== Author
+Heinz W. Werntges, FH Wiesbaden
+(edi@informatik.fh-wiesbaden.de)
+== Copyright
+Copyright (c) 2006  Heinz W. Werntges.
+Licensed under the same terms as Ruby.

data/COPYING ADDED

@@ -0,0 +1,56 @@
+  Ruby is copyrighted free software by Yukihiro Matsumoto <matz@netlab.jp>.
+  You can redistribute it and/or modify it under either the terms of the GPL
+  (see the file GPL), or the conditions below:
+  1. You may make and give away verbatim copies of the source form of the
+     software without restriction, provided that you duplicate all of the
+     original copyright notices and associated disclaimers.
+  2. You may modify your copy of the software in any way, provided that
+     you do at least ONE of the following:
+       a) place your modifications in the Public Domain or otherwise
+          make them Freely Available, such as by posting said
+	  modifications to Usenet or an equivalent medium, or by allowing
+	  the author to include your modifications in the software.
+       b) use the modified software only within your corporation or
+          organization.
+       c) give non-standard binaries non-standard names, with
+          instructions on where to get the original software distribution.
+       d) make other distribution arrangements with the author.
+  3. You may distribute the software in object code or binary form,
+     provided that you do at least ONE of the following:
+       a) distribute the binaries and library files of the software,
+	  together with instructions (in the manual page or equivalent)
+	  on where to get the original distribution.
+       b) accompany the distribution with the machine-readable source of
+	  the software.
+       c) give non-standard binaries non-standard names, with
+          instructions on where to get the original software distribution.
+       d) make other distribution arrangements with the author.
+  4. You may modify and include the part of the software into any other
+     software (possibly commercial).  But some files in the distribution
+     are not written by the author, so that they are not under these terms.
+     For the list of those files and their copying conditions, see the
+     file LEGAL.
+  5. The scripts and library files supplied as input to or produced as
+     output from the software do not automatically fall under the
+     copyright of the software, but belong to whomever generated them,
+     and may be sold commercially, and may be aggregated with this
+     software.
+  6. THIS SOFTWARE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR
+     IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED
+     WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
+     PURPOSE.

data/ChangeLog ADDED

@@ -0,0 +1,106 @@
+= Change log
+== 0.9.4.1
+=== News
+* XML support added (again; no separate module anymore)!
+* DIN 16557-4 now supported (one-way, EDI-to-XML)
+* New standalone tools: edi2xml.rb, xml2edi.rb
+* New testcase: test_rexml.rb
+* EDI::Interchange: New convenience methods: parse(), peek(), detect()
+* Support for compressed data (gzip via zlib, bzip2 if "bzcat" available)
+* peek(): High-speed access to (just the) Interchange header data
+=== Changes
+* Some optional calling parameters added
+* editool.rb: Now supporting more standards, compression, and peek mode/reports
+=== Misc improvements & bug fixes
+* Bug in EDIFACT scanner fixed - scanner re-implemented
+* Windows supported now (workaround for issue with Pathname#realpath).
+  Note: Put "ruby" in front of edi4r scripts & tools when invoked
+	in pipe mode in a DOS shell, or the pipe fill fail.
+== 0.9.4.0
+=== New structures
+* Time: Class method "edifact", method "format" added
+* MsgGroup: Now fully supported
+=== Changes
+* New test case "test_minidemo" aimed at fixes in DE#to_s, see below
+* "test_basics" now covering MsgGroup tests as well
+* Special setters de0020=, de0048=, de0062=, de0340= added
+  to make header/trailer changes consistent.
+* Setters de0001=, de0002= added to prevent changes to charset and version
+* E::Message#validate: Improved
+* Tests updated to reflect new features
+=== Misc improvements & bug fixes
+* E::Interchange.parse: hnd.close removed; some code cleanup
+* EDI::MsgGroup: Completed
+* EDI::E::MsgGroup: Completed, now fully supported
+* E::DE#to_s:  Fix - now calling super()
+* DE#to_s:     Fix - adding leading zeroes for too short numeric values
+* DE#validate: No more warnings if format e.g. n6 and fixable
+* E::Illegal_Charset_Patterns: Bugfix ('+' missing, '*' included twice)
+== 0.9.3.1
+Bugfix release, now integrated in 0.9.4
+=== New structures
+* (none so far)
+=== Changes
+* New test case "test_minidemo" aimed at fixes in DE#to_s, see below
+=== Misc improvements & bug fixes
+* E::Interchange.parse: hnd.close removed; some code cleanup
+* E::DE#to_s:  Fix - now calling super()
+* DE#to_s:     Fix - adding leading zeroes for too short numeric values
+* DE#validate: No more warnings if format e.g. n6 and fixable
+== 0.9.3
+=== New structures
+* Improved class hierarchy
+* Methods: More consistent, better inheritance
+* Removed: write() (use to_s !)
+* inspect(): more attributes for segments
+* EDI::E::UNA now bundles all UNA related methods
+=== Changes
+* New access to DE/CDE arrays (like ) through prefix 'a'
+  prevents users from accidentally overwriting a DE.
+  Previous access through cCxxx[] or dxxxx[] will fail now.
+* Some parameters of EDI::E::Interchange.new have changed,
+  several more added.
+* UNA handling: Now through attributes of UNA class.
+=== Misc improvements
+* Improved handling of both decimal signs
+* Bug removed in escaping of special chars
+* Refactoring of some internals
+* RDoc now usable - completely overhauled.
+== 0.9.2
+=== Misc improvements
+* New parsing basics: "String::separate" replaced by EDI::E::edi_split
+* Parsing bug removed
+* More validation features
+* "inspect", "find_all" added
+* "descendants_and_self" etc. added to EDI::Segment
+* bin/editool.rb added (validate, list, inspect EDI data)
+== 0.9.1
+=== First release as a gem
+* Split from the edi4r core package, turned into a separate one.
+* Unit tests added
+* RDoc documentation added
+* Modular loading scheme for normdata added (through EDI_NDB_PATH)
+* Parameter usage overhauled
+* Rolled into a gem
+== 0.8.x
+* Used internally for projects and teaching

data/README ADDED

@@ -0,0 +1,66 @@
+= EDI TOOLKIT for RUBY (edi4r)
+This is Ruby gem <b>edi4r</b> version
+:include:VERSION
+Edi4r was created to greatly simplify the creation and processing of data for
+Electronic Data Interchange (EDI). In particular, it supports the UN/EDIFACT
+syntax (ISO 9573) and optionally SAP IDocs.
+== Installation
+Install it as any other Ruby gem, e.g.:
+  sudo gem install edi4r-<version>.gem
+== Usage
+  require 'rubygems'
+  require_gem 'edi4r'
+  require 'edi4r/edifact' # optional
+  # Build a UN/EDIFACT interchange from its character representation in a file:
+  ic = nil
+  File.open("received.edi") {|hnd| ic = EDI::E::Interchange.parse( hnd ) }
+  ic.each do |msg|
+    # Process message, here: Just list document numbers from (only) segment BGM
+    puts msg['BGM'].first.d1004
+  end
+  # Create a minimalistic interchange
+  ic  = EDI::E::Interchange.new  # Default: syntax version=3, charset = UNOB
+  msg = ic.new_message           # Default: ORDERS D.96A
+  bgm = msg.new_segment('BGM')   # Obtain an empty segment
+  bgm.cC002.d1001 = '220'	 # Add some content to mandatory elements
+  bgm.d1004 = 'PO-54321'
+  dtm = msg.new_segment('DTM')
+  dtm.cC507.d2005 = '137'
+  uns = msg.new_segment('UNS')
+  uns.d0081 = 'S'
+  [bgm, dtm, uns].each {|seg| msg.add seg} # Add segments to message
+  ic.add msg                     # Add message to interchange - ready to go!
+  ic.header.cS002.d0004 = 'sender'; ic.header.cS003.d0010 = 'recipient' # UNB
+  ic.validate # Conforming to standard?
+  print ic    # Could be sent that way!
+== See also
+* Background[link:files/lib/edi4r_rb.html] info about data structure and classes.
+* A Tutorial[link:files/Tutorial.html] for examples of use.
+* A ChangeLog[link:files/ChangeLog.html] will be maintained; it is just starting now.
+* Finally, see TO-DO[link:files/TO-DO.html] for the current wish list.
+* This code is put under the Ruby license, see COPYING[link:files/COPYING.html] for details.
+:include:AuthorCopyright
+Enjoy,
+Heinz

data/TO-DO ADDED

@@ -0,0 +1,35 @@
+= To-do list
+== Documentation
+* Further improve RDoc support
+* DocBook based tutorial
+* FAQ section
+* Intro to EDI in general
+* Outlook: More components (of a real EDI/EAI server), e.g. for messaging
+== XML support
+* DTD generation from Diagrams
+== Core features
+* E: more SV4 support, I-EDI support
+* E: Support for code lists, subsets, restricted codelists
+* Recovery from parsing errors, ability to skip faulty messages
+* Consider a "C" extension for fast segmentation of plain EDIFACT data
+* Generic support for CSV/variable-length and fixed record formats?
+* Add a generic number generator
+* Add EAN/GTIN/EPC helpers
+== XPath support
+* method "each" accepting XPath parameter
+* Index op [] accepting XPath
+* More XPath expressions + corresponding methods
+== Applications
+* Better list mode in editool.rb
+== Test suite
+* Extend
+== Details
+* Avoid module name 'Dir' - might collide with std. classname 'Dir'

data/Tutorial ADDED

@@ -0,0 +1,609 @@
+= Tutorial
+== Getting started
+=== Installation
+ sudo gem install edi4r
+ sudo gem install edi4r-tdid
+=== Require statements
+ require "rubygems"
+ require "edi4r"         # Try require_gem if this fails on your site
+ require "edi4r/edifact"
+ require "edi4r-tdid"    # Try require_gem if this fails on your site
+== Creating an (outbound) UN/EDIFACT interchange
+=== An empty interchange
+ ic = EDI::E::Interchange.new
+creates an empty interchange object with syntax version 3
+and charset UNOB. You can make this a bit more explicit
+by passing parameters as hash components:
+  ic = EDI::E::Interchange.new( :version => 3, :charset => 'UNOB' )
+See the source for more parameters.
+=== An empty message
+  msg = ic.new_message
+creates an empty message in the context of the given interchange,
+i.e. the syntax version, charset, UNA settings, interactive or batch EDI.
+By default, the message type is <tt>ORDERS D.96A</tt>. Select any
+message from any UN/TDID by passing the corresponding parameters
+as hash components:
+  msg = ic.new_message( :msg_type=>'ORDERS', :version=>'D', :release=>'96A',
+			:resp_agency=>'UN' )
+Hash components which you do not specify are taken from a set of defaults.
+=== Filling an interchange
+You may add messages to the interchange any time by calling method add():
+  ic.add( msg )
+When adding new messages to an interchange, they get appended to the
+current interchange content. There is no method to insert a message
+at any other location. If you need to do that, hold your messages
+in an array, sort them any way you like, and finally add them
+to the interchange in the desired sequence.
+Note that each messag gets validated by default when you add it to
+the interchange. If your message needs to be completed only later,
+you may disable validation by calling:
+  ic.add( msg, false )
+=== Filling a message
+A freshly created message is empty, aside from its header and trailer
+which we shall discuss later. Simply create the segments you want to add,
+fill them, and add them to the message:
+  seg = msg.new_segment( 'BGM' )
+Here, we derived a BGM segment from the current context,
+i.e. an UN/TDID like D.96A which we specified when creating the message given.
+Note that <tt>new_segment()</tt> accepts all segment tags available in the
+whole TDID's  segment directory - not just those usable within
+this message type.
+Add content to the segment (see below) and add it to the message:
+  msg.add( seg )
+Like with messages added to an interchange, it is your responsibility
+to assure the proper sequence of segments. You will need the UN/EDIFACT
+message structure, a subset description, or a message implementation
+guideline (MIG) handy in order to comply.
+It is possible to add empty or partially filled segments to a message.
+Just keep a reference to them and fill in their required data elements later.
+== Accessing Composites and Data Elements
+=== Background
+While interchanges and messages are basically empty when created,
+segments are not: They come equipped with the composites (CDE, composite
+data elements ) and data elements (DE) they comprise in their current
+context. Likewise, CDEs come equipped with the sequence of DEs
+which they contain according to the underlying TDID.
+Segments and CDEs are basically sequences (arrays) of their component
+elements. This sequence depends on the TDID of their context.
+E.g., a BGM segment from D.96A looks different from a BGM in D.01B.
+These sequences are fixed and cannot/should not be altered after
+creation of a segment or CDE.
+=== Getters for CDE
+There is a getter method for each CDE of a segment.
+Its name is simply prefix 'c' and the CDE name.
+In order to access, say, C002 in segment BGM, the getter
+is named 'c' + 'C002':
+  cde = seg.cC002
+(See below how to handle arrays)
+In most cases you'll need the CDE object only to access
+its component data elements. Let's see how that works:
+=== Getters for DE values
+During mapping tasks, we normally do not deal with the
+internal organization of DE objects. All we need is access
+to their <em>values</em>.
+Similarly to CDE getters, we build a DE getter by prepending
+a 'd' to the DE name. The result is a method that returns
+the current value of this DE (nil if unassigned), not the
+DE object itself:
+  order_number = seg.d1004  if cde.d1001 == 220
+The example shows that this concept is very convenient and works
+both with component DEs in a CDE and plain DEs in a segment.
+=== Setters for DE values
+We use the same approach for setters - a DE setter actually
+changes its value, not the DE object itself:
+  bgm = msg.new_segment( 'BGM' )
+  bgm.d1004 = '123456ABC'
+  bgm.cC002.d1001 = 220
+Well, that's both easy and readable! But what about the
+integer assigned to DE 1004? Don't worry - its string value
+is still what we later want to see in the interchange file.
+=== Setters for CDE and segments?
+There is no such thing! A CDE should always be derived
+from its proper context and must not be changed thereafter.
+Likewise, a segment's content is nothing a user should
+interfere with.
+Does that sound like dictatorship? Well, there *are* ways
+to manipulate CDEs and segments, but that's an advanced
+and rarely used topic well out of scope of this tutorial ...
+=== DE and CDE arrays
+Sometimes, a DE occurs multiple times within a segment or CDE,
+and a CDE may occur multiple times within a segment.
+Before syntax version 4, EDIFACT does not really employ
+the concept of an array. Instead, there are multiple
+occurrences of a particular DE or CDE listed in a row.
+In such a case, the corresponding getters and setters
+won't work. Actually, they raise a TypeError to make sure
+that you don't accidentally overlook that there is more
+than one instance of the given (C)DE.
+We obtain a whole array of *all* matching (C)DE instead
+and indicate this with a prefix 'a':
+  seg = msg.new_segment('PIA')
+  cde_list = seg.aC212
+  cde_list[0].d7140 = '54321'
+  cde_list[0].d7143 = 'SA'
+  cde_list[0].d3055 = 91
+  cde_list[1].d7140 = ... # etc
+  seg = msg.new_segment('NAD')
+  seg.cC080.a3036[0].value = 'E. X. Ample'
+  seg.cC080.a3036[1].value = 'Sales dept.'
+Note that a3036 returns an array of DE objects, not their values!
+We thus use DE setter <tt>value</tt> to actually assign
+new values to those DE objects.
+Sometimes, a CDE or segment contains the same DE more than once
+even if both instances are separated by a different element,
+like DE 3055 and 1131 in C088 of segment FII, which you may
+find in invoices. In that case the same concept holds: cde.a1131
+would fetch all instances, no matter if some other elements
+occur in between or not.
+== Building it all together
+OK, so we keep generating segments, filling their data elements
+with content, and adding them to the message that we are
+about to build - fine.
+Likewise, we add messages to the interchange. Fine - but how do we
+eventually get the output, how do we make sure that we have
+not forgotten anything, and how do we deal with the
+service segments (e.g. to define sender and recipient IDs) ?
+=== Headers and trailers
+Interchanges and messages are objects which may come with
+a header and trailer segment. In UN/EDIFACT, these are
+UNB/UNZ and UNH/UNT, respectively.
+In order to let us focus on the content, edi4r keeps these
+service segments away from us and tries its best to
+treat them automatically. For example, we do not have to
+count segments or messages - edi4r takes care of that
+and updates the corresponding DE in UNT and UNZ, respectively.
+If we really need to access data there,
+that's possible anytime through getters <tt>header</tt> and
+<tt>trailer</tt>. From then on, we just use the usual DE and
+CDE getters and setters. E.g., setting the UNB sender ID
+works like this:
+  ic.header.cS002.d0004 = '1234567'
+Setting the test indicator may look like this:
+  ic.header.d0035 = 1
+=== UNA handling
+The pseudo segment UNA commonly introduces an UN/EDIFACT
+interchange. It is shown there by default, which is a good
+idea in most cases. If you really have to switch it off, use:
+  ic.show_una = false
+It can be easily viewed e.g. by:
+  puts ic.una
+The six special characters it containes are both readable
+and modifiable. Please note that these characters are represented
+as their ASCII integer codes, not as one-character strings.
+Here is the list of corresponding getter methods:
+ ce_sep()	# Component data element separator, default ?:
+ de_sep()       # Data element separator, default: ?+
+ decimal_sign() # Both ?. and ?, are eligible
+ esc_char()     # Escape character, default: ??
+ rep_sep()      # Repetition occurrence indicator, default:
+                #  ?  (SV1-3), ?* (syntax version 4)
+ seg_term()     # Segment terminator, default: ?'
+Corresponding setters allow you to change all of them.
+Remember to pass ASCII values, not strings. Example:
+  pri.cC509.d5118 = 30.1
+  pri.to_s      --> "PRI+AAA:30.1::LIU"
+  ic.una.decimal_sign = ?,
+  pri.to_s      --> "PRI+AAA:30,1::LIU"
+  ic.una.ce_sep = ?/
+  pri.to_s      --> "PRI+AAA/30,1//LIU"
+=== Validation
+Edi4r comes with a set of built-in validation rules. In order to
+validate your interchange, just call
+  ic.validate
+This method will return the number of (recoverable) issues found.
+Error messages and warnings are written to $stderr. There are
+a few non-recoverable errors which raise exceptions.
+Actually, you may apply method validate() to almost all of the
+EDI objects mentioned so far. However, doing this once for the
+whole interchance will validate everything.
+=== Printing and saving
+Once our data is validated, we want to send it to our business partner.
+Actually, that's very simple: Output it e.g. to stdout or to a file
+by just printing it! This works nicely, because our EDI objects are
+all equipped with reasonable methods "to_s()".
+== Processing (inbound) interchanges
+=== Building the interchange object
+We build a whole interchange with a single call by passing
+its character representation as a String or IO object to class method
+parse():
+  ic = nil
+  File.open("inbound_01.edi") {|hnd| ic = EDI::E::Interchange.parse( hnd )}
+That's it - from now on we may access all its contents in much the
+same way as we did during generation.
+=== Iterating over messages
+Typically we'd like to loop through the messages contained:
+  ic.each { |msg|  map_message( msg ) }
+with some suitably created mapping procedure map_message().
+In this context, the interchange is treated as a container
+of messages. We therefore use the standard iterator each().
+Actually, index access is also available, as are following
+array-like methods:
+ [](int), index(obj), each(&b), find_all(&b), size(),
+ length(), first(), last().
+Examples:
+  second_msg = ic[1]
+  last_msg = ic.last
+=== Awaiting segments of a message
+Similarly, we iterate through the segments of a message. The following
+construction lets you select segments in their proper context
+(which is the segment group):
+  def map_message( msg )
+    # do your initialization here, then
+    msg.each do |seg|
+      seg_name = seg.name
+      seg_name += ' ' + seg.sg_name if seg.sg_name
+      case seg_name
+      when "BGM"
+        # do this ...
+      when "DTM"
+        # do that ...
+      when 'NAD SG2'
+        # react only if NAD occurs in segment group 2
+      # ... etc., finally:
+      default
+        raise "Segment #{seg_name}: Not accounted for!"
+    end
+  end
+If you need to obtain all segments with a given tag, pass the
+tag as a string to the index operator:
+  d = msg['DTM']  # Array of all DTM segments, any segment group
+Actually, a message behaves like a container just as an interchange does,
+so the array-like methods listed in the previous section also apply here:
+  d = msg.find_all {|seg| seg.name == 'DTM' && seg.sg_name == 'SG20'}
+=== Selecting segment group instances
+Iterating over all segments of a message sequentially tends to produce
+cluttered code when messages grow complex. Wouldn't it be nice
+if we could delegate e.g. the mapping of a whole NAD group to a
+specialized procedure? Still better - could we delegate mapping
+of a whole item group to a specialized routine?
+Actually that's quite easy when we recall that segment groups
+are side branches of the main trunk (or a higher-level branch)
+of a message diagram, with their trigger segments being the
+T-shaped "joints". Thus, segments of a segment group are mere
+descendants of their trigger segment in the branching diagram.
+Here are some helpful methods to select segments of a group:
+  # ...
+  when 'NAD SG2'
+    map_nad_sg2( seg.children_and_self ) # skip segment COM...
+  # ...
+  when 'LIN SG28'
+    map_item( seg.descendants_and_self )
+  # ...
+Methods
+  descendants(), descendants_and_self(), children(), and
+  children_and_self()
+are inspired by XPath axes. They return an array of segments
+which depend on the given (trigger) segment as their common
+ancestor.
+Using these selectors, writing modular mapping code
+now becomes an easy task.
+== Peeking into interchanges
+=== Background
+Sometimes we only need to extract some header information
+out of EDI files, e.g. in order to find out whether the content
+is UN/EDIFACT, who sent it to whom, or just to see if the
+UNB test indicator is set.
+We could of course apply EDI::E::Interchange.parse() and access
+the header of the resulting interchange object when we know that
+a given file contains UN/EDIFACT data. However, that would be
+a big waste of resources, especially for large interchanges.
+=== Method "peek()" for UN/EDIFACT data
+Edi4r instead offers method "peek()". It reads just enought bytes
+from the file to determine its contents and to decode the header.
+Like "parse()" it returns an Interchange object, but that one
+is empty except for the header (and a dummy trailer) segment.
+You can then extract any header element you need through the
+usual getters. Example: Find out if the test indicator is set.
+  def is_testdata?( hnd )
+    ic = EDI::E::Interchange.peek( hnd )
+    ic.d0035 == '1' || ic.d0035 == 1
+  end
+=== Auto-detection and implicit decompression
+Regular EDI users need to archive their business data.
+In simple cases, moving interchange files into proper folders
+after successfully processing them already does the job.
+You can save a lot of space though by compressing them.
+Applying "gzip" to EDIFACT data easily shrinks them to
+10 % of their original volume.
+So far, so good. Later though, you may need to extract
+a specific file from the archive, e.g. the interchange
+with control reference "ref3456" from a customer
+with sender id "xyz". Well, you do not need to maintain
+a separate index or decompress all files in the archive
+in order to find it. There is a generic class method
+"Interchange.peek()" that does all this for you.
+Consider the following code fragment that assumes
+that ARGV contains the list of files to search:
+  require 'zlib'
+  found = ARGV.find do |fname|
+            ic = EDI::Interchange.peek(File.open(fname))
+            h = ic.header
+            ic.syntax=='E' && h.cS002.d0004=='xyz' && h.d0020=='ref1234'
+          end
+  ic = EDI::Interchange.parse(File.open(found))
+  # ...
+Note that this code will work with both zipped and
+unzipped data, and with UN/EDIFACT as well as other content.
+== XML representation
+=== Background
+EDI interchanges may be regarded as abstract objects which need
+some representation when stored or exchanged. E.g., UN/EDIFACT
+interchanges may be expressed (represented) by syntax version 1-4
+and a choice of separator and termination characters
+without changing their identity. Likewise, interchanges
+may be represented by some suitable XML document type.
+There was a time when classical EDI representations were
+considered outdated and to be replaced by XML documents.
+We know by now that a mere change in representation does not help
+to resolve the real issues of e-business. Nonetheless,
+XML-based technology has become much more wide-spread than EDI,
+so chances are high that one has to integrate classical EDI data
+into a XML-driven architecture.
+There are (too) many ways to do this, and attempts have been made
+to standardize them. In particular, DIN 16557-4
+(http://www.beuth.de/cmd?workflowname=CSVList&websource=&artid=43768898)
+describes a way how to represent UN/EDIFACT interchanges as XML documents
+and how to describe them with DTDs.
+The DIN approach however focuses only on UN/EDIFACT and does not represent
+the logical structure of documents. It merely encodes segments as
+XML elements and data elements and composites as their attributes.
+The value of DTDs is limited, as they need to be generated for
+any particular interchange and lack information about mandatory
+data elements and CDEs, let alone code lists.
+This library therefore supports a generic approach that allows us
+to represent any interchange object as an XML document, be it
+a UN/EDIFACT interchange, a file of SAP IDocs, or an ANSI
+X.12 or other interchange when such a module becomes available.
+Native and XML representation are fully interchangeable,
+and the XML representation reflects information from the
+branching diagram, thus supporting considerably XPath-based
+processing (e.g. you could easily select an instance of
+a line item group). Formal validation through a single generic DTD
+is available, while in-depth validation remains available
+through this library at the abstract level.
+=== Generating an XML representation of an interchange
+The current implementation of XML features is built on
+Ruby's REXML module (alternative implementations are conceivable).
+Simply load the additional methods <em>after</em> loading all other
+optional EDI4R modules, then use method "to_xml()" like this:
+  # Other require statements, finally:
+  require "edi4r/rexml"
+  # Generate your interchange "ic", then:
+  xdoc = REXML::Document.new   # Empty REXML document
+  ic.to_xml( xdoc )            # Fill it
+  # The rest is standard REXML handling. Here, we write the xdoc to a file.
+  # (See REXML::Document.write() for details on indenting)
+  xdoc.write( File.open( 'mydata.xml','w'), 0 )
+=== Building an interchange from its XML representation
+No matter what EDI standard the interchange represents,
+its corresponding EDI4R object can be re-generated easily.
+Just make sure that you have loaded the corresponding module(s):
+  # Other require statements, finally:
+  require "edi4r/rexml"
+  ic = EDI::Interchange.parse( File.open('mydata.xml') )
+Yes, that's right: It's the same statement that would
+also load UN/EDIFACT data!
+If you know already what to expect, you might bypass EDI4R's
+auto-detection and directly call one of the parse_xml() methods:
+  xdoc = REXML::Document.new( File.open('mydata.xml') )
+  ic = EDI::E::Interchange.parse_xml( xdoc )
+=== Utilities: edi2xml.rb, xml2edi.rb
+These two scripts are included to further simplify your transition
+between traditional EDI representation and XML representation.
+They are command-line tools that simply wrap the library calls
+mentioned above. Example:
+  $ edi2xml.rb foo.edi > foo.xml
+  $ xml2edi.rb foo.xml > bar.edi
+  $ diff foo.edi bar.edi   # There should be no differences
+== Tools
+=== editool.rb
+Use this command-line tool e.g. when you want to
+* <b>list</b> UN/EDIFACT data in a readable way (one segment per line,
+  optionally indented according to segment level),
+* <b>validate</b> your EDI data
+* <b>analyze</b> the data more thoroughly through method "inspect()"
+* <b>report</b> header data quickly, one line per file
+Called without option, it just builds an internal memory model
+of the passed file(s) and raises an exception upon parsing errors.
+Thorough validation can be requested optionally. Example:
+  $ editool.rb -l foo.edi bar.edi
+  $ editool.rb -p *.edi *.xml
+== Further reading
+A pair of full-blown mappings (inhouse-to-EANCOM, EANCOM-to-inhouse)
+shows in much more detail how to do outbound and inbound mapping.
+See the source codes in the "test" folder!
+== Misc topics
+To be supplied later; currently just a room to collect keywords to cover.
+=== Debugging and viewing
+  :linebreak, :indented
+  inspect()
+  Exception classes
+  Segments: T-nodes, ordinal number, index, occurrence, max. occurrence
+  empty?, required?
+=== More advances features
+  Validation: warnings and exceptions (logging?)
+  Add-ons to class Time
+  Consistency of references common to headers and trailers
+  Inheriting settings from parent: UNH from UNG, UNG from UNB
+  Low-level access to Collection contents
+Enjoy,
+  -- Heinz