RubyGems - codnar - Versions diffs - 0.1.64 - Mend

codnar 0.1.64

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.

Files changed (80)

data/ChangeLog +165 -0
data/LICENSE +19 -0
data/README.rdoc +32 -0
data/Rakefile +66 -0
data/bin/codnar-split +5 -0
data/bin/codnar-weave +5 -0
data/codnar.html +10945 -0
data/doc/logo.png +0 -0
data/doc/root.html +22 -0
data/doc/story.markdown +180 -0
data/doc/system.markdown +671 -0
data/lib/codnar.rb +41 -0
data/lib/codnar/application.rb +92 -0
data/lib/codnar/cache.rb +61 -0
data/lib/codnar/data/contents.js +113 -0
data/lib/codnar/data/control_chunks.js +44 -0
data/lib/codnar/data/style.css +95 -0
data/lib/codnar/data/sunlight/README.txt +4 -0
data/lib/codnar/data/sunlight/css-min.js +1 -0
data/lib/codnar/data/sunlight/default.css +236 -0
data/lib/codnar/data/sunlight/javascript-min.js +1 -0
data/lib/codnar/data/sunlight/min.js +1 -0
data/lib/codnar/data/sunlight/ruby-min.js +1 -0
data/lib/codnar/data/yui/README.txt +3 -0
data/lib/codnar/data/yui/base.css +132 -0
data/lib/codnar/data/yui/reset.css +142 -0
data/lib/codnar/formatter.rb +180 -0
data/lib/codnar/grouper.rb +28 -0
data/lib/codnar/gvim.rb +132 -0
data/lib/codnar/hash_extensions.rb +41 -0
data/lib/codnar/markdown.rb +47 -0
data/lib/codnar/merger.rb +138 -0
data/lib/codnar/rake.rb +41 -0
data/lib/codnar/rake/split_task.rb +71 -0
data/lib/codnar/rake/weave_task.rb +59 -0
data/lib/codnar/rdoc.rb +9 -0
data/lib/codnar/reader.rb +121 -0
data/lib/codnar/scanner.rb +216 -0
data/lib/codnar/split.rb +58 -0
data/lib/codnar/split_configurations.rb +367 -0
data/lib/codnar/splitter.rb +32 -0
data/lib/codnar/string_extensions.rb +25 -0
data/lib/codnar/sunlight.rb +17 -0
data/lib/codnar/version.rb +8 -0
data/lib/codnar/weave.rb +58 -0
data/lib/codnar/weave_configurations.rb +48 -0
data/lib/codnar/weaver.rb +105 -0
data/lib/codnar/writer.rb +38 -0
data/test/cache_computations.rb +41 -0
data/test/deep_merge.rb +29 -0
data/test/embed_images.rb +12 -0
data/test/expand_markdown.rb +27 -0
data/test/expand_rdoc.rb +20 -0
data/test/format_code_gvim_configurations.rb +55 -0
data/test/format_code_sunlight_configurations.rb +37 -0
data/test/format_comment_configurations.rb +86 -0
data/test/format_lines.rb +72 -0
data/test/group_lines.rb +31 -0
data/test/gvim_highlight_syntax.rb +49 -0
data/test/identify_chunks.rb +32 -0
data/test/lib/test_with_configurations.rb +15 -0
data/test/merge_lines.rb +133 -0
data/test/rake_tasks.rb +38 -0
data/test/read_chunks.rb +110 -0
data/test/run_application.rb +56 -0
data/test/run_split.rb +38 -0
data/test/run_weave.rb +75 -0
data/test/scan_lines.rb +78 -0
data/test/split_chunk_configurations.rb +55 -0
data/test/split_code.rb +109 -0
data/test/split_code_configurations.rb +73 -0
data/test/split_combined_configurations.rb +114 -0
data/test/split_complex_comment_configurations.rb +73 -0
data/test/split_documentation.rb +92 -0
data/test/split_documentation_configurations.rb +97 -0
data/test/split_simple_comment_configurations.rb +50 -0
data/test/sunlight_highlight_syntax.rb +25 -0
data/test/weave_configurations.rb +144 -0
data/test/write_chunks.rb +28 -0
metadata +363 -0

data/doc/logo.png ADDED

Binary file

data/doc/root.html ADDED

@@ -0,0 +1,22 @@
+<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
+<html xmlns="http://www.w3.org/1999/xhtml">
+<head>
+<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
+<title>Code narrator - an inverse literate programming tool.</title>
+<style type="text/css">
+<embed src="codnar/data/yui/reset.css" type="x-codnar/file"/>
+<embed src="codnar/data/yui/base.css" type="x-codnar/file"/>
+<embed src="codnar/data/style.css" type="x-codnar/file"/>
+</style>
+</head>
+<body>
+<div id="contents"></div>
+<embed src="README.rdoc" type="x-codnar/include"/>
+<embed src="doc/story.markdown" type="x-codnar/include"/>
+<embed src="doc/system.markdown" type="x-codnar/include"/>
+<script type="text/javascript">
+<embed src="codnar/data/contents.js" type="x-codnar/file"/>
+<embed src="codnar/data/control_chunks.js" type="x-codnar/file"/>
+</script>
+</body>
+</html>

data/doc/story.markdown ADDED

@@ -0,0 +1,180 @@
+## The Story ##
+This is the story of the Code Narrator (Codnar) tool. It serves a dual purpose.
+It describes the Codnar tool itself, but it also serves as an example of why it
+exists in the first place. To explain this more fully, we'll have to make a
+little detour into the issue of system documentation.
+### The Documentation Problem ###
+Documentation for any system can be grouped into two kinds. The first kind is the
+reference manual. If you know of a small piece of the system, this kind of
+documentation will give you the details about it. A good reference will help
+you find this piece even if you only have a rough idea of what it is named. A
+really good reference will also link it to related pieces. A great reference
+will even give you examples of how to use the related pieces in a realistic
+context.
+Reference manuals are invaluable, and there are plenty of tools to help you
+create them. The common approach is the use of structured comments (e.g.,
+[JavaDoc](http://en.wikipedia.org/wiki/Javadoc),
+[Doxygen](http://en.wikipedia.org/wiki/Doxygen), and a [host of similar
+tools](http://en.wikipedia.org/wiki/Comparison_of_documentation_generators)).
+However, reference manuals by themselves are insufficient.
+A reference manual only works if you have some idea about how the system works
+as a whole. For that, you need some sort of overview. Here there is much less
+to help you produce good documentation. The common practice is to sprinkle
+small tutorials inside your reference documentation (the [MSDN
+library](http://msdn.microsoft.com/en-us/library) is a good example). This
+doesn't really solve the problem: how do you sufficiently explain a complex new
+system, so that references and small tutorials become useful?
+One possible solution to this problem, [literate
+programming](http://en.wikipedia.org/wiki/Literate_programming), was proposed
+by Knuth. In a nutshell, the idea was that the source code for the system
+fulfilled a dual role. You could compile it into the executable code, as
+expected. But you could also generate documentation from it.
+So far this sounds a lot like structured comments, and indeed structured
+comments were inspired by literate programming. The key difference between the
+two approaches is that in literate programming, the generated documentation was
+not a reference manual. It was a linear narrative describing the system - a
+story which walked you through the system along a specific path chosen for
+optimal presentation.
+To achieve this, the sources contained the linear documentation, with embedded
+code "chunks". The order of the chunks in the sources was determined by the
+narrative, not the programming language requirements. Extracting and
+re-ordering these chunks was part of the build process, so the regular compiler
+could process them as usual.
+This was the great strength, but also the great weakness, of literate
+programming. For example, it is next to impossible to create IDEs and similar
+tools for literate programming source code. The code chunks are split any which
+way and spread around the source files in any order; the same source file may
+contain chunks in several languages; etc. Automatically figuring out, say, the
+list of members of some class would be a daunting task.
+In contrast, structured comments stay out of the way of the IDE and similar
+tools. The source code is still structured exactly the way the compiler wants,
+which allows for easy, localized processing. The trade-off, of course, is that
+structured comments produce a reference manual, not a narrative.
+Today, structured comments have taken over the coding world, and literate
+programming has all but been forgotten. The problem it tried to solve, however,
+is still very much with us. How do we explain a new complex system?
+### A Different Approach ###
+Codnar is an example of a different approach for solving this problem, "inverse
+literate programming" (similar to, for example,
+[antiweb](http://packages.python.org/antiweb/)). This approach is a combination
+of structured comments and literate programming. Note that this approach is
+similar to, but different in key aspects from, [reverse literate
+programming](http://ssw.jku.at/Research/Projects/RevLitProg/).
+In inverse literate programming, the source files are organized just
+the way the compiler, IDE, and similar tools expect them to be. Structured
+comments are used to document the pieces of code, and a reference manual can be
+generated from the sources as usual.
+In addition, the code is split into (possibly nested) named "chunks". This is
+done using specially formatted comments. It turns out this functionality is
+already supported by most coding editors and IDEs, in the form of "folds" or
+"regions". These allow the developer to collapse or expand such chunks at will.
+At this point, inverse literate programming kicks in. The developer writes
+additional documentation source files, next to the usual code source files.
+These documentation source files contain a narrative that describes the system,
+much in the same way that literate programming documentation would have done,
+with two important differences.
+The first difference is that the documentation source files refer to and embed
+the code chunks (using their names), as opposed to a literate programming
+system, where the documentation source files actually contain the code chunks.
+The second difference is that the documentation source files do not need to
+repeat the information that is already covered in the structured comments. When
+a code chunk is embedded into the documentation, it includes these comments, so
+all the documentation source files need to contain is the narrative "glue" for
+placing these pieces into a comprehensible context for the reader.
+In this way, inverse literate programming allows generating a linear narrative
+describing the system, without abandoning the existing code processing tools.
+It also makes it easy to retrofit such documentation to an existing code base;
+all that's needed is to mark the already-documented code chunks (or even just
+treat each source code file as a single chunk), and provide the narrative glue
+around them.
+### Maintaining the Documentation ###
+Structured comments have the advantage that they are easy to maintain. Every
+time you change a piece of code, change its comment to match. Similarly,
+literate programming forced one to maintain the documentation as well, since
+the same source file was used for code and documentation. Inverse literate
+programming does not share this advantage. The linear documentation is in a
+separate file, so it isn't immediately visible to the developer who is making
+the changes. Also, it is easy to just forget to include some chunks of code in
+the documentation.
+These issues are very similar to the issues of unit testing. Unit tests live in
+a separate file from the code they test, and it is easy to forget to test some
+chunks of code. One way to ensure all code is tested is to use a code coverage
+tool. Similarly, inverse literate programming tools should complain about code
+chunks that are left out of the final narrative.
+A different approach,
+[TDD](http://en.wikipedia.org/wiki/Test-driven_development), ensures that the
+tests are up-to-date and complete by writing the tests before the code. The
+same approach can be used for documentation.
+[DDD](http://thinkingphp.org/spliceit/docs/0.1_alpha/pages/ddd_info.html) means
+that you first document what you are about to do, and only then follow up with
+the actual coding. Inverse literate programming and TDD are an excellent
+practical way to achieve that.
+The unit tests are code like any other code. As such, they should be documented
+using structured comments. Certain unit test tools like
+[RSpec](http://rspec.info/), [Cucumber](http://cukes.info/) and other
+[BDD](http://en.wikipedia.org/wiki/Behavior_Driven_Development) tools blur the
+line between the tests-as-code and the tests-as-documentation anyway, so the
+amount of unit test structured documentation should be small.
+Therefore, if you are writing the tests first, you have done the heavy lifting
+of documenting what the new code will do. All that is left is providing a bit
+of surrounding context and embedding it all in the correct location in the
+narrative. Then, when you write the new code itself, it should be easy to
+connect it to the narrative at the appropriate point.
+In the case of Code Narrator itself, the code library has ~2100 (raw) lines,
+the test code has ~2200 lines, and the narrative documentation has only ~900
+lines. Given that narrative documentation is easier to write than system (or
+test) code, this indicates that maintaining a narrative is not an unreasonable
+burden for a well-tested project.
+### Code Narrator ###
+Codnar is an inverse literate programming tool. It allows you to tell a story
+about your system, which will explain it to others: developers, maintainers,
+and/or users. It builds on the structured comments you would write anyway to
+generate a reference manual for the system, requires minimal or no changes to
+your source code files, and works perfectly well inside your favorite IDE or
+editor. If you follow TDD or BDD, Codnar will make it easier for you to
+complement it with DDD.
+Codnar is available under the MIT license:
+[[LICENSE|named_chunk_with_containers]]
+And the current Codnar version is:
+[[lib/codnar/version.rb|named_chunk_with_containers]]
+The rest of this document goes into the details of Codnar's implementation. The
+core of the system is the following simple data flow: A set of source files is
+split into chunks; the chunks are woven into a single HTML. This simple flow
+can be enhanced by pre-processing the sources, or post-processing the HTML. In
+a realistic project, all this would be managed by some build tool; either using
+the command-line (for arbitrary build tools) or using the provided Ruby classes
+for Rake integration.

data/doc/system.markdown ADDED

@@ -0,0 +1,671 @@
+## Splitting files into chunks ##
+Codnar makes the reasonable assumption that each source file can be effectively
+processed as a sequence of lines. This works well in practice for all "text"
+source files. It fails miserably for "binary" source files, but such files
+don't work that well in most generic source management tools (such as version
+management systems).
+A second, less obvious assumption is that it is possible to classify the source
+file lines into "kinds" using a simple state machine. The classified lines are
+then grouped into nested chunks based on the two special line kinds
+`begin_chunk` and `end_chunk`. The other line kinds are used to control how the
+lines are formatted into HTML.
+The collected chunks, with the formatted HTML for each one, are then stored in
+a chunks file to be used later for weaving the overall HTML narrative.
+### Scanning Lines ###
+Scanning a file into classified lines is done by the `Scanner` class.
+Here is a simple test that demonstrates using the scanner:
+[[test/scan_lines.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/scanner.rb|named_chunk_with_containers]]
+As we can see, the implementation is split into two main parts. First, all
+shorthands in the syntax definition are expanded (possibly generating errors).
+Then, the expanded syntax is applied to a file, to generate a sequence of
+classified lines.
+#### Scanner Syntax Shorthands ####
+The syntax is expected to be written by hand in a YAML file. We therefore
+provide some convenient shorthands (listed above) to make YAML syntax files
+more readable. These shorthands must be expanded to their full form before we
+can apply the syntax to a file. There are two sets of shorthands we need to
+expand:
+* [[Scanner pattern shorthands|named_chunk_with_containers]]
+* [[Scanner state shorthands|named_chunk_with_containers]]
+The above code modifies the syntax object in place. This is safe because we are
+working on a `deep_clone` of the original syntax:
+[[lib/codnar/hash_extensions.rb|named_chunk_with_containers]]
+#### Classifying Source Lines ####
+Scanning a file into classified lines is a simple matter of applying the current
+state transitions to each line:
+[[Scanner file processing|named_chunk_with_containers]]
+If a line matches a state transition, it is classified accordingly. Otherwise,
+it is reported as an error:
+[[Scanner line processing|named_chunk_with_containers]]
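The classification step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not Codnar's actual `Scanner`: the `SYNTAX` hash layout, the `classify_lines` helper, and the `:pattern`, `:kind` and `:next_state` keys are all hypothetical names chosen for this example, and the real tool reads its syntax from YAML with shorthands.

```ruby
# Hypothetical syntax: each state lists its transitions in priority order.
# A transition names the pattern to match, the kind to assign to the line,
# and the state to move to afterwards.
SYNTAX = {
  "code" => [
    { pattern: %r{^\s*/\*(.*)$}, kind: "comment", next_state: "comment" },
    { pattern: /^(.*)$/,         kind: "code",    next_state: "code" },
  ],
  "comment" => [
    { pattern: %r{^(.*)\*/\s*$}, kind: "comment", next_state: "code" },
    { pattern: /^(.*)$/,         kind: "comment", next_state: "comment" },
  ],
}.freeze

# Classify each (already chomped) line using the transitions of the current
# state. A line that matches no transition is reported as an error line.
def classify_lines(lines, syntax, start_state)
  state = start_state
  lines.each_with_index.map do |line, index|
    transition = syntax.fetch(state).find { |candidate| candidate[:pattern] =~ line }
    if transition.nil?
      { kind: "error", number: index + 1, payload: line }
    else
      state = transition[:next_state]
      { kind: transition[:kind], number: index + 1, payload: line[transition[:pattern], 1] }
    end
  end
end
```

Because the transitions are tried in order, the catch-all pattern at the end of each state acts as a default; a syntax without such a default is how the error case arises.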
+### Merging scanned lines to chunks ###
+Once we have the array of scanned classified lines, we need to merge them into
+nested chunks. Here is a simple test that demonstrates using the merger:
+[[test/merge_lines.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/merger.rb|named_chunk_with_containers]]
+#### Merging nested chunk lines ####
+To merge the nested chunk lines, we maintain a stack of the current chunks.
+Each `begin_chunk` line pushes another chunk on the stack, and each `end_chunk`
+line pops it. If any chunks are not properly terminated, they will remain in
+the stack when all the lines are processed.
+[[Merging nested chunk lines|named_chunk_with_containers]]
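As a rough sketch of the stack-based approach just described (not the actual `Merger` code), the following assumes classified lines are hashes with `:kind` and `:payload` keys, and that a nested chunk is represented in its container by a placeholder line. The `merge_chunks` name, the `nested_chunk` kind and the error messages are assumptions made for this example.

```ruby
# Merge classified lines into a flat list of chunks, using a stack to track
# the chunk currently being filled. Returns [chunks, errors].
def merge_chunks(lines, root_name)
  root = { name: root_name, lines: [] }
  stack = [root]
  chunks = [root]
  errors = []
  lines.each do |line|
    case line[:kind]
    when "begin_chunk"
      chunk = { name: line[:payload], lines: [] }
      # Leave a placeholder in the container so the weaver can embed the chunk.
      stack.last[:lines] << { kind: "nested_chunk", payload: chunk[:name] }
      stack.push(chunk)
      chunks << chunk
    when "end_chunk"
      if stack.size > 1
        stack.pop
      else
        errors << "end_chunk without a matching begin_chunk"
      end
    else
      stack.last[:lines] << line
    end
  end
  # Anything above the root that is still on the stack was never terminated.
  stack.drop(1).each { |chunk| errors << "Missing end_chunk for: #{chunk[:name]}" }
  [chunks, errors]
end
```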
+#### Unindenting merged chunk lines ####
+Nested chunks are typically indented relative to their container chunks.
+However, in the generated documentation, these chunks are displayed on their
+own, and preserving this relative indentation would reduce their readability.
+We therefore unindent all chunks as much as possible as the final step.
+[[Unindenting chunk lines|named_chunk_with_containers]]
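Unindenting "as much as possible" amounts to removing the smallest indentation shared by all non-blank lines. A minimal sketch, assuming plain strings indented with spaces only (the `unindent` helper is hypothetical and ignores tabs):

```ruby
# Remove the indentation shared by every non-blank line.
# Blank lines are normalized to empty strings.
def unindent(lines)
  indents = lines.reject { |line| line.strip.empty? }.map { |line| line[/\A */].size }
  shared = indents.min || 0
  lines.map { |line| line.strip.empty? ? "" : line[shared..-1] }
end
```

Relative indentation inside the chunk is preserved; only the common prefix is dropped.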
+### Generating chunk HTML ###
+Now that we have each chunk's lines, we need to convert them to HTML.
+#### Grouping lines of the same kind ####
+Instead of formatting each line on its own, we batch the operations to work on
+all lines of the same kind at once. Here is a simple test that demonstrates
+using the grouper:
+[[test/group_lines.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/grouper.rb|named_chunk_with_containers]]
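For illustration only, grouping consecutive lines of the same kind can be expressed with Ruby's `Enumerable#slice_when`. The hash layout (`:kind`, `:payload`) and the `group_lines` name are assumptions for this sketch; the embedded implementation above is the authoritative one.

```ruby
# Batch consecutive lines that share a kind, so that each group can be
# formatted in a single operation.
def group_lines(lines)
  lines.slice_when { |before, after| before[:kind] != after[:kind] }.map do |group|
    { kind: group.first[:kind], payloads: group.map { |line| line[:payload] } }
  end
end
```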
+#### Formatting lines as HTML ####
+Formatting is based on a configuration that specifies, for (a group of) lines
+of each kind, how to convert it to HTML. Here is a simple test that
+demonstrates using the formatter:
+[[test/format_lines.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/formatter.rb|named_chunk_with_containers]]
+#### Basic formatters ####
+The implementation contains some basic formatting functions. These are
+sufficient for generic source code processing.
+[[Basic formatters|named_chunk_with_containers]]
+#### Markup formats ####
+The `markup_lines_to_html` formatter above relies on the existence of a class
+for converting comments from the specific markup format to HTML. Currently, two
+such formats are supported:
+* RDoc, the default markup format used in Ruby comments. Here is a simple test
+  that demonstrates using RDoc:
+  [[test/expand_rdoc.rb|named_chunk_with_containers]]
+  And here is the implementation:
+  [[lib/codnar/rdoc.rb|named_chunk_with_containers]]
+* Markdown, a generic markup syntax used across many systems and languages.
+  Here is a simple test that demonstrates using Markdown:
+  [[test/expand_markdown.rb|named_chunk_with_containers]]
+  And here is the implementation:
+  [[lib/codnar/markdown.rb|named_chunk_with_containers]]
+In both cases, the HTML generated by the markup format conversion is a bit
+messy. We therefore clean it up:
+[[Clean html|named_chunk_with_containers]]
+#### Syntax highlighting using GVIM ####
+If you have `gvim` installed, it is possible to use it to generate syntax
+highlighting. This is a *slow* operation, as `gvim` was never meant to be used
+as a command-line tool. However, what it lacks in speed it compensates for in
+scope; almost any language you can think of has a `gvim` syntax highlighting
+definition. Here is a simple test that demonstrates using `gvim` for syntax
+highlighting:
+[[test/gvim_highlight_syntax.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/gvim.rb|named_chunk_with_containers]]
+Since GVim is so slow, we are using caching to minimize the time it takes to
+recompute the same code's highlighted HTML. This is pretty useful in practice -
+making changes in one chunk in a file will not require recomputing the
+highlighting for any of the unchanged chunks in the same file. Here is a simple
+test of using the caching functionality:
+[[test/cache_computations.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/cache.rb|named_chunk_with_containers]]
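The caching idea can be sketched as a small disk cache keyed by a digest of the input. This is an assumption-laden illustration rather than the `Cache` class embedded above: the `ComputationCache` name, the `[]` interface, and the choice of SHA1 file names are all hypothetical, and results are assumed to be strings.

```ruby
require "digest/sha1"
require "fileutils"
require "tmpdir"

# Store the result of a slow computation on disk, keyed by a digest of its
# input, so that unchanged inputs are never recomputed.
class ComputationCache
  def initialize(directory, &computation)
    @directory = directory
    @computation = computation
    FileUtils.mkdir_p(directory)
  end

  def [](input)
    path = File.join(@directory, Digest::SHA1.hexdigest(input))
    return File.read(path) if File.exist?(path)
    result = @computation.call(input)
    File.write(path, result)
    result
  end
end

# Usage: the block stands in for a slow highlighter.
Dir.mktmpdir do |directory|
  highlighter = ComputationCache.new(directory) { |code| "<pre>#{code}</pre>" }
  highlighter["puts 1"] # computed and written to disk
  highlighter["puts 1"] # read back from disk
end
```

Keying on the content rather than on the file name is what lets unchanged chunks in a modified file reuse their earlier results.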
+#### Syntax highlighting using Sunlight ####
+[Sunlight](http://sunlightjs.com/) offers a different approach for syntax
+highlighting. Instead of pre-processing the code to generate highlighted HTML
+while splitting, it provides Javascript files that examine the textual code in
+the DOM and convert it to highlighted HTML in the browser. This takes virtually
+no time when splitting the code, but requires recomputing highlighting for all
+the code chunks every time the HTML file is loaded. This can be pretty slow,
+especially if using a browser with a slow Javascript engine, like IE. However,
+given how slow GVIM is, this is a reasonable trade-off, at least for small
+projects. Since Sunlight is a new project, it doesn't offer the extensive
+coverage of different programming languages supported by GVIM.
+Here is a simple test that demonstrates using Sunlight for syntax highlighting:
+[[test/sunlight_highlight_syntax.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/sunlight.rb|named_chunk_with_containers]]
+### Putting it all together ###
+Now that we have all the separate pieces of functionality for splitting source
+files into HTML chunks, we need to combine them to a single convenient service.
+#### Splitting code files ####
+Here is a simple test that demonstrates using the splitter for source code
+files:
+[[test/split_code.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/splitter.rb|named_chunk_with_containers]]
+#### Splitting documentation files ####
+The narrative documentation is expected to reside in one or more files, which
+are also "split" to a single chunk each. Having both documentation and code
+exist as chunks allows for uniform treatment of both when weaving, as well as
+allowing for pre-processing the documentation files, if necessary. For example,
+for documentation, Codnar currently supports the same two markup formats that
+are supported for code comments. Here is a simple test that demonstrates
+"splitting" documentation (using the same implementation as above):
+[[test/split_documentation.rb|named_chunk_with_containers]]
+### Built-in configurations ###
+The splitting mechanism defined above is pretty generic. To apply it to a
+specific system requires providing the appropriate configuration. The system
+provides a few specific built-in configurations which may be useful "out of the
+box".
+If one is willing to give up altogether on syntax highlighting and comment
+formatting, the system would be applicable as-is to any programming language.
+Properly highlighting almost any known programming language syntax would be a
+simple matter of passing the correct syntax parameter to GVIM.
+Properly formatting comments in additional mark-up formats would be trickier.
+First, a proper pattern needs to be established for extracting the comments
+(`/*`, `//`, `--`, etc.). Then, the results need to be converted to HTML. One
+way would be to pass them through GVim syntax highlighting with an appropriate
+format (e.g., `syntax=doxygen`). Another would be to invoke some Ruby library;
+finally, one could invoke some external tool to do the job. The latter two
+options would require providing additional glue Ruby code, similar to the GVim
+class above.
+At any rate, here are the built-in configurations:
+[[lib/codnar/split_configurations.rb|named_chunk_with_containers]]
+#### Combining configurations ####
+Different source files require different overall configurations but reuse
+common building blocks. To support this, we allow configurations to be combined
+using a "deep merge". This allows complex nested structures to be merged. There
+is even a way for arrays to append elements before/after the array they are
+merged with. Here is a simple test that demonstrates deep-merging complex
+structures:
+[[test/deep_merge.rb|named_chunk_with_containers]]
+Here is the implementation:
+[[Deep merge|named_chunk_with_containers]]
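To make the notion of a "deep merge" concrete, here is a minimal sketch covering only nested hashes. It is not the implementation embedded above: in this simplified version an overriding array simply replaces the original one, and the before/after array mechanism mentioned in the text is deliberately left out. The `deep_merge` function name is an assumption.

```ruby
# Recursively merge two hashes. Nested hashes are merged key by key;
# any other value (including arrays, in this simplified sketch) is
# replaced by the overriding value.
def deep_merge(base, override)
  base.merge(override) do |_key, base_value, override_value|
    if base_value.is_a?(Hash) && override_value.is_a?(Hash)
      deep_merge(base_value, override_value)
    else
      override_value
    end
  end
end
```

A plain `Hash#merge` would replace the whole nested hash under a shared key, which is why the recursive variant is needed for layered configurations.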
+And here is a test module that automates the process of merging configurations
+and invoking the Splitter:
+[[test/lib/test_with_configurations.rb|named_chunk_with_containers]]
+#### Documentation "splitting" ####
+These are pretty simple configurations, applicable to files containing a piece
+of the narrative in some supported format. These configurations typically do
+not need to be combined with other configurations. Here is a simple test
+that demonstrates "splitting" documentation:
+[[test/split_documentation_configurations.rb|named_chunk_with_containers]]
+And here are the actual configurations:
+[[Documentation "splitting" configurations|named_chunk_with_containers]]
+#### Source code lines classification ####
+Splitting source code files is a more complex affair, which does typically
+require combining several configurations. The basic configuration marks all
+lines as belonging to some code syntax, as a single chunk:
+[[Source code lines classification configurations|named_chunk_with_containers]]
+Sometimes, code in one syntax contains nested "islands" of code in another
+syntax. Here is a simple configuration to support that, which can be combined
+with the above basic configuration:
+[[Nested foreign syntax code islands configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating using source code lines classifications:
+[[test/split_code_configurations.rb|named_chunk_with_containers]]
+#### Simple comment classification ####
+Many languages use a simple comment syntax, where some prefix indicates a
+comment that spans until the end of the line (e.g., shell `#` comments or C++
+`//` comments).
+[[Simple comment classification configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating using simple comment classifications:
+[[test/split_simple_comment_configurations.rb|named_chunk_with_containers]]
+#### Complex comment classification ####
+Other languages use a complex multi-line comment syntax, where some prefix
+indicates the beginning of the comment, some suffix indicates the end, and by
+convention some prefix is expected for the inner comment lines (e.g., C's
+"`/*`", "` *`", "`*/`" comments or HTML's "`<!--`", "` -`", "`-->`" comments).
+[[Complex comment classification configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating using complex comment classifications:
+[[test/split_complex_comment_configurations.rb|named_chunk_with_containers]]
+#### Comment formatting ####
+In many cases, the text inside comments is written using some markup format
+(e.g., RDoc for Ruby or JavaDoc for Java). Currently, two such formats are
+supported, as well as simply wrapping the comment in an HTML pre element:
+[[Comment formatting configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating formatting comment contents:
+[[test/format_comment_configurations.rb|named_chunk_with_containers]]
+#### Syntax highlighting using GVim ####
+Supporting a specific programming language (other than dealing with comments)
+is very easy using GVim for syntax highlighting, as demonstrated here:
+[[GVim syntax highlighting formatting configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating highlighting code syntax using `gvim`:
+[[test/format_code_gvim_configurations.rb|named_chunk_with_containers]]
+#### Syntax highlighting using Sunlight ####
+For small projects in languages supported by Sunlight, you may choose to use
+it instead of GVim:
+[[Sunlight syntax highlighting formatting configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating highlighting code syntax using Sunlight:
+[[test/format_code_sunlight_configurations.rb|named_chunk_with_containers]]
+#### Chunk splitting ####
+There are many ways to denote code "regions" (which become Codnar chunks). The
+following covers GVim's default scheme; others are easily added. It is safest
+to merge this configuration as the last of all the combined configurations, to
+ensure its patterns end up before any others.
+[[Chunk splitting configurations|named_chunk_with_containers]]
+Here is a simple test demonstrating splitting code chunks:
+[[test/split_chunk_configurations.rb|named_chunk_with_containers]]
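As an illustration of what such region markers look like, Vim's default fold markers are `{{{` and `}}}`. The sketch below recognizes comment-only lines carrying these markers; the exact patterns Codnar uses are in the configuration embedded above, so the regular expressions and the `classify_marker` helper here are assumptions made for the example.

```ruby
# A comment-only line containing "{{{ name" begins a chunk and a
# comment-only line containing "}}}" ends it. The leading \W* skips the
# indentation and the comment prefix ("#", "//", "--", ...).
BEGIN_CHUNK = /\A\W*\{\{\{\s*(.*?)\s*\z/
END_CHUNK = /\A\W*\}\}\}/

# Classify a single (already chomped) source line.
def classify_marker(line)
  if (match = BEGIN_CHUNK.match(line))
    { kind: "begin_chunk", payload: match[1] }
  elsif END_CHUNK =~ line
    { kind: "end_chunk", payload: "" }
  else
    { kind: "code", payload: line }
  end
end
```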
+### Putting it all together ###
+Here is a test demonstrating putting several of the above configurations
+together in a meaningful way:
+[[test/split_combined_configurations.rb|named_chunk_with_containers]]
+## Storing chunks on the disk ##
+### Writing chunks to disk ###
+In any realistic system, the number of source files and chunks will be such
+that it makes sense to store the chunks on the disk for further processing.
+This allows incorporating the split operation as part of a build tool chain,
+and only re-splitting modified files. Here is a simple test demonstrating
+writing chunks to the disk:
+[[test/write_chunks.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/writer.rb|named_chunk_with_containers]]
+### Reading chunks to memory ###
+Having written the chunks to the disk requires us, at some following point in
+time, to read them back into memory. This is the first time we will have a view
+of the whole documented system, which allows us to detect several classes of
+consistency errors: Some chunks may be left out of the final narrative
+(consider this the equivalent of test code coverage); we may be referring to
+missing (or misspelled) chunk names; and, finally, we need to deal with
+duplicate chunks.
+In literate programming, it is trivial to write a chunk once and use it in
+several places in the compiled source code. The classical example is C/C++
+function signatures that need to appear in both the `.h` and `.c`/`.cpp` files.
+However, in some cases this practice makes sense for other pieces of code, and
+since the ultimate source code contains only one copy of the chunk, this does
+not suffer from the typical copy-and-paste issues.
+In inverse literate programming, if the same code appears twice (as a result of
+copy-and-paste), then it does suffer from the typical copy-and-paste issues.
+The most serious of these is, of course, when only one copy is changed.
+The way that Codnar helps alleviate this problem is that if the same chunk
+appears more than once in the source code, its content is expected to be
+exactly the same in both cases (up to indentation). This should not be viewed
+as an endorsement of copy-and-paste programming; using duplicate chunks should be
+a last resort measure to combat restrictions in the programming language and
+compilation tool chain.
+#### Chunk identifiers ####
+The above definition raises the obvious question: what does "the same chunk"
+mean? As far as Codnar is concerned, a chunk is uniquely identified by its
+name, which is specified on the `begin_chunk` line. The unique identifier is
+not the literal name but a transformation of it. This allows us to ignore
+capitalization, white space, and any punctuation that may appear in the name.
+It also allows us to use the resulting ID as an HTML anchor name, without
+worrying about HTML's restrictions on such names.
+Here is a simple test demonstrating converting names to identifiers:
+[[test/identify_chunks.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/string_extensions.rb|named_chunk_with_containers]]
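+To make the idea concrete, here is a rough sketch of such a transformation.
+The function name `chunk_id` and the exact rules (lower-casing, replacing runs
+of non-alphanumeric characters with a single dash) are assumptions made for
+illustration; the real rules are the ones in the implementation above.

```ruby
# Hypothetical name-to-identifier transformation (an assumption, not
# necessarily the rule Codnar applies).
def chunk_id(name)
  name.downcase                   # ignore capitalization
      .gsub(/[^a-z0-9]+/, "-")    # collapse white space and punctuation
      .gsub(/\A-+|-+\z/, "")      # drop leading and trailing dashes
end

puts chunk_id("Processing the FILE template!")
puts chunk_id("processing   the file template")
```

+Both calls yield the same identifier, so the two spellings would refer to the
+same chunk, and the result contains only characters that are safe in an HTML
+anchor name.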
+#### In-memory chunks storage ####
+Detecting unused and/or duplicate chunks requires us to have in-memory chunk
+storage that tracks all chunk accesses. Here is a simple test demonstrating
+reading chunks into the storage and handling the various error conditions
+listed above:
+[[test/read_chunks.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/reader.rb|named_chunk_with_containers]]
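+The bookkeeping involved can be sketched as follows. The class name
+`ChunkStore`, its methods, and the error messages are all hypothetical; the
+sketch only shows how recording every access makes the three error classes
+(different duplicates, missing chunks, unused chunks) detectable.

```ruby
# Hypothetical in-memory chunk storage; not Codnar's actual reader.
class ChunkStore
  attr_reader :errors

  def initialize
    @chunks = {}   # identifier => content
    @used = {}     # identifier => true once the chunk was requested
    @errors = []
  end

  # Adding the same identifier twice is allowed only for identical content.
  def add(id, content)
    if @chunks.key?(id) && @chunks[id] != content
      @errors << "Different duplicate chunk: #{id}"
    end
    @chunks[id] ||= content
  end

  # Fetching marks the chunk as used; a missing chunk is reported as an error.
  def [](id)
    @used[id] = true
    @chunks.fetch(id) do
      @errors << "Missing chunk: #{id}"
      ""
    end
  end

  # Chunks that were never requested are left out of the narrative.
  def unused
    @chunks.keys.reject { |id| @used[id] }
  end
end

store = ChunkStore.new
store.add("intro", "Hello")
store.add("appendix", "Never embedded")
store["intro"]
store["conclusion"]
puts store.unused.inspect
puts store.errors.inspect
```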
+## Weaving chunks into HTML ##
+Assembling the final HTML requires combining both the narrative documentation
+and source code chunks. This is done top-down starting at a "root"
+documentation chunk and recursively embedding nested documentation and code
+chunks into it.
+### Weaving chunks together ###
+When embedding a documentation chunk inside another documentation chunk, things
+are pretty easy - we just need to insert the embedded chunk HTML into the
+containing chunk. When embedding a source code chunk into the documentation,
+however, we may want to wrap it in some boilerplate HTML, providing a header,
+footer, borders, links, etc. Therefore, the HTML syntax we use to embed a chunk
+into the documentation is `<embed src="..." type="x-codnar/template-name"/>`.
+The templates are normal ERB templates, except for the magical `file` and
+`image` templates, described below.
+At any rate, here is a simple test demonstrating applying different templates
+to the embedded code chunks:
+[[test/weave_configurations.rb|named_chunk_with_containers]]
+Here is the implementation:
+[[lib/codnar/weaver.rb|named_chunk_with_containers]]
+And here are the pre-defined weaving template configurations:
+[[lib/codnar/weave_configurations.rb|named_chunk_with_containers]]
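+As an illustration of the mechanism, the following sketch picks an ERB
+template based on the `type` attribute and applies it to a chunk. The template
+names (`bare`, `boxed`), the chunk fields (`name`, `html`) and the function
+`weave_chunk` are assumptions invented for this example; the real templates
+are the ones in the configurations above.

```ruby
require "erb"

# Hypothetical templates; the names and the chunk fields are assumptions.
TEMPLATES = {
  "bare" => "<%= chunk['html'] %>",
  "boxed" => "<div class=\"chunk\"><h4><%= chunk['name'] %></h4>" \
             "<%= chunk['html'] %></div>",
}

# Map a type such as "x-codnar/boxed" to its template and expand it.
# The ERB template sees the local variable "chunk" through the binding.
def weave_chunk(type, chunk)
  template_name = type.sub(%r{\Ax-codnar/}, "")
  ERB.new(TEMPLATES.fetch(template_name)).result(binding)
end

chunk = { "name" => "Example", "html" => "<pre>puts 1</pre>" }
puts weave_chunk("x-codnar/boxed", chunk)
```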
+#### Embedding files ####
+The template named `file` is special in two ways. First, the `src` is given
+special treatment. If it begins with a "`.`", it is assumed to be a normal path
+name relative to the current working directory; otherwise, it is assumed to be
+a name of a file packaged inside some gem and is searched for in Ruby's
+`$LOAD_PATH`. This allows gems (such as Codnar itself) to provide such files to
+be used in the woven documentation.
+Second, the content of the file is simply embedded into the generated
+documentation. This allows the documentation to be a stand-alone file,
+including all the CSS and Javascript required for proper display.
+[[Processing the file template|named_chunk_with_containers]]
+See the `doc/root.html` file for plenty of examples of using this
+functionality.
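+The path resolution rule described above can be sketched like this. The
+function name `resolve_embedded_file` and the error it raises are
+hypothetical; only the rule itself (a leading "`.`" means a path relative to
+the working directory, anything else is searched for in `$LOAD_PATH`) comes
+from the text.

```ruby
# Hypothetical sketch of resolving the "src" of a file template.
def resolve_embedded_file(src, load_path = $LOAD_PATH)
  # A leading "." marks a normal path relative to the working directory.
  return src if src.start_with?(".")

  # Otherwise look for the file in each load path directory, in order.
  load_path.each do |directory|
    candidate = File.join(directory.to_s, src)
    return candidate if File.exist?(candidate)
  end
  raise ArgumentError, "File #{src} not found in the load path"
end

puts resolve_embedded_file("./doc/logo.png")
```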
+#### Embedding images ####
+The `image` template is a specialization of the `file` template for dealing
+with embedded images. The specified image file is embedded into the generated
+HTML as an `img` tag, using a [data
+URL](http://en.wikipedia.org/wiki/Data_URI_scheme). This is very useful for
+small images, but is problematic when their size increases beyond
+browser-specific limits.
+Here is a simple test demonstrating processing embedded image files:
+[[test/embed_images.rb|named_chunk_with_containers]]
+Here is the implementation:
+[[Processing Base64 embedded data images|named_chunk_with_containers]]
+And here is a sample embedded image:
+[[doc/logo.png|image]]
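+For readers unfamiliar with data URLs, this is roughly what building such an
+`img` tag involves. The function name `image_data_tag` and the default MIME
+type are assumptions for illustration, not the implementation shown above.

```ruby
# Hypothetical sketch of embedding image bytes as a Base64 data URL.
def image_data_tag(bytes, mime_type = "image/png")
  # The "m0" pack directive produces Base64 without line breaks.
  encoded = [bytes].pack("m0")
  "<img src=\"data:#{mime_type};base64,#{encoded}\"/>"
end

puts image_data_tag("abc")
```

+Base64 inflates the data by about a third, which is one reason this approach
+suits small images best.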
+## Invoking the functionality ##
+There are two ways to invoke Codnar's functionality - from the command line,
+and (for Ruby projects) as integrated Rake tasks.
+### Command Line Applications ###
+Executable scripts (tests, command-line applications) start with a `require
+'codnar'` line to access the full Codnar code. This also serves as a
+convenient list of all of Codnar's parts and dependencies:
+[[lib/codnar.rb|named_chunk_with_containers]]
+The base command line Application class handles execution from the command
+line, with the usual standard options, as well as some Codnar-specific ones:
+the ability to specify configuration files and/or built-in configurations, and
+the ability to include additional extension code triggered from these
+configurations. Together, these allow configuring and extending Codnar's
+behavior to cover the specific system's needs.
+Here is a simple test demonstrating the standard Codnar application behavior:
+[[test/run_application.rb|named_chunk_with_containers]]
+And here is the implementation:
+[[lib/codnar/application.rb|named_chunk_with_containers]]
+#### Application for splitting files ####
+Here is a simple test demonstrating invoking the command-line application for
+splitting files:
+[[test/run_split.rb|named_chunk_with_containers]]
+Here is the implementation:
+[[lib/codnar/split.rb|named_chunk_with_containers]]
+And here is the actual command-line application script:
+[[bin/codnar-split|named_chunk_with_containers]]
+#### Application for weaving chunks ####
+Here is a simple test demonstrating invoking the command-line application for
+weaving chunks into HTML:
+[[test/run_weave.rb|named_chunk_with_containers]]
+Here is the implementation:
+[[lib/codnar/weave.rb|named_chunk_with_containers]]
+And here is the actual command-line application script:
+[[bin/codnar-weave|named_chunk_with_containers]]
+### Rake Integration ###
+For Ruby projects (or any other project using Rake), it is also possible to
+invoke Codnar using Rake tasks. Here is a simple test demonstrating using the
+Rake tasks:
+[[test/rake_tasks.rb|named_chunk_with_containers]]
+To use these tasks in a Rakefile, one needs to `require 'codnar/rake'`. The
+code implements a singleton that holds the global state shared between tasks:
+[[lib/codnar/rake.rb|named_chunk_with_containers]]
+#### Task for splitting files ####
+To split one or more files into chunks, create a new SplitTask. Multiple such
+tasks may be created; this is required if different files need to be split
+using different configurations.
+[[lib/codnar/rake/split_task.rb|named_chunk_with_containers]]
+#### Task for weaving chunks ####
+To weave the chunks together, create a single WeaveTask.
+[[lib/codnar/rake/weave_task.rb|named_chunk_with_containers]]
+## Building the Codnar gem ##
+The following Rakefile is in charge of building the gem, with the help of some
+tools described below.
+[[Rakefile|named_chunk_with_containers]]
+The generated HTML requires some tweaking to yield aesthetic, readable results.
+This tweaking consists of using Javascript to control chunk visibility,
+generating a table of contents, and using CSS to make the HTML look better.
+Here are the modified configurations for generating the correct HTML:
+[[Codnar configurations|named_chunk_with_containers]]
+### Javascript chunk visibility control ###
+The following code injects visibility controls ("+"/"-" toggles) next to each
+embedded code chunk. It also hides all the chunks by default; this increases
+the readability of the overall narrative, turning it into a high-level summary.
+Expanding the embedded code chunks allows the reader to delve into the details.
+[[lib/codnar/data/control_chunks.js|named_chunk_with_containers]]
+### Javascript table of contents ###
+The following code is not very efficient or elegant but it does a basic job of
+injecting a table of contents into the generated HTML.
+[[lib/codnar/data/contents.js|named_chunk_with_containers]]
+### CSS style ###
+To avoid dealing with the different default styles used by different browsers,
+we employ the YUI CSS [reset](http://developer.yahoo.com/yui/reset/) and
+[base](http://developer.yahoo.com/yui/base/) files. Resetting and restoring the
+default CSS styles is inelegant, but it is the only current way to get a
+consistent presentation of HTML. Once this is out of the way, we apply styles
+specific to our HTML. Some of these override the default styles established by
+the base CSS file above. We do this instead of directly tweaking the base CSS
+file, to allow an easy upgrade to new versions if/when YUI releases any.
+[[lib/codnar/data/style.css|named_chunk_with_containers]]
+### Using Sunlight ###
+When using Sunlight for syntax highlighting, we also need to include some CSS
+and Javascript files to convert the classified `pre` elements into properly
+marked-up HTML. We also need to invoke this Javascript code (a one-line
+operation). Here is what such code might look like inside a Javascript block
+of the generated HTML:
+  &lt;embed src="codnar/data/sunlight/min.js" type="x-codnar/file"/&gt;
+  &lt;embed src="codnar/data/sunlight/ruby-min.js" type="x-codnar/file"/&gt;
+  Sunlight.globalOptions.lineNumbers = false;
+  Sunlight.highlightAll();